Galent · 17 hours ago
Data Scientist – Model Reproduction & Validation
Galent is seeking a Data Scientist with a primary focus on model reproduction, feature engineering logic, and performance validation. The role involves ensuring alignment and developing machine learning models, requiring strong technical skills and the ability to communicate results to non-technical stakeholders.
Responsibilities
Model reproduction, feature engineering logic, performance validation, and ensuring alignment
Qualification
Required
4–6 years of experience in applied machine learning or data science
Strong hands-on experience with Python, scikit-learn, XGBoost, LightGBM, CatBoost, or similar libraries
Experience developing ML models in Databricks with Python or PySpark
Strong knowledge of feature engineering, model training workflows, and evaluation techniques
Experience working with large, structured datasets (financial or transactional data preferred)
Ability to write clear documentation and communicate technical results to non-technical stakeholders
4+ years of hands-on experience developing, deploying, and maintaining machine-learning models
Advanced proficiency in Python (NumPy, pandas, scikit-learn, PyTorch or TensorFlow)
Strong statistical and mathematical foundation, including regression, classification probability, optimization, etc
Experience building end-to-end ML pipelines: data ingestion, cleaning, feature engineering, modeling, evaluation, deployment
Experience working within client environments, including adapting to unfamiliar Infrastructure, constraints, and security requirements
Experience with cloud platforms (AWS, Azure, or GCP) and on-prem environments
Advanced SQL ability and experience with big-data tools (Spark, Databricks, Hadoop)
Company
Galent
Galent is an AI-native digital engineering firm at the forefront of the AI revolution, dedicated to delivering unified, enterprise-ready AI solutions that transform businesses and industries.
Funding
Current Stage
Late StageCompany data provided by crunchbase