SIGN IN
Data Scientist jobs in United States
cer-icon
Apply on Employer Site
company-logo

Galent · 3 hours ago

Data Scientist

Galent is a company focused on data-driven solutions, and they are seeking a Data Scientist to work on model reproduction and feature engineering. The role involves developing and validating predictive models, collaborating with data engineers, and ensuring model performance aligns with client requirements.
Computer Software
Hiring Manager
Ruban Alwin
linkedin

Responsibilities

Rebuild and port existing client Python based models into customer’s Databricks platform
Develop, train, and validate predictive models using Python, PySpark, and ML frameworks such as scikitlearn, XGBoost, and Spark MLlib
Develop, validate and reproduce feature engineering logic and ensure parity with client models
Train, retain, validate, and benchmark model performance using customer provided datasets while maintain performance parity with baseline models
Work with data engineers to define feature requirements and ensure datasets support model needs
Perform model diagnostics, bias checks, stability checks, and accuracy assessments
Prepare model documentation, validation summaries, and stakeholder ready insights
Support scoring pipeline design and ensure reproducibility across Dev/QA/Prod
Collaborate with compliance and platform teams to ensure adherence to governance
Perform model diagnostics, hyperparameter tuning, and stability analysis
Evaluate model performance across population segments and time periods
Work with platform and engineering teams to support scoring pipeline deployment across Dev/QA/Prod

Qualification

PythonMachine LearningDatabricksFeature EngineeringPySparkSQLCloud PlatformsStatistical KnowledgeDocumentationCollaboration

Required

4–6 years of experience in applied machine learning or data science
Strong hands-on experience with Python, scikit-learn, XGBoost, LightGBM, CatBoost, or similar libraries
Experience developing ML models in Databricks with Python or PySpark
Strong knowledge of feature engineering, model training workflows, and evaluation techniques
Experience working with large structured datasets (financial or transactional data preferred)
Ability to write clear documentation and communicate technical results to non-technical stakeholders
4+ years of hands-on experience developing, deploying, and maintaining machine-learning models
Advanced proficiency in Python (NumPy, pandas, scikit-learn, PyTorch or TensorFlow)
Strong statistical and mathematical foundation, including regression, classification probability, optimization, etc
Experience building end-to-end ML pipelines: data ingestion, cleaning, feature engineering, modeling, evaluation, deployment
Experience working within client environments, including adapting to unfamiliar Infrastructure, constraints, and security requirements
Experience with cloud platforms (AWS, Azure, or GCP) and on-prem environments
Advanced SQL ability and experience with big-data tools (Spark, Databricks, Hadoop)

Company

Galent

twitter
company-logo
Galent is an AI-native digital engineering firm at the forefront of the AI revolution, dedicated to delivering unified, enterprise-ready AI solutions that transform businesses and industries.

Funding

Current Stage
Late Stage
Company data provided by crunchbase