Principal Data Scientist (AI)- REMOTE (US) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Hexagon Asset Lifecycle Intelligence · 4 hours ago

Principal Data Scientist (AI)- REMOTE (US)

Hexagon Asset Lifecycle Intelligence is a global leader in digital reality solutions, and they are seeking a hands-on Data Scientist to build predictive models and implement Generative AI features. The role involves developing, deploying, and maintaining ML systems in production environments while collaborating with cross-functional teams to translate business needs into ML solutions.

Software
badNo H1Bnote
Hiring Manager
Kelli Kratzenberg
linkedin

Responsibilities

Build and deploy Generative AI features using foundation models (AWS Bedrock, OpenAI, Anthropic Claude) and RAG architectures with vector databases for compliance document understanding
Design agentic AI systems that autonomously handle compliance workflows, document review, regulatory mapping, and multi-step reasoning tasks
Implement comprehensive LLM evaluation frameworks with automated pipelines, custom metrics, benchmark datasets, and safety guardrails ensuring regulatory compliance
Build end-to-end MLOps pipelines for model training, deployment, monitoring, versioning, and automated retraining with drift detection
Develop predictive models for compliance risk scoring, regulatory change impact, anomaly detection, and time-series forecasting
Write production-quality Python code for data processing, feature engineering, API development (FastAPI/Flask), and ETL/ELT workflows
Lead A/B experiments and product analytics to measure AI feature impact and drive data-driven decision-making
Create explainability frameworks (SHAP/LIME) and monitoring dashboards ensuring transparency and regulatory adherence
Collaborate with cross-functional teams to translate business needs into ML solutions and communicate insights to stakeholders

Qualification

PythonMachine Learning & NLPGenerative AI & LLMsMLOps & ModelOpsCloud & AWSSQLStatistics & ExperimentationVisualizationLLM EvaluationCommunication skillsWork independently

Required

7+ years in data science, ML engineering, or related roles
3+ years building NLP/generative AI applications and implementing MLOps in production
Bachelor's or Master's degree in Data Science, Computer Science, Statistics, or related field (PhD preferred)
Track record of deploying ML systems processing large-scale datasets with proper monitoring and governance
Python (5+ years): Production-level experience with Pandas, NumPy, scikit-learn, XGBoost, TensorFlow/PyTorch, Hugging Face Transformers, FastAPI/Flask, MLflow, and pytest
SQL: Advanced proficiency with complex queries, window functions, and optimization
Machine Learning & NLP: Strong foundation in supervised/unsupervised learning, deep learning, document understanding, text classification, and semantic analysis
Generative AI & LLMs: Hands-on experience with foundation models (GPT, Claude, Llama), prompt engineering, RAG architectures, and vector databases (Pinecone, Weaviate, Chroma)
MLOps & ModelOps: End-to-end experience with ML pipelines, experiment tracking (MLflow, W&B), model versioning, feature stores, drift detection, CI/CD for ML, and Docker containerization
LLM Evaluation: Experience with evaluation frameworks (RAGAS, DeepEval), custom metrics, benchmark datasets, and human-in-the-loop validation
Cloud & AWS: Experience with AWS services including SageMaker, Bedrock, S3, Lambda, EC2, and CloudWatch
Statistics & Experimentation: Strong foundation in statistics, A/B testing, causal inference, and experimental design
Visualization: Proficiency with Tableau, Power BI, or Python visualization libraries

Preferred

Experience with agentic AI frameworks (LangGraph, LangChain, AutoGen, CrewAI)
Knowledge of Life Sciences/regulated industries (FDA, EMA, ISO, GxP) and compliance management systems
Familiarity with big data tools (Spark, Databricks, Snowflake), orchestration (Airflow, Kubeflow), and monitoring tools (Datadog, Prometheus)
Experience with LLM fine-tuning, document processing libraries, multi-modal AI, or distributed training
Understanding of ML governance, bias detection, model risk management, and data privacy regulations (GDPR, CCPA, HIPAA)
Experience working in agile environments with Jira
AWS ML certifications or similar credentials

Benefits

At Hexagon, if you can see it, you can do it.
Hexagon’s Asset Lifecyle Intelligence division puts their trust in you so that you can bring your ideas to life.
We have emerged as one of the most engaged and enabled workplaces.
We are committed to creating an environment that is truly supportive by providing the resources you need to fully support your ambitions, no matter who you are or where you are in the world.
In the recently concluded workplace effectiveness survey by Korn Ferry, a global HR advisory firm, Hexagon, Asset Lifecycle Intelligence division has emerged as one of the most Engaged and Enabled workplaces, when compared to similar organizations that Korn Ferry partners with.
Everyone is welcome.
At Hexagon, we believe that diverse and inclusive teams are critical to the success of our people and our business.
Everyone is welcome—as an inclusive workplace, we do not discriminate.
In fact, we embrace differences and are fully committed to creating equal opportunities, an inclusive environment, and fairness for all.
Respect is the cornerstone of how we operate, so speak up and be yourself.
You are valued here.

Company

Hexagon Asset Lifecycle Intelligence

company-logo
Hexagon’s Asset Lifecycle Intelligence division helps clients design, construct and operate more profitable, safe and sustainable industrial facilities.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Melanie Eakes
Executive Vice President - Chief Technology Officer
linkedin
leader-logo
Ismo Piirainen
Director, Pre-Sales
linkedin
Company data provided by crunchbase