Apply on Employer Site

Hexagon Asset Lifecycle Intelligence · 4 hours ago

Principal Data Scientist (AI)- REMOTE (US)

United States

Full-time

Remote

Senior Level, Lead/Staff

7+ years exp

Hexagon Asset Lifecycle Intelligence is a global leader in digital reality solutions, and they are seeking a hands-on Data Scientist to build predictive models and implement Generative AI features. The role involves developing, deploying, and maintaining ML systems in production environments while collaborating with cross-functional teams to translate business needs into ML solutions.

Software

No H1B

Hiring Manager

Kelli Kratzenberg

Responsibilities

Build and deploy Generative AI features using foundation models (AWS Bedrock, OpenAI, Anthropic Claude) and RAG architectures with vector databases for compliance document understanding

Design agentic AI systems that autonomously handle compliance workflows, document review, regulatory mapping, and multi-step reasoning tasks

Implement comprehensive LLM evaluation frameworks with automated pipelines, custom metrics, benchmark datasets, and safety guardrails ensuring regulatory compliance

Build end-to-end MLOps pipelines for model training, deployment, monitoring, versioning, and automated retraining with drift detection

Develop predictive models for compliance risk scoring, regulatory change impact, anomaly detection, and time-series forecasting

Write production-quality Python code for data processing, feature engineering, API development (FastAPI/Flask), and ETL/ELT workflows

Lead A/B experiments and product analytics to measure AI feature impact and drive data-driven decision-making

Create explainability frameworks (SHAP/LIME) and monitoring dashboards ensuring transparency and regulatory adherence

Collaborate with cross-functional teams to translate business needs into ML solutions and communicate insights to stakeholders

Qualification

PythonMachine Learning & NLPGenerative AI & LLMsMLOps & ModelOpsCloud & AWSSQLStatistics & ExperimentationVisualizationLLM EvaluationCommunication skillsWork independently

Required

7+ years in data science, ML engineering, or related roles

3+ years building NLP/generative AI applications and implementing MLOps in production

Bachelor's or Master's degree in Data Science, Computer Science, Statistics, or related field (PhD preferred)

Track record of deploying ML systems processing large-scale datasets with proper monitoring and governance

Python (5+ years): Production-level experience with Pandas, NumPy, scikit-learn, XGBoost, TensorFlow/PyTorch, Hugging Face Transformers, FastAPI/Flask, MLflow, and pytest

SQL: Advanced proficiency with complex queries, window functions, and optimization

Machine Learning & NLP: Strong foundation in supervised/unsupervised learning, deep learning, document understanding, text classification, and semantic analysis

Generative AI & LLMs: Hands-on experience with foundation models (GPT, Claude, Llama), prompt engineering, RAG architectures, and vector databases (Pinecone, Weaviate, Chroma)

MLOps & ModelOps: End-to-end experience with ML pipelines, experiment tracking (MLflow, W&B), model versioning, feature stores, drift detection, CI/CD for ML, and Docker containerization

LLM Evaluation: Experience with evaluation frameworks (RAGAS, DeepEval), custom metrics, benchmark datasets, and human-in-the-loop validation

Cloud & AWS: Experience with AWS services including SageMaker, Bedrock, S3, Lambda, EC2, and CloudWatch

Statistics & Experimentation: Strong foundation in statistics, A/B testing, causal inference, and experimental design

Visualization: Proficiency with Tableau, Power BI, or Python visualization libraries

Preferred

Experience with agentic AI frameworks (LangGraph, LangChain, AutoGen, CrewAI)

Knowledge of Life Sciences/regulated industries (FDA, EMA, ISO, GxP) and compliance management systems

Familiarity with big data tools (Spark, Databricks, Snowflake), orchestration (Airflow, Kubeflow), and monitoring tools (Datadog, Prometheus)

Experience with LLM fine-tuning, document processing libraries, multi-modal AI, or distributed training

Understanding of ML governance, bias detection, model risk management, and data privacy regulations (GDPR, CCPA, HIPAA)

Experience working in agile environments with Jira

AWS ML certifications or similar credentials

Benefits

At Hexagon, if you can see it, you can do it.

Hexagon’s Asset Lifecyle Intelligence division puts their trust in you so that you can bring your ideas to life.

We have emerged as one of the most engaged and enabled workplaces.

We are committed to creating an environment that is truly supportive by providing the resources you need to fully support your ambitions, no matter who you are or where you are in the world.

In the recently concluded workplace effectiveness survey by Korn Ferry, a global HR advisory firm, Hexagon, Asset Lifecycle Intelligence division has emerged as one of the most Engaged and Enabled workplaces, when compared to similar organizations that Korn Ferry partners with.

Everyone is welcome.

At Hexagon, we believe that diverse and inclusive teams are critical to the success of our people and our business.

Everyone is welcome—as an inclusive workplace, we do not discriminate.

In fact, we embrace differences and are fully committed to creating equal opportunities, an inclusive environment, and fairness for all.

Respect is the cornerstone of how we operate, so speak up and be yourself.

You are valued here.

Company

Hexagon Asset Lifecycle Intelligence

Glassdoor3.9

Hexagon’s Asset Lifecycle Intelligence division helps clients design, construct and operate more profitable, safe and sustainable industrial facilities.

Founded in 2017

Madison, Alabama, USA

10001+ employees

https://hexagonppm.com/

Funding

Current Stage

Late Stage

Leadership Team

Melanie Eakes

Executive Vice President - Chief Technology Officer

Ismo Piirainen

Director, Pre-Sales

Company data provided by crunchbase