Senior AI Platform Engineer, Atlas AI jobs in United States
cer-icon
Apply on Employer Site
company-logo

Cognite · 11 hours ago

Senior AI Platform Engineer, Atlas AI

Cognite operates at the forefront of industrial digitalization, building AI and data solutions that solve some of the world’s hardest, highest-impact problems. The AI Platform Engineer will engineer, build, and operate the production-grade, multi-cloud platform that enables internal and partner teams to build, deploy, and manage industrial AI agents.

Artificial Intelligence (AI)Industrial AutomationInformation TechnologyMachine LearningSaaS
check
H1B Sponsor Likelynote
Hiring Manager
Tereza Behuncikova
linkedin

Responsibilities

Design, build, and maintain the core Python SDKs and services for the Atlas AI platform. Create clean abstractions that empower Solution Engineers to easily define and test agents and workflows
Build the core agentic runtime, ensuring it is scalable, meets its SLOs, and can reliably manage the state, orchestration, and execution of industrial agents
Develop a robust, governed, and secure framework for AI agent tool-use. Engineer the platform components that allow solution engineers to safely add new tools (e.g., API calls, database queries) and that manage the secure execution, monitoring, and access control for those tools
Manage the LLM serving layer, including deploying and optimizing models for low-latency/high-throughput inference. Build and maintain model routing logic to select the most appropriate model (e.g., performance vs. cost) for a given task
Implement evaluation and observability for all AI services. Create standardized frameworks for systematically evaluating the performance, accuracy, cost, and safety of LLMs and agentic workflows. Drive the implementation of robust, automated testing strategies for LLM-based systems
Own the full development lifecycle for services in a production SaaS environment. This includes establishing automated code coverage goals, rigorous code reviews, defining SLOs, participating in on-call rotations, and ensuring a fast and effective incident response process
Work closely with the Lead Architect to translate the technical vision into implemented, production-grade services. Act as a key partner for the Solution Engineers (your internal customers) to understand their needs and abstract common patterns into reusable, robust platform components
Stay up to date on the latest developments in the field, and mentor junior developers

Qualification

PythonMLOpsKubernetesSaaS DevelopmentAI/ML ModelsInfrastructure as CodeAPI DesignCommunication SkillsMentoring

Required

Bachelor's or Master's degree in Computer Science or a related field, or equivalent practical experience
8+ years of professional experience in backend software engineering, platform engineering, or MLOps, with a proven track record of architecting and operating complex systems at scale
2+ years of hands-on experience building applications or platforms on top of AI/ML models or LLMs
Expert-level proficiency in Python and a strong background in software architecture, robust API design, and building maintainable, well-documented SDKs for other developers
Hands-on experience with Kubernetes (K8s) and building services on managed PaaS in a multi-cloud environment (AWS, Azure, GCP). Strong understanding of Infrastructure as Code (e.g., Terraform)
Proven experience building and operating production-grade SaaS software. Understanding of the full development life cycle, including CI/CD, monitoring, telemetry, and on-call incident response
Practical experience with LLM orchestration frameworks (Bedrock, Vertex, Semantic Kernel, LangChain)
Strong verbal and written communication skills, with the ability to articulate complex technical designs and decisions clearly

Preferred

Hands-on experience deploying and managing LLMs in production using high-performance serving frameworks
Experience with MLOps/LLMOps tools for tracing, monitoring, and evaluating LLM applications (LangSmith, Arize, Phoenix, or equivalent)
Experience with RAG Infrastructure, embedding generation pipelines, vector database integrations, and high-performance vector similarity search APIs

Company

Cognite develops an industrial IoT data platform that enables digital transformation of heavy-asset industries.

H1B Sponsorship

Cognite has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
2024 (4)
2023 (2)
2022 (2)
2021 (5)
2020 (5)

Funding

Current Stage
Late Stage
Total Funding
$338.21M
Key Investors
Saudi AramcoTCVAccel
2022-02-02Secondary Market· $113M
2021-05-19Series B· $150M
2020-10-27Series A· $75M

Leadership Team

leader-logo
Girish Rishi
Chair and CEO
linkedin
leader-logo
Fredrik Anfinsen
Co-Founder and Director of Product Development
linkedin
Company data provided by crunchbase