ServiceNow · 3 weeks ago
Staff Machine Learning Engineer
ServiceNow, a global market leader in AI-enhanced technology, is seeking a Staff Machine Learning Engineer to contribute to the design and implementation of infrastructure and platform features that support AI workloads. The role involves collaboration with various teams to ensure efficient performance of GPU clusters and includes responsibilities such as coding, mentoring, and improving software engineering practices.
Agentic AIBusiness Process Automation (BPA)Cloud ManagementEnterprise SoftwareRobotic Process Automation (RPA)SaaS
Responsibilities
Contribute to the design, development and implementation of infrastructure, platform, deployment and observability features that power AI workloads
Collaborate with researchers, AI engineers, and infrastructure teams to ensure our GPU clusters perform efficiently, scale well, and remain reliable
Contribute to the continuous improvement of the SRE practice by turning operational use cases into requirements for software tooling
Contribute to the execution of deployment and support activities for AI/ML developers
Build high-quality, clean, scalable and reusable code by enforcing best practices around software engineering architecture and processes (Code Reviews, Unit testing, etc.)
Work with the product owners to understand detailed requirements and own your code from design, implementation, test automation and delivery of high-quality product to our users
Experience with operating LLMs on NVIDIA GPUs
Be a mentor for colleagues and help promote knowledge-sharing
Qualification
Required
Experience in leveraging or critically thinking about how to integrate AI into work processes, decision-making, or problem-solving. This may include using AI-powered tools, automating workflows, analyzing AI-driven insights, or exploring AI's potential impact on the function or industry
Proficient in prompt engineering and developing LLM based features
Working experience building VoIP systems using SIP protocol and SBC/PBX/PSTN infrastructures
Experience in using AI productivity tools such as Cursor, Windsurf, etc
Exposure with operating LLMs on NVIDIA GPUs
4+ years of development experience with Python, GoLang, Java or similar languages
4+ years of experience operating highly available distributed workloads on Kubernetes following a DevOps approach
Experience with DevOps tooling (e.g. Helm / Ansible / Kubernetes / Prometheus /Splunk/ GitLab CI)
Strong working experience operating distributed systems built on Linux and J2EE
Experience with software-defined networking, infrastructure as code and configuration management
Experience building software for compliance and security in regulated environments
Ability to drive outcome in projects with material technical risk
Asset: 4+ years of experience with infrastructure and platform operations, deployments, SRE, and DevOps with a continued focus on improving Platform health
Benefits
Health plans
Flexible spending accounts
401(k) Plan with company match
ESPP
Matching donations
Flexible time away plan
Family leave programs
Company
ServiceNow
ServiceNow is an AI platform that delivers IT operations, field service management and app engine solutions.
H1B Sponsorship
ServiceNow has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (910)
2024 (876)
2023 (807)
2022 (840)
2021 (447)
2020 (439)
Funding
Current Stage
Public CompanyTotal Funding
$83.7MKey Investors
Sequoia CapitalJMI Equity
2022-12-09Post Ipo Equity
2012-07-29IPO
2012-03-20Private Equity· $10.98M
Recent News
2026-01-07
Company data provided by crunchbase