AI/ML Software Engineer – Agentic AI & Production Systems jobs in United States

Apexon · 2 days ago

AI/ML Software Engineer – Agentic AI & Production Systems

Apexon is seeking an AI/ML Software Engineer to launch and implement GenAI agentic solutions aimed at improving productivity in large-scale production environments. The role involves developing AI solutions that can diagnose and address production runtime challenges to enhance operational efficiency.

Information Technology & Services
No H1B
Hiring Manager
Madhusudhan K

Responsibilities

Launch and implement GenAI agentic solutions that reduce the risk and cost of managing large-scale production environments of varying complexity
Address production runtime challenges by developing agentic AI solutions that can diagnose, reason, and act in production environments, improving productivity and resolving production support issues

Qualifications

Python, ML systems design, Large Language Models, Applied statistics, Cloud infrastructure, Analytical problem-solving, Collaboration

Required

5+ years of software development in one or more languages (Python, C/C++, Go, Java); strong hands-on experience building and maintaining large-scale Python applications preferred
3+ years designing, architecting, testing, and launching production ML systems, including model deployment/serving, evaluation and monitoring, data processing pipelines, and model fine-tuning workflows
Practical experience with Large Language Models (LLMs): API integration, prompt engineering, fine-tuning/adaptation, and building applications using RAG and tool-using agents (vector retrieval, function calling, secure tool execution)
Understanding of different LLMs, both commercial and open source, and their capabilities (e.g., OpenAI, Gemini, Llama, Qwen, Claude)
Solid grasp of applied statistics, core ML concepts, algorithms, and data structures to deliver efficient and reliable solutions
Strong analytical problem-solving, ownership, and urgency; ability to communicate complex ideas simply and collaborate effectively across global teams with a focus on measurable business impact

Preferred

Proficiency building and operating on cloud infrastructure (ideally AWS), including containerized services (ECS/EKS), serverless (Lambda), data services (S3, DynamoDB, Redshift), orchestration (Step Functions), model serving (SageMaker), and infra-as-code (Terraform/CloudFormation)

Company

Apexon is a digital-first technology services firm, accelerating business transformation and delivering human-centric digital experiences.

Funding

Current Stage: Late Stage

Leadership Team

Radha Krishnan
Chairman of the Board
Shalin Shah
Chief Business Development Officer
Company data provided by Crunchbase