Apexon · 2 days ago
AI/ML Software Engineer – Agentic AI & Production Systems
Apexon is seeking an AI/ML Software Engineer to launch and implement GenAI agentic solutions aimed at improving productivity in large-scale production environments. The role involves developing AI solutions that can diagnose and address production runtime challenges to enhance operational efficiency.
Responsibilities
You will be responsible for launching and implementing GenAI agentic solutions aimed at reducing the risk and cost of managing large-scale production environments with varying complexities
You will address various production runtime challenges by developing agentic AI solutions that can diagnose, reason, and take actions in production environments to improve productivity and address issues related to production support
Qualification
Required
5+ years of software development in one or more languages (Python, C/C++, Go, Java); strong hands-on experience building and maintaining large-scale Python applications preferred
3+ years designing, architecting, testing, and launching production ML systems, including model deployment/serving, evaluation and monitoring, data processing pipelines, and model fine-tuning workflows
Practical experience with Large Language Models (LLMs): API integration, prompt engineering, fine-tuning/adaptation, and building applications using RAG and tool-using agents (vector retrieval, function calling, secure tool execution)
Understanding of different LLMs, both commercial and open source, and their capabilities (e.g., OpenAI, Gemini, Llama, Qwen, Claude)
Solid grasp of applied statistics, core ML concepts, algorithms, and data structures to deliver efficient and reliable solutions
Strong analytical problem-solving, ownership, and urgency; ability to communicate complex ideas simply and collaborate effectively across global teams with a focus on measurable business impact
Preferred
Proficiency building and operating on cloud infrastructure (ideally AWS), including containerized services (ECS/EKS), serverless (Lambda), data services (S3, DynamoDB, Redshift), orchestration (Step Functions), model serving (SageMaker), and infra-as-code (Terraform/CloudFormation)
Company
Apexon
Apexon is a digital-first technology services firm, accelerating business transformation and delivering human-centric digital experiences.
Funding
Current Stage
Late StageRecent News
2025-11-19
2025-10-15
Company data provided by crunchbase