SRS Consulting Inc · 2 hours ago
AI/ML Engineer - Direct Client
SRS Consulting Inc is seeking an AI/ML Platform Engineer to help build and scale their next-generation AI platform. This role focuses on enabling traditional AI and LLM-based systems, integrating models with enterprise tools, and collaborating with cross-functional teams to deploy machine-learning models that drive meaningful outcomes in behavioral health tech and EHR platforms.
Responsibilities
Design and implement MCP servers that expose internal data/services to LLMs
Build secure, structured endpoints for model context access
Integrate MCP services with model inference APIs
Implement and operate a vector search engine
Deploy models into production (cloud, on-premise or hybrid) and integrate with upstream/downstream systems (EHR modules, APIs, micro-services, dashboards)
Monitor model performance in live settings (accuracy, drift, bias, fairness, reproducibility), and iterate on models to maintain or improve reliability and relevance
Build/maintain machine learning pipelines and work with the data platform team to connect AI workloads to core datasets
Ensure security, permissions and monitoring of AI systems
Implement cost monitoring and usage tracking for AI workloads across internal teams
Partner with cross-functional stakeholders (data scientists, data engineers, SDEs) to deploy these capabilities
Stay informed about emerging AI/ML techniques, tools and best practices (including AI ethics, bias mitigation, interpretability), and proactively bring forward improvements or innovation
Contribute to a culture of continuous improvement, knowledge-sharing and mentoring of junior team members
Qualification
Required
Proficiency in Python (or analogous language) and strong familiarity with ML frameworks/libraries (ex: TensorFlow, PyTorch, scikit-learn)
Experience building APIs, services or microservices
Knowledge of vector databases or search systems
Experience with LLM application patterns: RAG, embeddings, prompt orchestration and tool calling
Experience with basic MLOps practices: model deployment, monitoring, pipeline automation, CI/CD
Demonstrated ability to deploy models into production or near-production environments (cloud environments like AWS, Azure, GCP or containerised/micro-services infrastructure). GCP experience is strongly preferred
A collaborative mindset, dependable execution, drive to reflect and improve, and humility to ask questions and learn
Bachelor's degree (or equivalent) in Computer Science, Data Science, Statistics, Engineering or a related field
5+ years of platform/infrastructure engineering experience, with demonstrable recent work on LLM-based systems
Preferred
Experience in healthcare, behavioral health, EHR systems or regulated industries
Familiarity with MLOps practices: CI/CD for models, model monitoring, drift detection, model governance
Experience with NLP (clinical text) or computer vision (imaging) tasks
Familiarity with cloud-native services for ML (e.g., AWS SageMaker, Azure ML, GCP AI Platform) and related infrastructure (Docker, Kubernetes)
Awareness of AI ethics, bias/fairness issues, model interpretability techniques
Experience mentoring others or leading small technical initiatives
Company
SRS Consulting Inc
SRS Consulting Inc.
H1B Sponsorship
SRS Consulting Inc has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (81)
2024 (92)
2023 (135)
2022 (147)
2021 (168)
2020 (244)
Funding
Current Stage
Late StageCompany data provided by crunchbase