Machine Learning Operations Engineer @ NAVA Software Solutions | Jobright.ai
29 applicants · Posted by Agency

NAVA Software Solutions · 3 days ago

Machine Learning Operations Engineer

Cloud Infrastructure · Information Technology
H1B Sponsorship
Growth Opportunities

Responsibilities

LLM-Optimized MLOps Infrastructure: Design and implement MLOps infrastructure on AWS tailored for LLMs, leveraging services like SageMaker, EC2 (with GPU instances), S3, ECS/EKS, Lambda, and more.
LLM Deployment Pipelines: Build and manage CI/CD pipelines specifically for LLM deployment, addressing unique challenges like model size, inference optimization, and versioning.
LLMOps Practices: Implement LLMOps best practices for monitoring model performance, drift detection, prompt management, and feedback loops for continuous improvement.
RESTful API Development: Design and develop RESTful APIs to expose LLM capabilities to other applications and services, ensuring scalability, security, and optimal performance.
Model Optimization: Apply techniques like quantization, distillation, and pruning to optimize LLMs for efficient inference on AWS infrastructure.
Monitoring and Observability: Establish comprehensive monitoring and alerting mechanisms to track LLM performance, latency, resource utilization, and potential biases.
Prompt Engineering and Management: Develop strategies for prompt engineering and management to enhance LLM outputs and ensure consistency and safety.
Collaboration: Work closely with data scientists, researchers, and software engineers to integrate LLMs into production systems effectively.
Cost Optimization: Continuously optimize LLMOps processes and infrastructure for cost-efficiency while maintaining high performance and reliability.

Qualifications

MLOps, LLMs, AWS, SageMaker, EC2, S3, ECS, EKS, Lambda, API Gateway, Python, Terraform, CloudFormation, REST API, Flask, FastAPI, Hugging Face Transformers, Monitoring, Logging, Prometheus, Grafana, CloudWatch, Docker, Kubernetes, Infrastructure-as-Code, Problem-Solving, Communication, Collaboration

Required

3+ years of experience in MLOps or a related field, with hands-on experience in deploying and managing LLMs.
Strong proficiency in AWS services relevant to MLOps and LLMs, including SageMaker, EC2 (with GPU instances), S3, ECS/EKS, Lambda, and API Gateway.
Deep understanding of LLM architectures (e.g., Transformers), training techniques, and inference optimization strategies.
Proficiency in Python and experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation), REST API frameworks (e.g., Flask, FastAPI), and LLM libraries (e.g., Hugging Face Transformers).
Familiarity with monitoring and logging tools for LLMs, such as Prometheus, Grafana, and CloudWatch.
Experience with Docker and container orchestration (e.g., Kubernetes, ECS) for LLM deployment.
Excellent problem-solving and troubleshooting skills in the context of LLMs and MLOps.
Strong communication and collaboration skills to work effectively with cross-functional teams.

Company

NAVA Software Solutions

NAVA Software Solutions specializes in application development, IT staff augmentation, cloud infrastructure support, and training solutions.

H1B Sponsorship

NAVA Software Solutions has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2023 (3)
2022 (4)
2021 (6)
2020 (7)

Funding

Current Stage
Growth Stage

Leadership Team

Anupam Dayal, Sr. Partnerships & Program Manager
Kim Lewis, Sr. Product Owner & Partnership Manager

Company data provided by Crunchbase
