Velocity Tech Inc · 2 days ago

Senior Machine Learning Engineer

United States

Contract

Remote

Mid Level

3+ years exp

IT Services and IT Consulting

Hiring Manager

Hima T

Responsibilities

LLM-Optimized MLOps Infrastructure: Design and implement MLOps infrastructure on AWS tailored for LLMs, leveraging services like SageMaker, EC2 (with GPU instances), S3, ECS/EKS, Lambda, and more.

LLM Deployment Pipelines: Build and manage CI/CD pipelines specifically for LLM deployment, addressing unique challenges like model size, inference optimization, and versioning.

LLMOps Practices: Implement LLMOps best practices for monitoring model performance, drift detection, prompt management, and feedback loops for continuous improvement.

RESTful API Development: Design and develop RESTful APIs to expose LLM capabilities to other applications and services, ensuring scalability, security, and optimal performance.

Model Optimization: Apply techniques like quantization, distillation, and pruning to optimize LLMs for efficient inference on AWS infrastructure.

Monitoring and Observability: Establish comprehensive monitoring and alerting mechanisms to track LLM performance, latency, resource utilization, and potential biases.

Prompt Engineering and Management: Develop strategies for prompt engineering and management to enhance LLM outputs and ensure consistency and safety.

Collaboration: Work closely with data scientists, researchers, and software engineers to integrate LLMs into production systems effectively.

Cost Optimization: Continuously optimize LLMOps processes and infrastructure for cost-efficiency while maintaining high performance and reliability.

Qualifications

MLOps, LLMs, AWS, SageMaker, EC2, S3, ECS, EKS, Lambda, API Gateway, Python, Terraform, CloudFormation, REST API, Flask, FastAPI, Hugging Face Transformers, Monitoring, Logging, Prometheus, Grafana, CloudWatch, Docker, Kubernetes, Infrastructure-as-Code, Problem-Solving, Communication, Collaboration

Required

3+ years of experience in MLOps or a related field, with hands-on experience in deploying and managing LLMs.

Strong proficiency in AWS services relevant to MLOps and LLMs, including SageMaker, EC2 (with GPU instances), S3, ECS/EKS, Lambda, and API Gateway.

Deep understanding of LLM architectures (e.g., Transformers), training techniques, and inference optimization strategies.

Proficiency in Python and experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation), REST API frameworks (e.g., Flask, FastAPI), and LLM libraries (e.g., Hugging Face Transformers).

Familiarity with monitoring and logging tools for LLMs, such as Prometheus, Grafana, and CloudWatch.

Experience with Docker and container orchestration (e.g., Kubernetes, ECS) for LLM deployment.

Excellent problem-solving and troubleshooting skills in the context of LLMs and MLOps.

Strong communication and collaboration skills to effectively work with cross-functional teams.

Company

Velocity Tech Inc

We are an IT service augmentation and consulting company that provides reliable, cost-effective, high-quality IT and software engineering services to our clients.

Bedford, Texas

11-50 employees

https://www.velocitytechinc.com/

Funding

Current Stage

Early Stage

Company data provided by Crunchbase
