Harnham · 18 hours ago
Machine Learning Infrastructure Engineer
Harnham is an early-age cutting-edge organization seeking a Machine Learning Engineer to drive infrastructure scalability across state-of-the-art GenAI products. The role involves building and scaling ML infrastructure, optimizing models for performance, and architecting deployment strategies to support growth and complexity.
Responsibilities
Build and scale ML infrastructure capable of serving high-volume, low-latency model inference
Optimize models and pipelines for performance, cost, and reliability in production environments
Productionize research and experimental models into scalable, maintainable ML systems
Architect infrastructure and deployment strategies that support continuous growth and evolving model complexity
Drive infrastructure development to handle a variety of models
Qualification
Required
Experience in ML platform infrastructure and deployment, including scaling training / inference, concurrency, queuing, back pressure, orchestration
Design and operate high-performance model serving systems with proven ownership of system stability, not just in deployment
Engineer solutions that efficiently manage parallel inference workloads at scale
Tune end-to-end serving pipelines to maximize responsiveness and overall system capacity
Python
AWS native stack
Docker, containers, SageMaker, Kubernetes
Company
Harnham
Harnham has actively chosen to focus on Data and Analytics.
H1B Sponsorship
Harnham has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2024 (1)
Funding
Current Stage
Growth StageTotal Funding
unknownKey Investors
BGF Ventures
2022-05-01Seed
Recent News
Company data provided by crunchbase