Audio Inference Engineer, Model Efficiency jobs in United States
cer-icon
Apply on Employer Site
company-logo

Cohere · 3 hours ago

Audio Inference Engineer, Model Efficiency

Cohere is a company dedicated to scaling intelligence to serve humanity by training and deploying frontier AI models. The Audio Inference Engineer will focus on optimizing audio inference serving efficiency and enhancing core metrics through collaboration with various teams.

Artificial Intelligence (AI)Foundational AIGenerative AIMachine LearningNatural Language Processing
check
Comp. & Benefits
check
H1B Sponsor Likelynote

Responsibilities

Build reliable machine learning systems and optimize audio inference serving efficiency using innovative techniques
Advance core audio model serving metrics, including latency, throughput, and quality
Identify bottlenecks and deliver creative solutions for audio processing and streaming workloads
Collaborate closely with both the training and serving infrastructure teams to ensure seamless integration between model development and deployment

Qualification

Audio inference systemsC++PythonDeep learning modelsGPU programmingReal-time streamingMachine learning frameworksInference frameworksSequence modelingBias for actionResults-oriented mindset

Required

Significant experience developing high-performance audio or machine learning inference systems
Proficiency with programming languages such as C++ and Python
Hands-on experience with deep learning models for audio, speech, or language applications
A bias for action and a strong results-oriented mindset

Preferred

GPU programming, low-level system optimization, model parallelization techniques over multiple GPUs
Experience with duplex real-time streaming architectures
Internals of machine learning frameworks for audio (such as PyTorch, TensorFlow, or specialized audio libraries)
Experience with inference framework like vLLM, SGLang, Tensort-LLM, or custom distributed inference systems
Sequence modeling (e.g., transformers for audio/speech) and end-to-end audio pipeline optimization

Benefits

An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
6 weeks of vacation (30 working days!)

Company

Cohere

twittertwittertwitter
company-logo
Cohere is an enterprise AI firm developing secure and private AI technology to address real-world business challenges.

H1B Sponsorship

Cohere has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (11)
2024 (14)
2023 (13)
2022 (5)
2021 (2)

Funding

Current Stage
Late Stage
Total Funding
$1.71B
Key Investors
Government of CanadaTiger Global ManagementIndex Ventures
2025-09-24Series D· $100M
2025-08-14Series D· $500M
2025-06-17Secondary Market

Leadership Team

leader-logo
Aidan Gomez
cofounder + ceo
linkedin
leader-logo
Ivan Zhang
Co-Founder
linkedin
Company data provided by crunchbase