Cohere · 3 hours ago
Audio Inference Engineer, Model Efficiency
Cohere is a company dedicated to scaling intelligence to serve humanity by training and deploying frontier AI models. The Audio Inference Engineer will focus on optimizing audio inference serving efficiency and enhancing core metrics through collaboration with various teams.
Artificial Intelligence (AI)Foundational AIGenerative AIMachine LearningNatural Language Processing
Responsibilities
Build reliable machine learning systems and optimize audio inference serving efficiency using innovative techniques
Advance core audio model serving metrics, including latency, throughput, and quality
Identify bottlenecks and deliver creative solutions for audio processing and streaming workloads
Collaborate closely with both the training and serving infrastructure teams to ensure seamless integration between model development and deployment
Qualification
Required
Significant experience developing high-performance audio or machine learning inference systems
Proficiency with programming languages such as C++ and Python
Hands-on experience with deep learning models for audio, speech, or language applications
A bias for action and a strong results-oriented mindset
Preferred
GPU programming, low-level system optimization, model parallelization techniques over multiple GPUs
Experience with duplex real-time streaming architectures
Internals of machine learning frameworks for audio (such as PyTorch, TensorFlow, or specialized audio libraries)
Experience with inference framework like vLLM, SGLang, Tensort-LLM, or custom distributed inference systems
Sequence modeling (e.g., transformers for audio/speech) and end-to-end audio pipeline optimization
Benefits
An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
6 weeks of vacation (30 working days!)
Company
Cohere
Cohere is an enterprise AI firm developing secure and private AI technology to address real-world business challenges.
H1B Sponsorship
Cohere has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (11)
2024 (14)
2023 (13)
2022 (5)
2021 (2)
Funding
Current Stage
Late StageTotal Funding
$1.71BKey Investors
Government of CanadaTiger Global ManagementIndex Ventures
2025-09-24Series D· $100M
2025-08-14Series D· $500M
2025-06-17Secondary Market
Recent News
Beyond Bylines
2026-01-11
2026-01-06
Crunchbase News
2026-01-06
Company data provided by crunchbase