This job has closed.

colombianinsider.com · 2 hours ago

Co-Founder / Lead AI Engineer (Real-Time Audio Processing for Speech)

Colombian Insider is building a commercial-grade, fully offline, real-time voice conversion communication system for Windows. They are seeking a Co-Founder / Lead AI Engineer to own the entire AI lifecycle, from data curation and training to optimizing the C++ inference engine, with a commitment to delivering the product within 6-12 months.

Staffing & Recruiting

Responsibilities

Own the entire AI lifecycle from data curation and training to optimizing the C++ inference engine
Implement the C++ inference wrapper using the ONNX Runtime C++ API
Clean and curate speech datasets and run automated QA metrics
Optimize models for inference speed on older GPUs

Qualifications

PyTorch, C++ (C++17/20), Inference Optimization, Speech Synthesis, Data Curation, Windows Environment, Financial Stability, Partner Mentality, DirectML Experience, DSP Knowledge, Audio Quality Assessment, Version Control, Reliability, Communication, Extreme Ownership

Required

Core AI/ML: Deep experience with PyTorch and speech synthesis architectures. You must have practical experience with RVC (Retrieval-based Voice Conversion), Soft-VC, HiFi-GAN, and content encoders such as ContentVec or HuBERT
Inference Optimization: Proven experience converting PyTorch models to ONNX. You must understand Static INT8 Quantization and how to optimize models for inference speed on older GPUs (avoiding dynamic shapes, managing receptive fields)
C++ Implementation: Proficiency in Modern C++ (C++17/20). You will not just train models in Python; you must implement the C++ inference wrapper using the ONNX Runtime C++ API
Data Science: Experience cleaning and curating speech datasets (e.g., L2-ARCTIC, LJSpeech) and running automated QA metrics (WER, MOS/NISQA)
Environment: Must have access to a Windows environment (native or dual-boot) to test DirectML compatibility
Financial Runway: You have the financial stability to work without a paycheck until we reach revenue
Reliability & Grit: You are a finisher. You do not flake when technical challenges arise. You have a track record of seeing projects through to completion
Communication: You are responsive and communicative. As a remote partner, 'going dark' for days is not an option
Extreme Ownership: You don't wait for tickets. You understand the high-level goal (Low Latency + Naturalness) and you proactively solve problems
Partner Mentality: You are not looking for a boss; you are looking for a business partner. You care about the product's success as much as the code quality

Preferred

DirectML Experience: Specific experience optimizing ONNX models for Windows DirectML so that the same model runs on Intel, AMD, and NVIDIA GPUs
DSP Knowledge: Understanding of digital signal processing (circular buffers, FFTs, overlap-add, crossfading) to better collaborate with the Audio Systems Engineer
Audio Quality Assessment: Familiarity with objective audio metrics (PESQ, STOI, NISQA) and how to automate them
Version Control: Disciplined use of Git (branching strategies, pull requests)

Benefits

Equity (Ownership)
Revenue Share

Company

colombianinsider.com


Funding

Current Stage
Early Stage
Company data provided by Crunchbase