Search Services · 1 week ago
AI Systems Engineer
Search Services is a global digital transformation and technology solutions leader founded in 2009, partnering with over 160 organizations worldwide. They are seeking an AI Systems Engineer to design and optimize real-time, multimodal AI systems that integrate speech, vision, and large language models, enabling responsive AI experiences.
AccountingFinanceRecruitingStaffing Agency
Responsibilities
Architect ultra-low-latency AI systems integrating speech-to-text, language models, text-to-speech, and computer vision
Develop real-time streaming and inference pipelines using WebRTC, websockets, and gRPC
Design and integrate conversational flows with grounding, emotional tone, and memory
Deploy and optimize GPU workloads at scale using Docker, Kubernetes, and Triton
Build hybrid agent architectures combining LLMs, vision models, and custom logic
Train, fine-tune, and optimize AI models across speech, vision, and transformer domains
Develop retrieval-augmented generation (RAG) pipelines and multi-agent orchestration
Write clean, modular, production-grade code that ships fast and scales elegantly
Collaborate cross-functionally to build living, interactive AI products
Qualification
Required
Expertise in speech AI, including streaming STT/TTS pipelines and latency tuning
Experience integrating LLMs for conversational AI, prompt design, and guardrails
Strong background in real-time engineering: WebRTC, sockets, gRPC, GPU streaming
Proficiency in computer vision frameworks such as YOLO, SAM, and object tracking
Hands-on experience with AI orchestration tools such as LangChain, Langflow, or CrewAI
Advanced skills in ML infrastructure (Docker, Kubernetes, cloud GPU optimization)
Fluency in Python (PyTorch/TensorFlow), TypeScript/Node, FastAPI, and API design
Strong systems-thinking mindset — able to design agents that act, not just respond
Preferred
Experience with model quantization, distillation, or Triton inference servers
Edge deployment expertise (Jetson, ARM, mobile models)
Background in audio DSP, emotion recognition, or prosody modeling
Experience building agent 'personality engines' or affective AI systems