webAI ยท 6 hours ago
Senior Machine Learning Engineer
webAI is pioneering the future of artificial intelligence by establishing the first distributed AI infrastructure dedicated to personalized AI. They are seeking a Senior Machine Learning Engineer to support Public Sector initiatives focused on building and optimizing production-ready AI systems for secure and distributed environments.
Computer Software
Responsibilities
Productionize AI models from research prototypes into scalable, deployable systems used in real world applications
Develop, fine tune, and optimize models using PyTorch, TensorFlow, or Hugging Face Transformers, adapting both open and closed source models
Implement model optimization techniques such as quantization, pruning, distillation, and hardware specific acceleration
Engineer systems for dynamic model adaptation using low rank adaptation (LoRA), parameter efficient fine tuning (PEFT), and on device inference strategies
Build and maintain Retrieval Augmented Generation (RAG) pipelines, including vector database integration for contextual retrieval
Work with multi modal AI systems across computer vision, audio, and natural language domains
Employ synthetic data generation and digital twinning techniques (GANs, diffusion models, or simulation based) to create robust datasets for edge cases
Develop GPU accelerated and low level system code in C, C++, or Rust for performance critical operations
Optimize model execution for distributed and resource constrained environments, ensuring reliability under variable connectivity conditions
Collaborate cross functionally with Infrastructure, MLOps, and Security teams to deliver secure, compliant, and high performance AI solutions for government partners
Qualification
Required
Active US Security clearance or eligibility and willingness to obtain a US Security clearance
5+ years of experience in applied AI, ML engineering, or production AI systems
Deep proficiency in PyTorch, TensorFlow, or Hugging Face Transformers
Proven experience deploying AI models across cloud, edge, and mobile hardware environments
Expertise in model compression and optimization (quantization, pruning, distillation)
Strong understanding of GPU computing, CUDA, and performance profiling
Experience building RAG pipelines and integrating vector databases (e.g., FAISS, Milvus, Pinecone)
Familiarity with multi modal models and synthetic data generation methods
Low level programming experience in C, C++, or Rust with understanding of computer memory and concurrency
Strong algorithmic and problem solving skills, especially in distributed or constrained compute environments
Preferred
Experience with edge AI, federated learning, or offline inference systems
Familiarity with distributed training frameworks such as DeepSpeed or Ray
Understanding of AI governance and compliance frameworks relevant to public sector deployments
Experience integrating models into large scale distributed systems or microservice architectures
Excellent communication and technical documentation skills for collaboration across multi disciplinary teams
Benefits
Competitive salary and performance-based incentives.
Comprehensive health, dental, and vision benefits package.
401k Match (US-based only)
$200/mos Health and Wellness Stipend
$400/year Continuing Education Credit
$500/year Function Health subscription (US-based only)
Free parking, for in-office employees
Unlimited Approved PTO
Parental Leave for Eligible Employees
Supplemental Life Insurance
Company
webAI
webAI is designed to streamline the training, deployment, and execution of AI models by offering a unified execution layer for AI that seamlessly integrates cloud-based services and local devices.
Funding
Current Stage
Growth StageRecent News
Google Patent
2024-04-16
Company data provided by crunchbase