Senior AI/ML Engineer (GenAI & LLM Systems) jobs in United States
cer-icon
Apply on Employer Site
company-logo

TEKHQS ยท 2 hours ago

Senior AI/ML Engineer (GenAI & LLM Systems)

TEKHQS is a global technology solutions provider headquartered in Lake Forest, California, and they are seeking a Senior AI/ML Engineer to design, fine-tune, and deploy production-grade Generative AI and LLM-powered systems. This hands-on role involves working on scalable AI platforms and integrating intelligent systems into enterprise workflows.

Information TechnologySoftwareWeb Development
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design, fine-tune, and optimize transformer-based models (GPT, LLaMA, Mistral, T5) for production use cases
Build and maintain end-to-end GenAI pipelines: data processing, training, evaluation, deployment, and monitoring
Implement Retrieval-Augmented Generation (RAG) systems using vector databases and hybrid search
Optimize inference for latency, throughput, and cost efficiency
Work with multi-modal AI (text, embeddings, images, audio where applicable)
Integrate AI services into enterprise applications, ERP systems, and SaaS platforms
Collaborate with product, backend, and cloud teams to deliver scalable AI solutions
Apply best practices in ML governance, security, and responsible AI

Qualification

PyTorchTransformer architecturesLLMsModel optimizationRAG pipelinesVector databasesQuantization techniquesDistributed trainingDockerKubernetesPython engineeringCI/CD for MLMonitoring ML systemsSoft skills

Required

Strong experience with PyTorch and transformer architectures
Hands-on experience with LLMs, embeddings, fine-tuning (LoRA/QLoRA), and prompt engineering
Solid understanding of training vs inference tradeoffs, evaluation metrics, and model behavior
Experience with RAG pipelines, vector databases (Pinecone, Weaviate, FAISS, Chroma)
Familiarity with RLHF concepts (DPO, PPO, reward modeling) is a plus
Tokenization concepts (BPE, SentencePiece, Tiktoken)
Quantization and optimization techniques (GPTQ, AWQ, int8, fp16)
Model serving using vLLM, Triton, HuggingFace TGI, or similar
Experience deploying models on AWS, Azure, or GCP
Distributed training or inference using DeepSpeed, FSDP, Accelerate
Data pipelines using Parquet, WebDataset, or cloud storage
CI/CD for ML workflows
Strong Python engineering practices
Docker and Kubernetes for ML workloads
Experience with monitoring, logging, and profiling ML systems
Bachelors or Masters degree in Computer Science, AI, Data Science, or related field
4+ years of professional ML experience, with 3+ years in GenAI/LLMs
Proven experience deploying AI systems to production

Preferred

Experience with ERP-integrated AI solutions (NetSuite, SAP, Dynamics)
Exposure to multi-agent systems, orchestration frameworks, or AutoGen/LangGraph
Open-source contributions or published technical work

Company

TEKHQS

twittertwittertwitter
company-logo
TekhQs is a US head quartered Customer Software Development Company, it has been providing digital solutions.

H1B Sponsorship

TEKHQS has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (18)
2024 (14)
2023 (15)
2022 (26)
2021 (11)
2020 (7)

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Alexandra Luna
Chief Executive Officer
linkedin
leader-logo
Lee Loyola
Vice President, Strategic Alliances & Partnerships
linkedin
Company data provided by crunchbase