Apply on Employer Site

TEKHQS · 2 hours ago

Senior AI/ML Engineer (GenAI & LLM Systems)

California, United States

Full-time

Onsite

Mid, Senior Level

4+ years exp

TEKHQS is a global technology solutions provider headquartered in Lake Forest, California, and they are seeking a Senior AI/ML Engineer to design, fine-tune, and deploy production-grade Generative AI and LLM-powered systems. This hands-on role involves working on scalable AI platforms and integrating intelligent systems into enterprise workflows.

Information TechnologySoftwareWeb Development

Growth Opportunities

H1B Sponsor Likely

Responsibilities

Design, fine-tune, and optimize transformer-based models (GPT, LLaMA, Mistral, T5) for production use cases

Build and maintain end-to-end GenAI pipelines: data processing, training, evaluation, deployment, and monitoring

Implement Retrieval-Augmented Generation (RAG) systems using vector databases and hybrid search

Optimize inference for latency, throughput, and cost efficiency

Work with multi-modal AI (text, embeddings, images, audio where applicable)

Integrate AI services into enterprise applications, ERP systems, and SaaS platforms

Collaborate with product, backend, and cloud teams to deliver scalable AI solutions

Apply best practices in ML governance, security, and responsible AI

Qualification

PyTorchTransformer architecturesLLMsModel optimizationRAG pipelinesVector databasesQuantization techniquesDistributed trainingDockerKubernetesPython engineeringCI/CD for MLMonitoring ML systemsSoft skills

Required

Strong experience with PyTorch and transformer architectures

Hands-on experience with LLMs, embeddings, fine-tuning (LoRA/QLoRA), and prompt engineering

Solid understanding of training vs inference tradeoffs, evaluation metrics, and model behavior

Experience with RAG pipelines, vector databases (Pinecone, Weaviate, FAISS, Chroma)

Familiarity with RLHF concepts (DPO, PPO, reward modeling) is a plus

Tokenization concepts (BPE, SentencePiece, Tiktoken)

Quantization and optimization techniques (GPTQ, AWQ, int8, fp16)

Model serving using vLLM, Triton, HuggingFace TGI, or similar

Experience deploying models on AWS, Azure, or GCP

Distributed training or inference using DeepSpeed, FSDP, Accelerate

Data pipelines using Parquet, WebDataset, or cloud storage

CI/CD for ML workflows

Strong Python engineering practices

Docker and Kubernetes for ML workloads

Experience with monitoring, logging, and profiling ML systems

Bachelors or Masters degree in Computer Science, AI, Data Science, or related field

4+ years of professional ML experience, with 3+ years in GenAI/LLMs

Proven experience deploying AI systems to production

Preferred

Experience with ERP-integrated AI solutions (NetSuite, SAP, Dynamics)

Exposure to multi-agent systems, orchestration frameworks, or AutoGen/LangGraph

Open-source contributions or published technical work

Company

TEKHQS

TekhQs is a US head quartered Customer Software Development Company, it has been providing digital solutions.

Founded in 2011

Irvine, California, USA

201-500 employees

https://www.tekhqs.com/

H1B Sponsorship

TEKHQS has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)

Distribution of Different Job Fields Receiving Sponsorship

Represents job field similar to this job

Trends of Total Sponsorships

2025 (18)

2024 (14)

2023 (15)

2022 (26)

2021 (11)

2020 (7)

Funding

Current Stage

Growth Stage

Leadership Team

Alexandra Luna

Chief Executive Officer

Lee Loyola

Vice President, Strategic Alliances & Partnerships

Company data provided by crunchbase