HCLTech · 14 hours ago
Senior Engineer – Cognitive Infrastructure
HCLTech is seeking a Senior Engineer in Cognitive Infrastructure to work closely with key Tech OEMs and internal stakeholders to generate business opportunities in AI. The role involves designing and operating hybrid Kubernetes clusters, deploying NVIDIA GPU operators, and building MLOps pipelines to support AI infrastructure solutions.
Responsibilities
Design & operate hybrid Kubernetes clusters on AWS/GCP/Azure and on‑prem (bare‑metal, DGX, Grace Hopper)
Deploy & manage the NVIDIA GPU Operator (drivers, CUDA, MIG, device plugins) and create GPU‑aware scheduling policies
Build production‑grade MLOps pipelines with Kubeflow Pipelines, GitOps (Argo CD/Flux), MLflow/DVC
Deploy & operate LLMs using NVIDIA Triton, vLLM, TensorRT‑LLM, or custom FastAPI/gRPC services, including quantization, dynamic batching, safety‑filter integration, and per‑tenant quota enforcement
Integrate vector databases (Milvus, Pinecone, Qdrant, Weaviate, FAISS) for retrieval‑augmented generation and similarity search
Implement observability (Prometheus, Grafana, Loki/ELK, OpenTelemetry) and define SLO/SLI dashboards
Enforce security & compliance – RBAC, OPA/Gatekeeper, Vault/KMS, image signing, GDPR/HIPAA guidelines
Optimize cost & capacity – GPU quota controls, spot‑instance usage, auto‑scaling, transparent cost reporting
Enable teams – turn notebooks into reproducible pipelines, run office‑hours, write docs/tutorials
Drive the technology roadmap – evaluate new NVIDIA releases and open‑source projects (Kubeflow, LangChain, vLLM, TGI, etc.) and lead PoCs
Qualifications
Required
12+ years of total experience
8+ years building & operating production Kubernetes (cloud + on‑prem)
Deep knowledge of NVIDIA GPU Operator stack (drivers, CUDA, MIG)
Strong hands‑on experience with Kubeflow Pipelines or equivalent MLOps tools
Experience deploying LLMs at scale (quantization, LoRA, inference optimization)
Proficiency in Python (PyTorch, TensorFlow, Hugging Face, LangChain) and IaC (Helm, Kustomize, Terraform)
Experience with vector search engines (Milvus, Pinecone, etc.)
Solid observability/SRE background (Prometheus, Grafana, OpenTelemetry)
Security‑first mindset (RBAC, OPA, Vault, image signing)
Techno‑commercial skills are a must
Preferred
Experience working with NVIDIA DGX / Grace Hopper hardware
Knowledge of OpenShift, k3s, or edge‑focused deployments
Experience with LWS, KServe, or serverless inference
Open‑source contributions (Kubernetes, Kubeflow, Triton, Milvus, vLLM)
Certifications – CKA, any cloud AI/ML certification, NVIDIA certifications
Benefits
Continuous opportunities for you to find your spark and grow with us
Transparent communication with senior‑level employees
Learning and career development programs at every level
Opportunities to experiment in different roles or even pivot industries
Company
HCLTech
HCLTech is a global IT company that offers digital, engineering, and cloud solutions and partners with businesses on transformation.
H1B Sponsorship
HCLTech has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (2975)
2024 (3974)
2023 (3649)
2022 (3861)
2021 (4093)
2020 (4317)
Funding
Current Stage: Public Company
Total Funding: $220M
Key Investors: ChrysCapital
2008-07-10 · Post‑IPO Equity · $220M
2000-01-06 · IPO
Company data provided by Crunchbase