Senior Engineer – Cognitive Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

HCLTech · 1 day ago

Senior Engineer – Cognitive Infrastructure

HCLTech is seeking a Senior Engineer in Cognitive Infrastructure to work closely with key Tech OEMs and internal stakeholders to generate business opportunities in AI. The role involves designing and operating hybrid Kubernetes clusters, deploying NVIDIA GPU operators, and building MLOps pipelines to support AI infrastructure solutions.

Information and Communications Technology (ICT)IT ManagementOutsourcingSoftwareTelecommunications
check
H1B Sponsor Likelynote
Hiring Manager
Trilochan Joshi
linkedin

Responsibilities

Design & Operate Hybrid Kubernetes clusters on AWS/GCP/Azure and on‑prem (bare‑metal, DGX, Grace Hopper)
Deploy & manage the NVIDIA GPU Operator (drivers, CUDA, MIG, device plugins) and create GPU‑aware scheduling policies
Build production‑grade MLOps pipelines with Kubeflow Pipelines, GitOps (Argo CD/Flux), MLflow/DVC
Deploy & operate LLMs using NVIDIA Triton, vLLM, TensorRT‑LLM, or custom FastAPI/GRPC services – include quantization, dynamic batching, safety‑filter integration and per‑tenant quota enforcement
Integrate vector databases (Milvus, Pinecone, Qdrant, Weaviate, FAISS) for retrieval‑augmented generation and similarity search
Implement observability (Prometheus, Grafana, Loki/ELK, OpenTelemetry) and define SLO/SLI dashboards
Enforce security & compliance – RBAC, OPA/Gatekeeper, Vault/KMS, image signing, GDPR/HIPAA guidelines
Optimize cost & capacity – GPU quota controls, spot‑instance usage, auto‑scaling, transparent cost reporting
Enable teams – turn notebooks into reproducible pipelines, run office‑hours, write docs/tutorials
Drive technology roadmap – evaluate new NVIDIA releases, open‑source projects (Kubeflow, LangChain, vLLM, TGI etc.) and lead PoCs

Qualification

KubernetesNVIDIA GPU OperatorMLOpsPythonVector databasesObservability toolsSecurity complianceTechno-Commercial skillsOpenShiftCloud AI/ML Certification

Required

12+ Years Total Experience
8+ years building & operating production Kubernetes (cloud + on‑prem)
Deep knowledge of NVIDIA GPU Operator stack (drivers, CUDA, MIG)
Strong hands‑on with Kubeflow Pipelines or equivalent MLOps tools
Experience deploying LLMs at scale (quantization, LoRA, inference optimization)
Proficiency in Python (PyTorch, TensorFlow, HuggingFace, LangChain) and IaC (Helm, Kustomize, Terraform)
Experience with vector search engines (Milvus, Pinecone, etc.)
Solid observability/SRE background (Prometheus, Grafana, OpenTelemetry)
Security‑first mindset (RBAC, OPA, Vault, image signing)
Techno-Commercial skills are a must

Preferred

Work with NVIDIA DGX / Grace Hopper hardware
Knowledge of OpenShift, k3s, or edge‑focused deployments
Experience with LWS, Kserve, or serverless inference
Open‑source contributions (Kubernetes, Kubeflow, Triton, Milvus, vLLM)
Certifications – CKA, Any Cloud AI/ML Certification, Nvidia Certifications

Benefits

Continuous opportunities for you to find your spark and grow with us
Transparent communication with senior level employees
Learning and career development programs at every level
Opportunities to experiment in different roles or even pivot industries

Company

HCLTech is a global IT company offering digital, engineering, and cloud solutions partnering with businesses for transformation.

H1B Sponsorship

HCLTech has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2975)
2024 (3974)
2023 (3649)
2022 (3861)
2021 (4093)
2020 (4317)

Funding

Current Stage
Public Company
Total Funding
$220M
Key Investors
ChrysCapital
2008-07-10Post Ipo Equity· $220M
2000-01-06IPO

Leadership Team

leader-logo
Vijayakumar C.
Chief Executive Officer
linkedin
leader-logo
Alan Flower
Executive Vice President - CTO & Global Head, AI & Cloud Native Labs
linkedin
Company data provided by crunchbase