Terraform Engineer - Azure-Centric jobs in United States
cer-icon
Apply on Employer Site
company-logo

ChatGPT Jobs · 8 hours ago

Terraform Engineer - Azure-Centric

ChatGPT Jobs is seeking a highly skilled Senior Terraform Engineer with deep expertise in Azure services to join their Enterprise AI Platform team. This role focuses on deploying Machine Learning and Generative AI models in scalable, secure, enterprise environments, while automating infrastructure provisioning and optimizing deployments for performance and compliance.

Computer Software
badNo H1BnoteU.S. Citizen Onlynote

Responsibilities

Design, implement, and maintain Infrastructure as Code (IaC) solutions using Terraform to provision and manage Azure resources, including:
Azure Machine Learning (Azure ML)
Azure AI Studio
Azure Kubernetes Service (AKS)
Azure Databricks
Related services supporting ML and GenAI model deployment
Develop and enforce IaC best practices, including:
Modular Terraform design
Remote state management (Azure Storage backends)
Drift detection
Automated policy and security testing using tools such as Terragrunt and Checkov
Deploy and orchestrate ML and GenAI models on enterprise ML platforms
Enable end-to-end automation across the ML lifecycle, from model training through inference
Integrate AI/ML workflows with CI/CD pipelines (Azure DevOps, GitHub Actions)
Collaborate with data scientists, ML engineers, and cross-functional teams to design multi-cloud architectures, with Azure as the primary platform and AWS/Google Cloud Platform integrations
Support hybrid deployments, data sovereignty requirements, and disaster recovery strategies
Implement cross-cloud networking, identity federation, and resource orchestration
Optimize cloud infrastructure for AI/ML workloads, including:
Compute clusters
Storage (Azure Blob Storage, Azure Data Lake Storage – ADLS)
Networking (Virtual Networks, Private Endpoints)
Security controls (Azure RBAC, Azure Key Vault, Azure Sentinel)
Ensure infrastructure meets enterprise security, availability, and compliance standards (e.g., GDPR, SOC 2)
Implement MLOps best practices, including:
Model versioning
Monitoring
Logging
Alerting
Leverage observability tools such as Azure Monitor, Prometheus, and MLflow to ensure reliable, production-grade deployments
Troubleshoot and resolve infrastructure issues in production AI environments
Ensure high availability, scalability, and reliability of AI platforms
Conduct code reviews, mentor junior engineers, and contribute to documentation for ML/GenAI-specific IaC patterns
Stay current with emerging Azure ML services, including:
Azure OpenAI Service
Prompt Flow
Participate in on-call rotations and incident response for critical AI infrastructure

Qualification

TerraformAzure servicesMachine LearningMulti-cloud architecturesInfrastructure as CodePythonKubernetesAgile teamsProblem-solving

Required

Bachelor's or Master's degree in Computer Science, Engineering, or a related field (or equivalent professional experience)
5+ years of experience as a Cloud Engineer, DevOps Engineer, or similar role
At least 3 years of hands-on experience with Terraform for IaC in Azure environments
Proven experience deploying ML and GenAI models using Azure ML, including: Model training, Model registration, Managed endpoints, Inference pipelines
Strong hands-on experience with multi-cloud architectures: Azure required, AWS and/or Google Cloud Platform preferred
In-depth understanding of Terraform concepts, including: Modules, Providers (AzureRM), Variables and outputs, Workspaces and backends
Solid understanding of the machine learning lifecycle, including: Data ingestion, Feature engineering, Model serving, Scaling in enterprise AI platforms (Azure ML, SageMaker, Vertex AI)
Experience with containerization and orchestration tools: Docker, Kubernetes (AKS), Helm
Proficiency in scripting languages such as Python, PowerShell, or Bash
Familiarity with cloud security best practices for ML environments, including: Encryption, Access controls, Vulnerability scanning
Strong problem-solving skills and experience working in Agile teams

Preferred

Relevant certifications, including: Microsoft Certified: Azure DevOps Engineer Expert, Azure AI Engineer Associate, HashiCorp Certified: Terraform Associate
Experience with additional IaC tools such as: ARM Templates, Bicep, Pulumi (for hybrid Azure setups)
Background in MLOps tooling, including: Kubeflow, MLflow, Azure ML Pipelines
Experience with cloud cost optimization for AI workloads using tools like Azure Cost Management
Prior experience working in regulated industries (finance, healthcare, etc.) with compliance-driven infrastructure requirements

Company

ChatGPT Jobs

twitter
company-logo
We find the best job offers for experts in ChatGPT and related technologies.

Funding

Current Stage
Early Stage
Company data provided by crunchbase