Lead AWS Cloud Operations Engineer jobs in United States
info-icon
This job has closed.
company-logo

MDAEdge · 3 months ago

Lead AWS Cloud Operations Engineer

MDAEdge is a company focused on cloud operations, and they are seeking a Lead AWS Cloud Operations Engineer. The role involves managing and optimizing cloud infrastructure, overseeing AWS services, and leading a team of cloud engineers to ensure high availability and performance.

Human Resources

Responsibilities

Manage and maintain cloud infrastructure to ensure high availability, reliability, and performance
Serve as the primary escalation point for all cloud infrastructure issues
Monitor cloud resource performance and cost efficiency
Lead major incident management and communicate timely updates to stakeholders
Perform due diligence and impact analysis before implementing changes to cloud platforms
Lead and mentor a team of cloud engineers to ensure performance and collaboration
Manage daily operations and ensure alignment with organizational objectives
Develop and implement incident management processes and conduct root cause analysis
Identify and automate repetitive infrastructure tasks using IaC principles
Continuously improve operational processes and standard operating procedures
Implement and enforce security controls, ensuring compliance with standards such as GDPR and HIPAA
Monitor cloud usage and conduct capacity planning to balance efficiency and scalability
Develop and test disaster recovery and business continuity plans
Collaborate with IT, business units, and vendors to deliver scalable cloud solutions
Document cloud configurations, processes, and reports, ensuring accessibility and version control

Qualification

AWS EC2AWS ECSAWS EKSAWS RDSAWS S3AWS LambdaAWS SageMakerAWS CloudFrontInfrastructure as Code TerraformInfrastructure as Code CloudFormationCloud architecture understandingAzureOCI experienceScripting in PythonSystem administration WindowsSystem administration LinuxNetworking knowledge DNSNetworking knowledge DHCPVendor managementContinuous improvement mindsetITILITSM familiarityLeadership experienceCommunication skillsProblem-solving skills

Required

Manage and optimize control towers, organizational policies, and multi-account environments
Oversee AWS backups, SSM patching, AMI deployments, and configuration pushes across multiple accounts
Manage and maintain core AWS services including EC2, ECS, EKS, RDS, S3, SageMaker, CloudFront, and Lambda
Implement S3, SFTP, and site externalization methods
Develop Infrastructure as Code (IaC) using Terraform, CloudFormation, and Python
Manage IAM policies, access controls, and permissions
Manage and maintain cloud infrastructure to ensure high availability, reliability, and performance
Serve as the primary escalation point for all cloud infrastructure issues
Monitor cloud resource performance and cost efficiency
Lead major incident management and communicate timely updates to stakeholders
Perform due diligence and impact analysis before implementing changes to cloud platforms
Lead and mentor a team of cloud engineers to ensure performance and collaboration
Manage daily operations and ensure alignment with organizational objectives
Develop and implement incident management processes and conduct root cause analysis
Identify and automate repetitive infrastructure tasks using IaC principles
Continuously improve operational processes and standard operating procedures
Implement and enforce security controls, ensuring compliance with standards such as GDPR and HIPAA
Monitor cloud usage and conduct capacity planning to balance efficiency and scalability
Develop and test disaster recovery and business continuity plans
Collaborate with IT, business units, and vendors to deliver scalable cloud solutions
Document cloud configurations, processes, and reports, ensuring accessibility and version control
Proficiency in AWS (EC2, ECS, EKS, RDS, S3, Lambda, SageMaker, CloudFront)
Experience with Azure and OCI cloud environments
Infrastructure as Code (Terraform, CloudFormation, Ansible, Puppet, Chef)
Scripting in Python and PowerShell
Strong understanding of cloud architecture, monitoring, and automation tools
System administration experience (Windows, Linux, VMware, Active Directory, Azure AD SSO)
Strong networking knowledge (DNS, DHCP, PKI, LAN/WAN)
Demonstrated experience in leading teams and managing cloud operations
Strong communication and stakeholder management across technical and business functions
Proactive problem-solver with excellent analytical and root cause analysis skills
Self-motivated with a continuous improvement mindset
Experienced in vendor management and contract negotiations
Bachelor's degree in Computer Science, Information Technology, Electrical Engineering, or equivalent
Experience in cloud operations and team leadership in technical environments

Preferred

AWS Certified Solutions Architect – Associate or Professional
Microsoft Certified: Azure Architect
Familiarity with DevOps tools (CI/CD, Jenkins, Git)
Experience with ITIL or ITSM frameworks

Company

MDAEdge

twitter
company-logo
At MDAEdge, we help our clients reinvent innovation, optimize operations, and reshape perceptions—ensuring they remain at the forefront in today’s fast-evolving world.

Funding

Current Stage
Growth Stage
Company data provided by crunchbase