Senior DevOps Engineer jobs in United States

ALTEN · 4 months ago

Senior DevOps Engineer

ALTEN is seeking a Senior DevOps Engineer to join their EIT DevOps Team, responsible for managing and optimizing cloud infrastructure for Digital Services in the federal sector. This role involves ensuring application efficiency and high availability while collaborating closely with development and operations teams.

Information Technology
H1B Sponsor Likely

Responsibilities

Cloud Infrastructure Management: Deploy, manage, and maintain cloud infrastructure across AWS, Azure, and/or GCP, ensuring compliance for government workloads
Infrastructure Automation: Automate infrastructure provisioning using Infrastructure as Code (IaC) tools like Terraform, OpenTofu, or AWS CloudFormation
Deployment Pipeline Streamlining: Collaborate with development teams to streamline CI/CD pipelines using tools such as GitLab and OpenTofu for efficient infrastructure and application delivery
Performance Optimization: Monitor system performance, participate in capacity planning, and optimize application and infrastructure performance by tuning configurations and identifying bottlenecks
Automation Development: Develop scripts and tools to automate routine operations, including patching, scaling, and monitoring
Self-Healing Systems: Design and implement self-healing systems that proactively detect and resolve faults
Data Integrity & Availability: Manage backup and disaster recovery strategies to ensure data integrity and availability across environments
Security & Compliance: Perform regular security audits and vulnerability patching, adhering to government compliance requirements (e.g., FedRAMP, NIST)
Real-time Incident Resolution: Respond to and resolve infrastructure incidents and outages in real-time, minimizing disruption
Root Cause Analysis (RCA): Conduct RCA for production issues and implement long-term corrective actions
On-Call Participation: Participate in an on-call rotation, escalating and coordinating responses to high-severity issues
Incident Documentation: Document incidents, responses, and postmortems to capture lessons learned
Complex Problem Diagnosis: Diagnose complex infrastructure and application problems, including database performance issues, latency, and service connectivity challenges
Comprehensive Logging & Telemetry: Ensure comprehensive logging and telemetry to support incident response, performance tuning, and auditing
Observability Improvements: Drive observability improvements by collaborating with Engineering and Platform teams to enhance system reliability and traceability
Application Incident Leadership: Lead resolution efforts for application-level incidents, ensuring coordinated response across teams
Application Lifecycle Management: Oversee application lifecycle management, including version upgrades, security patches, and regional rollouts
Knowledge Base Contribution: Contribute to a shared knowledge base, documenting recurring issues and resolution steps
Scaling Strategies: Support scaling strategies to meet regional demand, ensuring infrastructure resilience and compliance with service-level objectives (SLOs)
Strong written and verbal communication skills, with the ability to clearly document procedures, incidents, and solutions
Effective at producing support documentation and conducting knowledge transfer or training sessions
Demonstrated ability to work independently with minimal supervision in a fast-paced, collaborative, and globally distributed team

Qualifications

Cloud Infrastructure Management
Infrastructure as Code (IaC)
CI/CD Pipeline Streamlining
Security & Compliance
Performance Optimization
Automation Development
Incident Management
Knowledge Management
Problem Diagnosis
Communication Skills

Required

Extensive knowledge of cloud services and DevOps best practices
Experience with cloud infrastructure management across AWS, Azure, and/or GCP
Proficiency in Infrastructure as Code (IaC) tools like Terraform, OpenTofu, or AWS CloudFormation
Experience with CI/CD pipelines using tools such as GitLab and OpenTofu
Ability to monitor system performance and participate in capacity planning
Experience in developing scripts and tools to automate routine operations
Ability to design and implement self-healing systems
Experience managing backup and disaster recovery strategies
Ability to perform regular security audits and vulnerability patching
Experience in real-time incident resolution and root cause analysis (RCA)
Participation in on-call rotation for high-severity issues
Ability to document incidents, responses, and postmortems
Experience diagnosing complex infrastructure and application problems
Ability to ensure comprehensive logging and telemetry
Experience driving observability improvements
Ability to lead resolution efforts for application-level incidents
Experience overseeing application lifecycle management
Ability to contribute to a shared knowledge base
Experience supporting scaling strategies for infrastructure resilience
Strong written and verbal communication skills
Ability to produce support documentation and conduct knowledge transfer or training sessions
Demonstrated ability to work independently with minimal supervision
Motivated, proactive mindset with a commitment to delivering high-quality, secure, and reliable systems

Company

ALTEN is a high-technology consulting and engineering group. The group supplies its services to technical departments.

H1B Sponsorship

ALTEN has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships
2025 (101)
2024 (89)
2023 (76)
2022 (64)
2021 (80)
2020 (69)

Funding

Current Stage: Public Company
Total Funding: unknown
IPO: 1999-04-01

Leadership Team

Lesly Mouchnino
UX Designer
Company data provided by Crunchbase