Cloud Reliability & Support Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Peraton · 1 month ago

Cloud Reliability & Support Engineer

Peraton is a next-generation national security company that drives missions of consequence spanning the globe. They are seeking a Cloud Reliability & Support Engineer to provide expert Level 3 Anomaly Resolution and operational excellence for their Department of Defense customer, focusing on troubleshooting and ensuring the stability of cloud applications.

Information TechnologyRobotics
badNo H1BnoteSecurity Clearance RequirednoteU.S. Citizen Onlynote

Responsibilities

Serve as the primary technical resource for complex, escalated incidents that are contained within the tenant's RHOSP project/resources
Expertly troubleshoot issues on tenant RHEL instances, including kernel panics, package conflicts, file system errors, and performance degradation (CPU, memory, I/O)
Diagnose issues related to the tenant's consumption of OpenStack services (e.g., Nova instance failures, Neutron port issues, Cinder volume attachment problems)
Utilize monitoring tools to perform deep-dive analysis and isolate the root cause of service disruptions within the OpenStack data plane
Own the technical execution and documentation of RCAs, focusing on issues rooted in RHEL/RHOSP misconfiguration or resource limitations
Maintain partnership with Red Hat vendor to stay up to date with the latest advancements in Red Hat products and industry best practices to maintain effective and innovative infrastructures

Qualification

RHEL administrationRed Hat OpenStackCloud operationsSystem diagnosticsScripting languagesContainer technologiesNetworking knowledgeLog analysisAnalytical thinkingCommunication skillsOrganizational skills

Required

This position requires the candidate to possess a minimum of Top-Secret clearance with the ability to obtain TS/SCI. The candidate must maintain the clearance
Associates degree and 10+ years of experience in a systems engineering related field; OR bachelor's degree in computer science, computer engineering, or related field and 8+ years of experience in a systems engineering related field; or a master's degree in computer science, cloud computing, or related field and 6+ years of experience in a systems engineering related field. Additional four (4) years of relevant experience will be considered in lieu of a degree
Meet DoD 8140 foundational requirements for a System Developer with a proficiency of advanced
4+ years of hands-on experience in a cloud operations, system reliability engineer (SRE), or highly technical Level-3 support role within a Linux/Private Cloud environment
Deep-level expertise with RHEL/CentOS administration, networking, and system diagnostics
Strong understanding of Red Hat OpenStack service interaction (Nova, Neutron)
Proficiency with key observability tools and log analysis on Linux systems (e.g., systemd-journald, specialized OpenStack logs)
Expert skill in diagnosing resource contention and failure patterns in distributed systems on a Linux operating system
Proficiency in Linux systems administration, cloud computing, and virtualization, with a strong understanding of both public and private cloud environments
Strong communication and organizational skills in coordination with customers / tenants

Preferred

Certifications: Red Hat Certified Engineer (RHCE) or equivalent is highly preferred
You have strong skills in scripting languages such as Python (specifically for OpenStack SDK interaction)
Hands-on experience with container technologies (Docker, Kubernetes) and demonstrable experience with OpenShift Container Platform
A solid grasp of enterprise networking, firewalls, and security best practices
Strong analytical and conceptual thinking skills to troubleshoot complex issues and optimize performance
Ability to learn independently, adapt to an evolving environment, and stay current with industry trends

Benefits

Heavily subsidized employee benefits coverage for you and your dependents
25 days of PTO accrued annually up to a generous PTO cap
Eligible to participate in an attractive bonus plan

Company

Peraton Fearlessly solving the toughest national security challenges.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Thomas Terjesen
Chief Information Officer
linkedin
Company data provided by crunchbase