System Reliability Engineer, Infrastructure R&D jobs in United States

Veeam Software · 8 hours ago

System Reliability Engineer, Infrastructure R&D

Veeam Software is the #1 global market leader in data resilience, providing businesses with control over their data. They are seeking a System Reliability Engineer to manage R&D infrastructure, ensuring reliable operation and supporting Azure DevOps Server while collaborating with R&D teams.

Cloud Infrastructure · Data Center · Data Management · Enterprise Software · Software · Virtualization
Comp. & Benefits · No H1B · U.S. Citizen Only

Responsibilities

Deploy and manage physical and virtual infrastructure for R&D teams, from bare-metal server setup to high-density, heterogeneous virtualized clusters
Be available for periodic on-site visits to data centers to support physical hardware deployment, maintenance, and issue resolution
Administer and support Azure DevOps Server (On-Premises and Cloud) for source code version control
Assist R&D teams with troubleshooting and optimizing build processes
Diagnose and resolve performance issues in high-utilization virtualization clusters and storage systems
Design optimized, purpose-specific server and storage hardware configurations in collaboration with procurement teams
Investigate and resolve issues reported by R&D teams and automated monitoring tools through thorough root cause analysis
Contribute to the design and implementation of disaster recovery strategies
Maintain and enhance internal documentation
Identify and implement opportunities for process automation and efficiency improvements

Qualifications

Azure DevOps Server · Virtualization clusters · Active Directory · PowerShell scripting · Microsoft Azure · SQL Server · VMware vSphere · Communication skills · Problem-solving · Team collaboration

Required

Self-sufficient, proactive, and results-oriented
Strong verbal and written communication skills, with the ability to explain complex topics to audiences with varying levels of technical expertise
5+ years of experience administering and troubleshooting Active Directory, Hyper-V, SQL Server, and VMware vSphere products
3+ years of experience designing, implementing, and troubleshooting sophisticated, highly utilized virtualization clusters built on shared storage and complex network topology
3+ years of experience administering Azure DevOps Server (Microsoft Team Foundation Server), including data migration between different platform versions
Experience administering Microsoft Azure
Experience writing advanced PowerShell scripts, including those that utilize 3rd-party modules
Experience configuring monitoring systems from scratch, with a focus on optimizing triggers and alerts
Deep knowledge of the OSI model and network traffic virtualization

Preferred

Familiarity with *nix systems such as Linux, macOS, and AIX
Familiarity with Git and TeamCity
Experience designing and implementing Disaster Recovery Plans
Familiarity with off-site and GFS backup strategies using Veeam products such as Backup & Replication and Veeam Agents
Familiarity with the technical nuances of software development (from source code to RTM product)
Familiarity with hardware capacity planning and procurement processes in large organizations

Benefits

Unlimited paid time off, plus 3 global VeeaMe Days for self-care
Paid parental leave: 8 weeks for all parents, 16 weeks for birthing parents
Medical, dental, and vision coverage from day one
Mental health support, therapy sessions, and digital wellness tools via SupportLinc EAP
401(k) retirement plan with matching contributions up to annual limits
Fertility, adoption, and surrogacy support through Maven, plus paid volunteer time
AirVet: 24/7 virtual veterinary care at no cost
Legal services, identity protection, and supplemental health insurance options
Tax-advantaged spending accounts for healthcare, dependent care, and commuting
Professional training and education, including courses and workshops, internal meetups, and unlimited access to our online learning platforms (LinkedIn Learning, Athena, O’Reilly) and mentoring through our MentorLab program

Company

Veeam Software

Veeam provides data resilience and data management solutions for cloud, virtual, and physical environments.

Funding

Current Stage: Late Stage
Total Funding: $2.51B
Key Investors: Microsoft, TPG, Insight Partners
2025-02-25 · Corporate Round · $10M
2024-12-04 · Secondary Market · $2B
2020-01-09 · Acquired

Leadership Team

Anand Eswaran, President and CEO
William H. Largent, Chief Executive Officer
Company data provided by Crunchbase