SIGN IN
Site Reliability Developer 3 jobs in United States
cer-icon
Apply on Employer Site
company-logo

NetSuite · 2 months ago

Site Reliability Developer 3

NetSuite is part of Oracle Cloud Infrastructure, focused on building the future of the cloud for Enterprises. They are seeking a Senior Site Reliability Engineer to solve technical challenges by defining, designing, deploying, and troubleshooting key Network Automation services with an emphasis on scalability, security, and performance.
Cloud ComputingEnterprise SoftwareSaaSElectronicsSoftwareAppsComputerCRMiOS
badNo H1BnoteSecurity Clearance RequirednoteU.S. Citizen Onlynote

Responsibilities

Design and manage distributed Unix-based systems, particularly Oracle Linux
Implement auto-scaling and self-healing infrastructure to ensure uptime and durability
Tune system internals, including kernel parameters, networking, and filesystems, for high performance
Maintain timely OS patching and compliance posture across environments
Integrate systems with enterprise identity services such as Active Directory, LDAP, and Kerberos
Develop and maintain infrastructure automation using Ansible and Terraform
Automate deployment pipelines, service configurations, and patch management
Develop scripts and services in Python and Bash to enhance infrastructure delivery workflows
Extend APIs and platform automation to drive efficiency and repeatability
Develop observability stacks using tools like Prometheus, Grafana, and other open-source telemetry tools
Create dashboards and SLO/SLI-based alerts for real-time monitoring of production systems
Participate in a global 24/7 on-call rotation, leading responses for high-severity incidents
Conduct post-incident analysis (RCA) and drive remediations that improve long-term reliability
Partner with development teams to embed reliability in deployment pipelines
Help define system architecture standards and maintain robust platform documentation
Mentor engineers in Unix performance, observability, and debugging practices
Champion a culture of automation, resilience, and continuous improvement

Qualification

Distributed systems designInfrastructure automationObservability toolsUnix-based systemsPython scriptingAnsibleTerraformIncident responseCollaborationMentoring

Required

3 to 5+ years of experience in Site Reliability Engineering or related fields
Proficiency in English (reading, writing, speaking)
Experience with distributed Unix-based systems, particularly Oracle Linux
Ability to implement auto-scaling and self-healing infrastructure
Experience tuning system internals, including kernel parameters, networking, and filesystems
Knowledge of OS patching and compliance posture management
Experience integrating systems with enterprise identity services such as Active Directory, LDAP, and Kerberos
Proficiency in infrastructure automation using Ansible and Terraform
Experience automating deployment pipelines, service configurations, and patch management
Ability to develop scripts and services in Python and Bash
Experience extending APIs and platform automation
Knowledge of observability stacks using tools like Prometheus and Grafana
Ability to create dashboards and SLO/SLI-based alerts
Willingness to participate in a global 24/7 on-call rotation
Experience conducting post-incident analysis (RCA)
Ability to partner with development teams to embed reliability in deployment pipelines
Experience defining system architecture standards and maintaining platform documentation
Ability to mentor engineers in Unix performance, observability, and debugging practices
Commitment to a culture of automation, resilience, and continuous improvement

Company

NetSuite

company-logo
NetSuite is cloud computing company dedicated to delivering business applications over the internet.

Funding

Current Stage
Public Company
Total Funding
$157.79M
Key Investors
Meritech Capital PartnersTako VenturesStarVest Partners
2016-07-28Acquired
2007-12-20IPO
2007-02-05Secondary Market· $17.87M

Leadership Team

leader-logo
Brian Chess
SVP Technology and AI
linkedin
E
Eli Johnson
Vice President, Global Sales Productivity
linkedin
Company data provided by crunchbase