Software Engineer III, Reliability jobs in United States

Box · 1 day ago

Software Engineer III, Reliability

Box is the leader in Intelligent Content Management, enabling organizations to fuel collaboration and transform business workflows with enterprise AI. The Software Engineer III on the Reliability Engineering team will focus on ensuring the platform's performance, scalability, and reliability by analyzing system behaviors and designing scalable solutions.

Cloud Computing, Enterprise Software, File Sharing, Flash Storage, Web Hosting
Growth Opportunities
H1B Sponsor Likely

Responsibilities

Partner with product and platform engineering teams to assess service designs for scalability and performance risks, ensuring systems are built for long-term growth
Analyze production workloads, system metrics, and load test results to identify bottlenecks, resource inefficiencies, and architectural scaling limits
Design and build frameworks for load testing, capacity modeling, and performance validation that enable teams to proactively address scale concerns
Drive improvements in backend service efficiency, API response times, and resource utilization across Box’s globally distributed platform
Collaborate with SRE, infrastructure, and platform teams to optimize scaling strategies, auto-scaling policies, and resource allocation
Build automation and tooling that integrate performance validation into CI/CD pipelines, enabling early detection of regressions
Participate in root cause analysis of performance-related incidents, identify systemic issues, and drive cross-team remediation efforts
Contribute to the evolution of observability standards (SLIs, SLOs, latency/error budgets) that measure and guide service health

Qualifications

Performance Engineering, Backend Systems, Distributed Systems, Load Testing Tools, Cloud Infrastructure, Observability Tools, Go, Java, Problem-Solving, Collaboration, Communication

Required

3+ years of experience in software engineering, performance engineering, or site reliability engineering, with a focus on backend systems and scalability
Proficient in one or more programming languages such as Go or Java, with an emphasis on building performant services
Strong understanding of distributed systems, concurrency, resource contention, and efficient system design
Hands-on experience analyzing and improving application and system performance across compute, storage, database, and networking layers
Familiarity with load testing and performance benchmarking tools (e.g., Locust, JMeter, Gatling, or custom frameworks)
Experience working with cloud infrastructure (AWS, GCP) and container orchestration (Kubernetes)
Proficient with observability tools and telemetry systems (e.g., Prometheus, Chronosphere, Grafana, Datadog, ELK)
Excellent problem-solving and analytical skills, with a data-driven approach to diagnosing complex system behaviors
Strong collaboration and communication skills; comfortable partnering across engineering teams to drive reliability improvements

Preferred

Experience with service mesh technologies (Istio, Envoy) and cloud-native networking performance optimization
Exposure to capacity planning, cost optimization, and long-term resource forecasting in cloud environments
Familiarity with incident response processes, post-incident reviews, and reliability improvement practices
Experience contributing to internal platforms, developer tooling, or performance automation frameworks

Benefits

Healthcare benefits
Box Benefits + Perks

Company

Box is an online file sharing and cloud content management service offering unlimited storage, custom branding, and administrative controls.

H1B Sponsorship

Box has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (95)
2024 (93)
2023 (58)
2022 (100)
2021 (109)
2020 (114)

Funding

Current Stage
Public Company
Total Funding
$1.46B
Key Investors
Kohlberg Kravis Roberts, Future Fifty, General Atlantic
2024-09-18 · Post-IPO Debt · $400M
2021-04-08 · Post-IPO Equity · $500M
2015-01-23 · IPO

Leadership Team

Ben Kus
Chief Technology Officer
Dylan Smith
CFO
Company data provided by Crunchbase