Runpod · 5 months ago
Principal Software Engineer - Storage Systems
Runpod is pioneering the future of AI and machine learning, offering cutting-edge cloud infrastructure for full-stack AI applications. They are seeking a Principal Software Engineer - Storage Systems to design and deploy scalable cloud storage architectures and optimize storage functionalities as part of their growing cloud platform.
AI InfrastructureArtificial Intelligence (AI)Cloud InfrastructureGPU
Responsibilities
Design and deploy scalable cloud storage architectures (object, block, and file storage) on Runpod’s GPU and AI centered cloud
Implement backup, disaster recovery, and data lifecycle management strategies
Monitor storage systems for performance, availability, and security compliance
Collaborate with Engineering, SRE, Product Management, and our partner companies to define storage architecture and tiering strategies
Contribute to architectural discussions and decisions
Optimize storage costs and usage through continuous analysis and improvements
Stay up-to-date with industry trends and emerging technologies
Ensure adherence to data governance, compliance, and security policies
Troubleshoot and resolve cloud storage issues efficiently
Qualification
Required
Bachelor's degree in Computer Science, Information Systems, or a related field, or equivalent experience
Proficiency with one or more programming or scripting languages - for example, C, Go, Rust, Python, Javascript, Typescript, Bash
3+ years of experience in storage engineering or systems administration
Hands-on experience with high performance/enterprise storage platforms (e.g., Dell EMC, 3PAR/Nimble, IBM, Pure Storage, VAST Data, CephFS)
Linux scripting and Infrastructure as Code skills (Bash, Terraform, etc)
Strong problem-solving skills and ability to work in a collaborative environment
Excellent communication skills and attention to detail
Successful completion of a background check
Preferred
Experience developing, deploying and/or supporting machine learning applications, training and/or inference
Familiarity with data classification and tiering strategies
Experience with storage integration in virtualized and containerized environments
Understanding of IAM, encryption, and other security best practices in cloud environments
Familiarity implementing encryption with customer-managed keys
Knowledge of compliance standards (e.g., SOX, HIPAA, GDPR) related to storage and data handling
Experience with RDMA/RocE/kernel bypass and fast-path optimization for low-latency, high-throughput systems
Benefits
Meaningful equity in a fast-growing AI infra company- everyone on the team receives stock options — your impact drives our growth, and you share in the upside.
Generous medical, dental & vision plans — we cover 100% for all employees and partial for dependents.
Flexible PTO- take the time you need to recharge
$1,200 Home Office & Equipment Stipend- We set you up for success from day one with gear and support to create your ideal workspace
Company
Runpod
Runpod is a cloud platform designed for GPUs, enabling developers to deploy customized full-stack AI applications.
H1B Sponsorship
Runpod has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (4)
2024 (3)
Funding
Current Stage
Growth StageTotal Funding
$22M2024-05-08Seed· $20M
2023-03-30Pre Seed· $2M
Recent News
Crunchbase News
2025-12-26
intelcapital.com
2025-12-04
2025-10-23
Company data provided by crunchbase