Arc Institute · 4 months ago
Infrastructure Engineer
Arc Institute is a new scientific institution focused on understanding and treating complex human diseases through curiosity-driven research and technology development. They are seeking an Infrastructure Engineer to design and optimize their Hybrid Cloud Infrastructure Platform, ensuring the availability and performance of compute, networking, and storage systems to support cutting-edge bioinformatic projects.
Biotechnology · Non Profit · Training
Responsibilities
Oversee the operation and optimization of our private cloud GPU cluster, focusing on enhancing availability, performance, and user experience
Design and execute strategies for automating system configurations efficiently and safely, ensuring minimal disruption to production
Develop a unified compute capacity platform with fixed and autoscaling resources spanning private and public clouds
Facilitate efficient, high-throughput, and seamless data transfer between instruments and compute environments
Enable the continuous integration and deployment of long-running services and databases across our hybrid platform
Elevate system reliability by achieving additional 'nines' of availability
Develop and maintain comprehensive security protocols, including network security measures, access controls, vulnerability assessments, and continuous monitoring, to protect infrastructure and data from potential threats and breaches
Collaborate with scientists to assess their computational requirements and deliver tailored resources and support
Create and maintain comprehensive documentation for system configurations, operational procedures, security policies, and end-user guidance through a well-organized Wiki
Qualifications
Required
Bachelor's degree in Computer Science, Information Technology, or a related field
Extensive experience with distributed systems, including cloud platforms (AWS, GCP, or Azure) and/or HPC environments (Slurm, Kubernetes, Grid Engine, Torque, etc.)
Advanced Linux system administration skills, including performance tuning and troubleshooting
Proficiency in bare-metal system provisioning and configuration management (Ansible, Puppet, Chef), virtualization, and/or containerization
Proven ability in scripting languages like Python, Bash, or Perl
Familiarity with network protocols, storage systems, and high-speed interconnects (InfiniBand, RoCE)
Working knowledge of monitoring tools like Nagios, Prometheus/Grafana, or New Relic
Experience developing and maintaining software that interacts with NVIDIA GPUs, including drivers and diagnostic tools (CUDA, nvcc, NCCL, etc.)
Strong understanding of security best practices with hands-on experience in implementing and maintaining security measures
Excellent problem-solving skills and the ability to work under pressure
Strong communication and collaboration skills
Ability to work a hybrid schedule, onsite 3 days per week, in Palo Alto, CA
Benefits
Annual discretionary bonus
Company
Arc Institute
Arc Institute is a biomedical science and research technology company.
H1B Sponsorship
Arc Institute has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Additional information is presented below for
reference. (Data powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship (chart not reproduced)
Trends of Total Sponsorships
2025 (24)
2024 (13)
2023 (3)
Funding
Current Stage
Growth Stage
Recent News
Genetic Engineering News
2026-01-16
Company data provided by Crunchbase