Thinking Machines Lab · 1 month ago
Infrastructure Engineer, Security
Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. They are seeking an infrastructure engineer to own and evolve the security infrastructure that underpins their foundation models, ensuring systems are secure, reliable, and scalable while partnering with research and product teams.
Artificial Intelligence (AI), Foundational AI, Generative AI, Information Technology, Product Research, Software
Responsibilities
Architect security patterns for platforms and services, including network segmentation, service-to-service authentication, RBAC, and policy enforcement in Kubernetes and cloud environments
Manage identity, access, and secrets for humans and services: workload and cross-cloud identity, least-privilege IAM, and secrets management
Build secure platforms for data ingestion, processing, and curation: classification, encryption, access controls, and safe sharing patterns across teams
Write threat models and review designs with researchers and engineers to help them ship features and experiments in a safe, scalable way
Automate security checks and build guardrails: policy-as-code, secure infrastructure baselines, validation in CI/CD, and tools that make the secure path the easiest one
Qualifications
Required
Bachelor's degree in engineering or a similar field, or equivalent experience
Strong background with containers and orchestration (e.g., Kubernetes) and how to secure them (namespaces, network policies, pod security, admission controls, etc.)
Practical experience with Infrastructure as Code (Terraform or similar), including secure patterns for provisioning networks, IAM, and shared services
Solid understanding of cloud networking and security: VPCs, load balancers, service discovery, mTLS, firewalls, and zero-trust-style architectures
Proficiency with a systems language such as Rust and scripting in Python for building platform components and internal tools
Evidence of owning complex, production-critical systems, including debugging issues that span infra, security, and application layers
Preferred
Experience with ML infrastructure, GPU clusters, or large-scale training environments (schedulers, job queues, shared storage, multi-tenant clusters)
Background in AI labs, HPC environments, or ML-heavy organizations where both security and performance are first-class concerns
Experience profiling and tuning high-throughput systems, and an ability to reason about the cost of additional security layers
Talks, blogs, or publications on infrastructure security, distributed systems, or performance engineering
Open-source contributions to security, orchestration, observability, or infrastructure tooling
Familiarity with securing specialized hardware (GPUs, TPUs) and their integrations into training and inference pipelines
Benefits
Generous health, dental, and vision benefits
Unlimited PTO
Paid parental leave
Relocation support as needed
Company
Thinking Machines Lab
Thinking Machines Lab is an AI research and product company that aims to increase understanding and customization of AI systems.
H1B Sponsorship
Thinking Machines Lab has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is presented below for your reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships: 2025 (9)
Funding
Current Stage: Early Stage
Total Funding: $2.01B
Key Investors: Andreessen Horowitz, Ministry of Economy, Culture and Innovation
2025-06-20: Seed · $2B
2025-05-05: Grant · $9.98M
Recent News
Morningstar.com · 2026-01-11
Business Insider · 2026-01-06
Crunchbase News · 2026-01-02
Company data provided by Crunchbase