hud (YC W25) ยท 4 months ago
Systems Engineer, Infrastructure
HUD (YC W25) is developing agentic evals for Computer Use Agents (CUAs) that browse the web. They are seeking a systems/full-stack engineer to help build out the technical infrastructure that enables comprehensive CUA testing at scale.
Computer Software
Responsibilities
Build out HUD's existing CUA evaluation framework
Optimize our evaluation infrastructure at scale
Qualification
Required
Experience with AWS, Kubernetes, Docker, Redis, Linux, Python, PostgreSQL
Systems design, performance security, CI/CD management experience
Preferred
Hands-on experience with scalable infrastructure design and implementation
Contributed to large-scale system architecture projects
Built reliable, high-performance distributed systems
Worked with containerized applications and orchestration platforms
Startup experience in early-stage technology companies with ability to work independently in fast-paced environments
Strong communication skills for remote collaboration across time zones
Familiarity with current AI tools and LLM capabilities
Understanding of LLM evaluation frameworks and methodologies
Evidence of rapid learning and adaptability in technical environments
Benefits
Visa Sponsorship : We provide support for relocation and visas for strong full-time candidates to USA or Singapore.
Company
hud (YC W25)
The all-in-one platform for evaluations on computer use and browser use AI agents.
Funding
Current Stage
Early StageCompany data provided by crunchbase