Output Biosciences ยท 1 day ago
Machine Learning Engineer (NYC)
Output Biosciences is a biotechnology company focused on building generative foundational models that decode biological systems. As a Machine Learning Engineer, you will develop AI systems for complex biological reasoning, build foundational models, and engineer data pipelines to manage large biological datasets.
Artificial Intelligence (AI)BiopharmaTherapeutics
Responsibilities
You will build foundational models for biology capable of reading and writing biology at scale
You will develop deep generative models for biological applications, exploring innovative architectures to capture the complexities of multi-scale biological systems
You will work on distributed training systems to scale our models to billions of parameters, optimizing for performance and efficiency across multi-GPU and multi-node setups while handling large-scale biological datasets
You will engineer efficient data pipelines to manage and process massive biological datasets, addressing challenges in data loading, splitting, and memory optimization
You will develop and implement robust evaluation frameworks for complex biological models, ensuring data integrity and preventing leakage across dataset splits
Qualification
Required
You have a Bachelor's in Computer Science, Machine Learning, or a related technical field
You have 3+ years of experience in developing and implementing deep generative learning models
You have experience pre-training models and are proficient in distributed computing environments
You are proficient in Python and have expertise in at least one major deep learning framework (PyTorch, TensorFlow, or JAX)
You have experience with deep learning and generative architectures such as transformers, diffusion models and autoencoders
You are skilled in working with terra-scale datasets and scaling models to billions of parameters
You have a strong understanding of machine learning fundamentals, including various model architectures, optimization techniques, and evaluation metrics
You have experience in designing and implementing efficient data pipelines for processing and managing large datasets
You are experienced in developing robust evaluation frameworks and ensuring data integrity in machine learning projects
You are experienced in code organization, version control, and collaborative software development practices
You have excellent problem-solving skills and the ability to quickly adapt to new challenges
You exhibit a proactive approach to problem-solving, thinking beyond the specific task, taking ownership of challenges, and pride in solving them
You have a mature mindset in ambiguous situations, helping to frame questions and seek clarity while making decisions in the face of uncertainty
You have excellent communication skills and can clearly articulate complex technical concepts
You are motivated by making a real impact and are committed to tackling problems of significant consequence with determination and creativity
Preferred
You have experience applying machine learning to biology or chemistry
You have contributed to open-source machine learning projects or published research papers in the field of AI/ML
You have experience optimizing machine learning models for high-performance computing environments
You are familiar with ML-Ops practices and tools for managing ML experiments and deployments
Benefits
Excellent medical, dental, and vision coverage
Company
Output Biosciences
Output is pioneering Biologically-Aware Generative AI to finally understand complex biological systems and generate breakthrough medicines.