Flowmentum, Inc. · 9 hours ago
ML Systems Engineer – Multimodal & Foundation Models
Flowmentum, Inc. is building multimodal AI systems within a large-scale production ecosystem that combines vision, language, and structured signals. The role involves designing, training, evaluating, and deploying multimodal transformer models in scalable production environments, collaborating closely with product and research stakeholders.
Responsibilities
Architecting and implementing multimodal models in PyTorch
Designing and maintaining robust ML data pipelines
Optimizing training and fine-tuning workflows (SFT, RL-style iteration)
Building scalable inference pipelines for large models (100M+ parameters)
Improving evaluation frameworks and model observability
Translating research ideas into production-grade systems
Qualification
Required
Strong PyTorch model architecture experience
Hands-on experience implementing or modifying transformers
Understanding of attention mechanisms and modern training strategies
Experience building ML training/evaluation pipelines
Experience deploying and serving ML models in distributed/cloud environments
Ability to debug ambiguous model failures in real systems
Experience reading and implementing research papers
Familiarity with supervised fine-tuning (SFT)
Exposure to reinforcement learning approaches (RLHF or similar)
Preferred
Experience with multimodal models (vision + language preferred)
Experience serving large transformer models
Familiarity with distributed inference frameworks
GPU optimization and performance tuning
Experience with multimodal foundation models (e.g., CLIP-style, VLMs)
Company
Flowmentum, Inc.
Experienced well connected Tech Recruiters, connecting top talent in emerging and cutting-edge technology areas.
Funding
Current Stage
Early StageCompany data provided by crunchbase