Factored · 1 week ago
Machine Learning Engineer (RAGs)
Factored is a company founded by Andrew Ng and a team of AI experts to address the shortage of qualified AI engineers. They are seeking a skilled Machine Learning Engineer focused on Retrieval-Augmented Generation models to design, develop, and deploy advanced machine learning applications for high-profile clients.
Artificial Intelligence (AI)Machine LearningSoftware
Responsibilities
Design, develop, and optimize Retrieval-Augmented Generation (RAG) models that integrate retrieval-based and generation-based approaches to solve complex, real-world problems for our high-profile clients
Improve the performance of RAG models through cutting-edge algorithms, innovative techniques, and model fine-tuning
Collaborate with client Data and Engineering teams to establish and build robust machine learning infrastructure to meet project goals
Work closely with leadership teams from our clients to identify and leverage AI/ML opportunities that can provide transformative solutions
Fine-tune and adapt large language models (LLMs) for specific tasks and domains within the RAG framework
Partner with cross-functional client teams to deploy RAG models into production environments, ensuring seamless integration and long-term success
Apply advanced machine learning techniques, including LLMs, to develop effective AI solutions tailored to client needs
Write clean, maintainable, and scalable code, ensuring all development is well-documented and testable
Prioritize user experience and customer needs in all product development efforts
Design and develop frameworks for GenAI products, such as search interfaces, chatbots, and summarization tools
Build and implement machine learning models and algorithms that directly contribute to client growth and success through innovative, AI-driven solutions
Provide technical leadership in identifying and evaluating AI/ML opportunities that empower clients to deliver exceptional solutions
Qualification
Required
Bachelor's or Master's degree in Computer Science, Statistics, Mathematics, or a related field
5+ years of hands-on experience developing and deploying machine learning models in production environments
4+ years of experience with production NLP and deep learning models using frameworks like PyTorch and TensorFlow
At least 1+ year of experience with Retrieval-Augmented Generation (RAG) and other advanced techniques to optimize model performance
Proven experience writing production-level code, with strong proficiency in Python
Expertise in working with large language models (LLMs) such as GPT, Gemini, and Claude, along with proficiency in LLM frameworks like LangChain
Strong understanding of prompting techniques, and the trade-offs between prompting and fine-tuning
Experience with cloud platforms such as AWS or GCP (AWS preferred), or equivalent on-premise platforms
Company
Factored
Factored helps companies build world-class, rigorously vetted data science, machine-learning, and AI engineering teams.
H1B Sponsorship
Factored has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2023 (2)
Funding
Current Stage
Growth StageTotal Funding
$2.5MKey Investors
AI Fund
2020-04-15Pre Seed· $2.5M
Recent News
2025-11-05
The Fintech Times
2024-01-26
Company data provided by crunchbase