Machine Learning Engineer (RAGs) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Factored · 1 week ago

Machine Learning Engineer (RAGs)

Factored is a company founded by Andrew Ng and a team of AI experts to address the shortage of qualified AI engineers. They are seeking a skilled Machine Learning Engineer focused on Retrieval-Augmented Generation models to design, develop, and deploy advanced machine learning applications for high-profile clients.

Artificial Intelligence (AI)Machine LearningSoftware
check
H1B Sponsor Likelynote

Responsibilities

Design, develop, and optimize Retrieval-Augmented Generation (RAG) models that integrate retrieval-based and generation-based approaches to solve complex, real-world problems for our high-profile clients
Improve the performance of RAG models through cutting-edge algorithms, innovative techniques, and model fine-tuning
Collaborate with client Data and Engineering teams to establish and build robust machine learning infrastructure to meet project goals
Work closely with leadership teams from our clients to identify and leverage AI/ML opportunities that can provide transformative solutions
Fine-tune and adapt large language models (LLMs) for specific tasks and domains within the RAG framework
Partner with cross-functional client teams to deploy RAG models into production environments, ensuring seamless integration and long-term success
Apply advanced machine learning techniques, including LLMs, to develop effective AI solutions tailored to client needs
Write clean, maintainable, and scalable code, ensuring all development is well-documented and testable
Prioritize user experience and customer needs in all product development efforts
Design and develop frameworks for GenAI products, such as search interfaces, chatbots, and summarization tools
Build and implement machine learning models and algorithms that directly contribute to client growth and success through innovative, AI-driven solutions
Provide technical leadership in identifying and evaluating AI/ML opportunities that empower clients to deliver exceptional solutions

Qualification

Machine Learning ModelsRetrieval-Augmented GenerationPythonNLPDeep LearningLarge Language ModelsCloud PlatformsTechnical LeadershipCollaborationCommunication SkillsProblem Solving

Required

Bachelor's or Master's degree in Computer Science, Statistics, Mathematics, or a related field
5+ years of hands-on experience developing and deploying machine learning models in production environments
4+ years of experience with production NLP and deep learning models using frameworks like PyTorch and TensorFlow
At least 1+ year of experience with Retrieval-Augmented Generation (RAG) and other advanced techniques to optimize model performance
Proven experience writing production-level code, with strong proficiency in Python
Expertise in working with large language models (LLMs) such as GPT, Gemini, and Claude, along with proficiency in LLM frameworks like LangChain
Strong understanding of prompting techniques, and the trade-offs between prompting and fine-tuning
Experience with cloud platforms such as AWS or GCP (AWS preferred), or equivalent on-premise platforms

Company

Factored

twittertwittertwitter
company-logo
Factored helps companies build world-class, rigorously vetted data science, machine-learning, and AI engineering teams.

H1B Sponsorship

Factored has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2023 (2)

Funding

Current Stage
Growth Stage
Total Funding
$2.5M
Key Investors
AI Fund
2020-04-15Pre Seed· $2.5M

Leadership Team

leader-logo
Israel Niezen
Chief Executive Officer
linkedin
Company data provided by crunchbase