LlamaIndex · 1 month ago
Multimodal AI Engineer, Document Understanding
LlamaIndex is redefining document workflows with AI agents, and they are seeking exceptional AI engineers for their core document understanding team. In this role, you will work on machine learning models that enhance document parsing and understanding, impacting thousands of developers and contributing to open-source frameworks.
Artificial Intelligence (AI)Developer APIsEnterprise SoftwareSoftware
Responsibilities
Develop, train, and optimize machine learning models for document structure understanding, table extraction, layout analysis, and multimodal content processing
Build robust data pipelines, evaluation frameworks, and experimentation infrastructure
Design and implement production ML systems that handle complex, real-world documents at scale
Stay current with latest advances in vision-language models, document AI, and multimodal learning
Collaborate with engineering teams to integrate ML innovations into production APIs
Contribute to both our open-source frameworks and enterprise offerings
Drive technical decisions while balancing research exploration with product delivery
Qualification
Required
3-7 years of experience in machine learning engineering or applied research
Strong software engineering fundamentals with production Python experience (modern tooling: uv, ruff, mypy, Pydantic)
Hands-on experience training, fine-tuning, or deploying ML models in production
Deep understanding of modern ML techniques, particularly in computer vision, NLP, or multimodal learning
Experience with at least one of: data pipeline development, model training/fine-tuning, or ML infrastructure
Ability to read and implement from research papers and technical specifications
Track record of executing with high intensity in fast-paced environments
Strong technical communication skills and comfort with open-source collaboration
Preferred
Experience with vision-language models, transformer architectures, or model fine-tuning (LoRA, QLoRA)
Experience building evaluation frameworks, benchmarks, or data quality pipelines
Experience with model serving frameworks (vLLM, TensorRT, ONNX) or MLOps tools
Experience specifically with document understanding, OCR, or layout analysis
Contributions to open-source ML projects or frameworks
Experience with LLM applications and RAG systems
Strong understanding of model optimization techniques (quantization, distillation, pruning)
Experience with Docker/Kubernetes and distributed systems
Active participation in ML research community
Benefits
Competitive base salary and equity compensation
Comprehensive medical/dental/vision coverage for you and your family
Unlimited paid time off policy
Daily catered lunch and snacks in the San Francisco office
Budget for conferences, research materials, and professional development
Access to cutting-edge compute resources and research tools
Company
LlamaIndex
LlamaIndex enables enterprises to build AI Knowledge Assistants, improving data management, utilization, and enterprise intelligence.
H1B Sponsorship
LlamaIndex has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (5)
Funding
Current Stage
Early StageTotal Funding
$29.5MKey Investors
Amazon Web ServicesNorwestGreylock
2025-10-09Non Equity Assistance· $1M
2025-05-01Series Unknown
2025-03-04Series A· $19M
Recent News
llamaindex.ai
2025-12-05
2025-10-15
Company data provided by crunchbase