Multimodal AI Engineer, Document Understanding jobs in United States
cer-icon
Apply on Employer Site
company-logo

LlamaIndex · 1 month ago

Multimodal AI Engineer, Document Understanding

LlamaIndex is redefining document workflows with AI agents, and they are seeking exceptional AI engineers for their core document understanding team. In this role, you will work on machine learning models that enhance document parsing and understanding, impacting thousands of developers and contributing to open-source frameworks.

Artificial Intelligence (AI)Developer APIsEnterprise SoftwareSoftware
check
H1B Sponsor Likelynote

Responsibilities

Develop, train, and optimize machine learning models for document structure understanding, table extraction, layout analysis, and multimodal content processing
Build robust data pipelines, evaluation frameworks, and experimentation infrastructure
Design and implement production ML systems that handle complex, real-world documents at scale
Stay current with latest advances in vision-language models, document AI, and multimodal learning
Collaborate with engineering teams to integrate ML innovations into production APIs
Contribute to both our open-source frameworks and enterprise offerings
Drive technical decisions while balancing research exploration with product delivery

Qualification

Machine Learning EngineeringProduction PythonComputer VisionNatural Language ProcessingMultimodal LearningData Pipeline DevelopmentModel Fine-TuningML InfrastructureModel Optimization TechniquesOpen-Source CollaborationFast-Paced ExecutionDocker/KubernetesDistributed SystemsTechnical Communication

Required

3-7 years of experience in machine learning engineering or applied research
Strong software engineering fundamentals with production Python experience (modern tooling: uv, ruff, mypy, Pydantic)
Hands-on experience training, fine-tuning, or deploying ML models in production
Deep understanding of modern ML techniques, particularly in computer vision, NLP, or multimodal learning
Experience with at least one of: data pipeline development, model training/fine-tuning, or ML infrastructure
Ability to read and implement from research papers and technical specifications
Track record of executing with high intensity in fast-paced environments
Strong technical communication skills and comfort with open-source collaboration

Preferred

Experience with vision-language models, transformer architectures, or model fine-tuning (LoRA, QLoRA)
Experience building evaluation frameworks, benchmarks, or data quality pipelines
Experience with model serving frameworks (vLLM, TensorRT, ONNX) or MLOps tools
Experience specifically with document understanding, OCR, or layout analysis
Contributions to open-source ML projects or frameworks
Experience with LLM applications and RAG systems
Strong understanding of model optimization techniques (quantization, distillation, pruning)
Experience with Docker/Kubernetes and distributed systems
Active participation in ML research community

Benefits

Competitive base salary and equity compensation
Comprehensive medical/dental/vision coverage for you and your family
Unlimited paid time off policy
Daily catered lunch and snacks in the San Francisco office
Budget for conferences, research materials, and professional development
Access to cutting-edge compute resources and research tools

Company

LlamaIndex

twittertwittertwitter
company-logo
LlamaIndex enables enterprises to build AI Knowledge Assistants, improving data management, utilization, and enterprise intelligence.

H1B Sponsorship

LlamaIndex has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (5)

Funding

Current Stage
Early Stage
Total Funding
$29.5M
Key Investors
Amazon Web ServicesNorwestGreylock
2025-10-09Non Equity Assistance· $1M
2025-05-01Series Unknown
2025-03-04Series A· $19M

Leadership Team

leader-logo
Jerry Liu
Co-Founder
linkedin
leader-logo
Simon Suo
Co-Founder and CTO
linkedin
Company data provided by crunchbase