Director of Engineering, AI & ML, Data Collections jobs in United States
cer-icon
Apply on Employer Site
company-logo

PitchBook · 1 day ago

Director of Engineering, AI & ML, Data Collections

PitchBook, a Morningstar company, is seeking a Director of Engineering for their AI & ML Data Collections team. The role involves leading the strategy and execution of AI and ML initiatives to enhance data extraction and validation workflows, managing a global team of data scientists and engineers, and collaborating with various departments to ensure high-quality data pipelines.

AnalyticsFinanceVenture Capital
check
Culture & Values
badNo H1Bnote

Responsibilities

Define and execute the AI & ML strategy for data collection, extraction, and enrichment automation aligned with PitchBook’s long-term data strategy
Partner with senior leadership to identify high-impact opportunities for AI-driven automation and cost reduction in data collection workflows
Establish success metrics and operational KPIs for automation accuracy, throughput, and coverage improvement
Lead, hire, and develop a high-performing global team of data scientists and ML engineers; define team structure, roles, and growth paths that align with organizational goals
Foster a culture of innovation, accountability, inclusion, and continuous improvement across distributed offices
Champion hiring, mentorship, and professional development initiatives to grow internal AI/ML talent
Elevate engineering excellence through code reviews, design reviews, and technical guidance for ML engineers and scientists
Act as a multiplier by shaping best practices for experimentation, model evaluation, responsible AI, and scalable ML engineering
Guide teams across the organization toward cohesive, reusable, and standards-aligned architectures
Collaborate closely with Engineering, Product Management, and Data Operations to ensure the successful operationalization of AI/ML solutions into data pipelines and collection processes
Partner with data quality and enrichment teams to align ML outputs with domain-specific validation frameworks
Serve as a trusted technical and strategic advisor to stakeholders across Product, Engineering, and Data Operations
Oversee the end-to-end lifecycle of ML systems, from research and experimentation to deployment, monitoring, and optimization
Ensure high availability, reliability, and performance of production AI/ML systems
Implement and maintain strong standards of data integrity, security, and compliance in all models
Support the vision and values of the company through role modeling and encouraging desired behaviors
Participate in various company initiatives and projects as requested

Qualification

Machine LearningNatural Language ProcessingData ScienceAI StrategyDocument AIEntity ResolutionCloud-native ArchitectureDistributed ComputingCollaboration SkillsCommunication SkillsLeadership SkillsMentorship

Required

Bachelor's or Master's degree in Computer Science, Mathematics, Data Science, or a related technical discipline
12+ years of experience in machine learning, data science, or AI-focused engineering, including 7+ years leading technical teams
Proven success delivering AI-driven data extraction, enrichment, or document understanding systems at scale
Deep expertise in natural language processing, document AI, OCR, entity resolution, large-scale data automation, optimizing large document workflows, and addressing latency in retrieval-based architectures
Familiarity with agentic AI frameworks (MCP, A2A) and orchestration of multi-agent systems
Strong understanding of modern ML frameworks and infrastructure (e.g., PyTorch, TensorFlow, Hugging Face, LangChain)
Demonstrated ability to define and execute multi-year AI roadmaps with measurable business impact
Strong knowledge of cloud-native architecture, distributed computing, and scalable model deployment
Excellent communication, collaboration, and influencing skills — including experience presenting to executive and cross-functional leadership
A track record of fostering technical excellence and innovation across global, multidisciplinary teams
Must be authorized to work in the United States without the need for visa sponsorship now or in the future

Preferred

Master's degree preferred
Experience managing managers and geographically distributed teams is strongly preferred
Experience in fintech, data platforms, or large-scale information extraction systems preferred
Contributions to the AI/ML research community (e.g., publications, patents, or open-source projects) are strongly preferred

Benefits

Comprehensive health benefits
Additional medical wellness incentives
STD, LTD, AD&D, and life insurance
Paid sabbatical program after four years
Paid family and paternity leave
Annual educational stipend
Ability to apply for tuition reimbursement
CFA exam stipend
Robust training programs on industry and soft skills
Employee assistance program
Generous allotment of vacation days, sick days, and volunteer days
Matching gifts program
Employee resource groups
Subsidized emergency childcare
Dependent Care FSA
Company-wide events
Employee referral bonus program
Quarterly team building events
401k match
Shared ownership employee stock program
Monthly transportation stipend

Company

PitchBook

company-logo
PitchBook offers financial data and tools on companies, deals, investors, and markets to support sales and business development.

Funding

Current Stage
Late Stage
Total Funding
$13.8M
Key Investors
Morningstar
2016-10-14Acquired
2016-01-27Series B· $10M
2009-09-25Series A· $3.8M

Leadership Team

leader-logo
Nizar Tarhuni
Executive Vice President, Research & Market Intelligence
linkedin
leader-logo
Brett Kaluza
Chief Customer Officer
linkedin
Company data provided by crunchbase