Senior Engineer-AI Inference jobs in United States
cer-icon
Apply on Employer Site
company-logo

Bank of America · 1 day ago

Senior Engineer-AI Inference

Bank of America is committed to helping make financial lives better through the power of every connection. They are seeking a Senior Engineer-AI Inference to lead the engineering approach for complex features in their Gen AI platform, which empowers AI initiatives across various banking sectors.

Asset ManagementBankingFinanceFinancial ServicesFinTech
check
H1B Sponsor Likelynote

Responsibilities

Ensures that the design and engineering approach for complex features are consistent with the larger portfolio solution
Define the technology tool stack for the solution and evaluate and adapt new testing tool/framework/practices for team(s)
Enables team(s)/applications with Continuous Integration/Continuous Development (CI/CD) capabilities and engages with other technical stakeholders pertaining to efficient functioning of CI-CD pipeline
Guides and influences team(s) on design and best practices for high code performance –e.g. pairing, code reviews
Provides end-to-end delivery of complex features, including automation, for either a single team or multiple teams, at the program level
Conducts research, design prototyping and other exploration activities such as evaluating new toolsets and components for release management, CI/CD, and features
Works with stakeholders to establish high-level solution needs and with architects for technical requirements
Collaborate with product teams, data analysts and data scientists to design and build solutions
Design and execute the implementation plans to both move forward strategically, while at the same time ensuring the current technology stack is supporting current needs
Manage multiple priorities, and simultaneously engage with multiple teams worldwide
Be vocal and actively participate in all session with business stakeholders and agile teams
Manage next generation of architectural decision for advanced analytics platform, create strategy, roadmaps, present to tech and non-tech leaders
Coach and mentor team members

Qualification

Model OpsPython developmentAI/ML experienceData source platformsPerformance TuningModel MonitoringTest Driven DevelopmentAutomationInfluenceResult OrientationStakeholder ManagementTechnical Strategy DevelopmentApplication DevelopmentArchitectureBusiness AcumenRisk ManagementSolution DesignAgile PracticesAnalytical ThinkingData ManagementSolution Delivery ProcessCollaboration

Required

Minimum 8 years of relevant experience required
Experience in Model Ops and design, software development with proven effectiveness in delivering technology in fast-paced, demanding, industry driven environment for AI/ML, and advanced analytics
Hands on experience in both Python development on Linux. Strong understanding of modern open-source data science platform architecture for storage & compute separation, interactive development workbenches, containers, and toolsets such as Jupyter, VSCode etc
Experience of data sources and Vector Store platforms such as Redis, Solar, Postgres DB, FAISS, Teradata, Oracle, SQL Server, Hadoop etc
Experienced in using design patterns and following best software engineering practices
An understanding of fundamental algorithms and ability to optimize existing code
Proficient written and verbal communication skills to support and shape the platform and clearly articulate technical designs and concepts; and to communicate effectively with all levels within the organization
Experience with deploying models using vLLM/Triton Inference Server
Performance Tuning those models and deployment to provide higher throughput
Experience with various inference metrics, and related monitoring and observability
Experience with serving multiple tenants/clients with model endpoints with secure boundaries
Experience with Atheization & Authorization, Policy as Code, Systems Integration, and Model Routing
Model Evaluation frameworks to evaluate different models and their tradeoffs between efficiency and metrics
Experience building RAG for various knowledge bases, and document types
Model Monitoring – Ability to collect metrics to measure things like Model Drift, KPIs
Self-starter with the ability to challenge conventions, excellent communication skills
Strong analytical skills which enable ability to problem solve, apply reason, take initiative, use judgment, and perform concurrent tasks
Follows Test Driven Development practices including continual integration and clean code principles

Preferred

Experience developing Gen AI training and Inferencing platform with open-source model, Gen AI Model servicing capabilities, designing RAG frameworks, MCP modules for enterprise data systems

Company

Bank of America

company-logo
Bank of America is a financial institution that offers credit cards, home loans, and auto loan services.

H1B Sponsorship

Bank of America has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (780)
2024 (546)
2023 (590)
2022 (759)
2021 (715)
2020 (931)

Funding

Current Stage
Public Company
Total Funding
$3.59B
Key Investors
Berkshire Hathaway
2025-02-20Post Ipo Debt· $386.79M
2024-11-26Post Ipo Debt· $2B
2020-07-28Post Ipo Equity· $400M

Leadership Team

leader-logo
Charissa Messer
Senior Vice President, Creative Agency Executive (Enterprise Creative Solutions)
linkedin
leader-logo
Rami Salem
SVP Strategic Competitive Intelligence
linkedin
Company data provided by crunchbase