Bank of America · 5 months ago
Software Engineer III -Gen AI Inferencing
Bank of America is at the forefront of innovation in AI, focusing on building the next generation of Gen AI platforms. The role involves designing, building, and operating reusable toolkits for Gen AI capabilities, ensuring software meets functional and compliance requirements while collaborating with various teams to deliver solutions.
Asset ManagementBankingFinanceFinancial ServicesFinTech
Responsibilities
Codes solutions and unit test to deliver a requirement/story per the defined acceptance criteria and compliance requirements
Designs, develops, and modifies architecture components, application interfaces, and solution enablers while ensuring principal architecture integrity is maintained
Mentors other software engineers and coach team on Continuous Integration and Continuous Development (CI-CD) practices and automating tool stack
Executes story refinement, definition of requirements, and estimating work necessary to realize a story through the delivery lifecycle
Performs spike/proof of concept as necessary to mitigate risk or implement new ideas
Automates manual release activities
Designs, develops, and maintains automated test suites (integration, regression, performance)
Utilizes multiple architectural components (across data, application, business) in design and development of client requirements
Manage multiple priorities, and simultaneously engage with multiple teams
Participates in estimating work necessary to realize a story/requirement through the delivery lifecycle
Be vocal and actively participate in all session with business stakeholders and agile teams
Collaborate with product teams, data analysts and data scientists to design and build solutions
Qualification
Required
5+ years OOP in Python/Scala/Java programming experience with expert level development skills
Experience with AI/ML/GenAI Lifecycle Management and Development and its Ecosystem. Hands on experience building frameworks using MLOps, Fine – Tuning techniques, Inference Frameworks
Experience with deploying models using vLLM/Triton Inference Server in containers in production with automation. Performs Continuous Integration and Continuous Development (CI-CD) activities. Performance Tuning those models and deployment to provide higher throughput
Track record of maintaining large scale Python/Unix based systems
Hands on experience and knowledge generative AI RAG process for various use cases, including chunking, embedding, retrieval, reranking and summarization
Hands-on experience in application development in one or more areas MongoDB, Redis, Angular/React Frameworks, Containerization, Building API based application leveraging FAST API services, JWT Integration, API Gateway
Develop efficient utilities, automation frameworks, data science platforms that can be utilized across multiple Data Science teams for AI/ML and GenAI work
Working in large sized teams that collaboratively develop on a shared multi-repo codebase using IDEs (e.g. VS Code rather than Jupyter Notebooks), Continuous Integration (CI), Continuous Deployment (CD) and Continuous Testing
Strong automation, scripting, and Python development skills. Hands-on DevOps experience with one or more of the following enterprise development tools: Version Control (GIT/Bitbucket), Build Orchestration (Jenkins), Code Quality (SonarQube and pytest Unit Testing), Artifact Management (Artifactory) and Deployment (Ansible)
Preferred
Experience building & deploying Gen AI inferencing platform with open-source toolsets, building inferencing & servicing capabilities (AI Gateway, Policy store, Observability) for RAG/ MCP use cases etc
Hands on experience on driving and maintaining a culture of quality, innovation, and experimentation
Research on new tools and capabilities for better UI and UX for advanced analytics platform, quick prototype and demonstrate the features and capabilities, and participate on various user forums
Company
Bank of America
Bank of America is a financial institution that offers credit cards, home loans, and auto loan services.
H1B Sponsorship
Bank of America has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (780)
2024 (546)
2023 (590)
2022 (759)
2021 (715)
2020 (931)
Funding
Current Stage
Public CompanyTotal Funding
$3.59BKey Investors
Berkshire Hathaway
2025-02-20Post Ipo Debt· $386.79M
2024-11-26Post Ipo Debt· $2B
2020-07-28Post Ipo Equity· $400M
Leadership Team
Recent News
2026-01-11
2026-01-11
Company data provided by crunchbase