Staff AI Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

SoFi · 1 day ago

Staff AI Engineer

SoFi is a next-generation financial services company and national bank focused on changing personal finance through innovative technology. The Staff AI Engineer will play a critical role in setting the technical direction for AI initiatives, driving execution, and mentoring engineers to ensure successful delivery of AI-based solutions for risk management and compliance.

CreditCredit CardsFinancial ServicesFinTechLendingWealth Management
check
H1B Sponsor Likelynote

Responsibilities

Define the long-term technical architecture and strategy for our next-generation AI platform, particularly focusing on robust, scalable agentic frameworks and LLM deployment patterns
Architect and standardize the use of graph-based LLM orchestration, leveraging expert-level mastery of LangGraph to solve highly complex, multi-stage reasoning problems at scale
Develop robust, persistent infrastructure for agentic state management, ensuring that long-running agent workflows maintain context and reliability across distributed nodes and regional failovers
Pioneer and institutionalize advanced parameter-efficient fine-tuning (PEFT) and compression techniques to maximize model performance and minimize operational costs across the organization
Support the development of a unified model serving platform designed to host internally fine-tuned and custom-trained models to ensure high-throughput, low-latency inference across diverse hardware footprints
Define and enforce high standards for AI operationalization, requiring mastery in designing and deploying comprehensive AI observability solutions and advanced tracing/testing frameworks that guarantee production quality, compliance, and reliability
Mentor senior and junior AI Engineers, elevating the overall engineering quality
Coordinate with cross-functional teams to distill specific requirements, project roadmaps, and ensure accurate and on-time project deliveries
Stay up-to-date with the latest trends and advancements in GenAI, LLMs, and NLP, evaluating and experimenting with new techniques and tools to push the boundaries of AI innovation in the banking sector

Qualification

LangGraphLarge Language ModelsAI observability solutionsPythonDeep Model OptimizationCloud platformsContainerization technologiesAI-powered APIsAnalytical skillsMentorshipCollaboration skills

Required

Bachelor's or Master's degree in Computer Science, Data Science, AI, Machine Learning, or a related field. PhD is a plus
8+ years software development experience, with 3+ years of hands-on experience in developing and successfully deploying production-level AI applications that have been used by real customers or internal stakeholders
Expert-level experience with LangGraph to model and orchestrate complex, stateful multi-step reasoning and control flow in LLM applications
Expert-level proficiency in developing sophisticated agentic solutions, with a portfolio demonstrating advanced use of planning, memory management, tool integration, and control flow
Deep understanding of Large Language Model (LLM) architectures, prompt engineering, retrieval-augmented generation (RAG), and advanced text generation techniques
Proven experience implementing parameter-efficient fine-tuning (PEFT) techniques (e.g., LoRA) to customize and optimize pre-trained models for specific tasks with minimal computational overhead
Deep expertise in building or extending inference engines (e.g., vLLM, NVIDIA Triton, or TGI) and managing the underlying Kubernetes/GPU orchestration for custom model deployments
Deep experience designing and institutionalizing AI observability solutions (e.g., LangSmith, Arize, Deepchecks) and advanced tracing and testing methodologies for LLM and agentic systems
Experience with cloud platforms (AWS, Azure, or GCP) and containerization technologies (Docker, Kubernetes)
Expert level Python is required
Experience with large-scale data handling, including unstructured and structured data pipelines, with a strong preference for Snowflake and DynamoDB
Experience developing and integrating AI-powered APIs and microservices architecture into banking applications
Experience with vector databases and retrieval-augmented generation (RAG) techniques using systems like Elasticsearch, Pinecone, or FAISS for enhancing LLM performance
Exceptional ability to communicate complex technical concepts, drive consensus among senior technical leaders, and influence organizational AI strategy
Strong analytical and problem-solving skills with attention to detail and an ability to work with complex, large-scale systems
Strong collaboration skills, with experience working in agile, cross-functional teams

Preferred

React is strongly preferred
Familiarity with regulatory frameworks and ethical considerations in AI within the banking industry (e.g., GDPR, data privacy, model explainability)
Experience in banking or financial services use cases such as conversational AI for customer service, intelligent document processing for loan applications, fraud detection, or risk analysis

Company

SoFi is a finance company that offers a range of lending and wealth management services.

H1B Sponsorship

SoFi has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (213)
2024 (117)
2023 (131)
2022 (118)
2021 (81)
2020 (42)

Funding

Current Stage
Public Company
Total Funding
$12.25B
Key Investors
Fortress Investment GroupPGIMQatar Investment Authority
2025-12-04Post Ipo Equity· $1.5B
2025-07-29Post Ipo Equity· $1.5B
2025-04-17Post Ipo Debt· $3.2B

Leadership Team

leader-logo
Anthony Noto
CEO
linkedin
leader-logo
Jeremy Rishel
Chief Technology Officer
linkedin
Company data provided by crunchbase