Wells Fargo · 1 hour ago

Senior Software Engineer, LLM Inferencing and AI Gateway

CONCORD, CA

Full-time

Hybrid

Mid, Senior Level

$100K/yr - $196K/yr

4+ years exp

Wells Fargo is seeking a Senior Software Engineer – LLM Inferencing & AI Gateway to join their Digital Technology – AI Capability Engineering team. The role involves designing, building, and operating the GPU-based GenAI platform and the serving infrastructure for LLM/SLM workloads, focusing on delivering reliable, scalable model endpoints through an API Gateway-based production architecture.

Banking, Financial Services, FinTech, Insurance, Payments

No H1B

Responsibilities

Lead complex Generative AI initiatives and deliverables within technical domain environments

Contribute to large scale planning of strategies

Design, code, test, debug, and document for projects and programs associated with technology domain, including upgrades and deployments

Review moderately complex technical challenges that require an in-depth evaluation of technologies and procedures

Resolve moderately complex issues and lead a team to meet the needs of existing or potential new clients while leveraging a solid understanding of the function, policies, procedures, or compliance requirements

Collaborate and consult with peers, colleagues, and mid-level managers to resolve technical challenges and achieve goals

Lead projects and act as an escalation point, provide guidance and direction to less experienced staff

Engineer GPU clusters and node pools; configure NVLink/NVSwitch, NVIDIA GPU Operator, MIG profiles, container runtime, and kernel/driver baselines for high‑throughput LLM/SLM workloads

Qualifications

GPU Inference, NVIDIA CUDA, Python, GPU orchestration, LLM serving frameworks, Generative AI engineering, Soft skills

Required

4+ years of Software Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education

Preferred

1+ years of experience with GPU Inference including NVIDIA CUDA, cuDNN, NVLink/NVSwitch, MIG, NIXL, GPU profiling, and performance tuning on H100/H200 architectures

1+ years of experience with GPU orchestration platforms, such as RunAI (collections, queues, quotas, preemption, fair-share scheduling), OpenShift AI (RHOAI), and cluster administration on OCP or GKE

1+ years of experience with LLM/SLM serving frameworks, including vLLM, Triton, TensorRT‑LLM/MII, KV‑cache optimization strategies, and FP8/INT4 quantization techniques (AWQ/GPTQ)

1+ years of experience working with LLM API gateways, including OAuth2/mTLS authentication, rate‑limiting and quota management, OpenAPI/SDK integration, SLAs, and versioning/deprecation practices

2+ years of experience in Generative AI engineering, including LLM/SLM operations, fine‑tuning, evaluation pipelines, and developing model‑specific performance optimization recipes

4+ years of experience in Python, including scripting, automation, and model/inference‑related development

Benefits

Health benefits

401(k) Plan

Paid time off

Disability benefits

Life insurance, critical illness insurance, and accident insurance

Parental leave

Critical caregiving leave

Discounts and savings

Commuter benefits

Tuition reimbursement

Scholarships for dependent children

Adoption reimbursement

Company

Wells Fargo

Glassdoor rating: 3.6

Wells Fargo & Company is a financial services firm that provides banking, insurance, investments, and mortgage services.

Founded in 1852

San Francisco, California, USA

10,001+ employees

http://www.wellsfargo.com

Funding

Current Stage

Public Company

Total Funding

unknown

IPO: 1978-10-06

Leadership Team

Charlie Scharf

CEO

Fernando Rivas

CEO of Corporate & Investment Banking

Recent News

The Real Deal

Kolter pays $26M for condemned Miami Beach hotel as redevelopment play

2026-01-23

Morningstar.com

Wells Fargo Currently Down Seven Consecutive Days, on Pace for Longest Losing Streak Since January 2024 — Data Talk

2026-01-22

New York Post

Wells Fargo moves wealth-management unit to Palm Beach, joining Florida rush

2026-01-22

Company data provided by Crunchbase