AI Platform and Solutions Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

Ampere · 5 hours ago

AI Platform and Solutions Engineer

Ampere is a semiconductor design company focused on high-performance, energy efficient, sustainable cloud computing. As an AI Platform & Solutions Engineer, you will build and operate Ampere’s internal AI platform and user-facing tools, delivering secure AI applications and improving operational automation across the company.

Cloud ComputingCloud Data ServicesHardwareSemiconductor
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Own and evolve core components like vector/RAG services, model access, model serving environments (cloud and on-prem where applicable), and underlying compute
Package and deploy services with Docker, primarily to Azure VMs (and adjacent Azure services as needed)
Implement logs/metrics/traces using Azure Monitor/Application Insights, build dashboards, and configure actionable alerts
Manage secrets via Key Vault, use managed identities, and enforce least-privilege access patterns
Build pipelines (GitHub Actions and/or Azure DevOps) for build/test/deploy across environments
Build and extend FastAPI services (endpoints, adapters, background jobs, structured payloads, pagination, and error handling)
Design and maintain MCP-style tools and connectors that link models/agents to enterprise systems like Jira/Confluence, ticketing systems, knowledge bases, NetSuite/manufacturing tools, and internal services
Build practical RAG foundations: chunking, embeddings, retrieval filters/metadata, batch upserts, and index maintenance across vector/search systems
Integrate with LLM APIs (Azure Foundry/Google Vertex AI) including streaming and tool/function calling
Own user-facing web UIs (primarily Svelte/SvelteKit + TypeScript) with strong fundamentals around routing, SSR, state, performance, and accessibility
Implement streaming responses via SSE/WebSockets, resilient chat/task interfaces, incremental rendering, retries, and degraded-mode handling
Build multi-step agents and workflows with tool calls, approvals, progress events, audit trails, and safe failure modes
Ship tools that improve day-to-day work, including internal chat/agent apps, workflow automation, and developer productivity integrations (e.g., VS Code, Open WebUI, external AI tools where appropriate)
Implement Microsoft Entra ID auth for web apps/APIs using OIDC/OAuth2 (MSAL), validate tokens (JWT/JWKS), and enforce scopes/RBAC
Ensure safe enterprise use of LLMs with clear permission boundaries, logging/audit trails, and secure handling of sensitive data
Partner with internal teams (engineering, manufacturing, ops) to gather requirements, translate them into working systems, document architecture and playbooks, and support adoption

Qualification

PythonFastAPIAzureLLM integrationTypeScriptContainerizationCI/CDREST APIsOIDC/OAuth2Technical communicationProblem-solvingTeam collaboration

Required

Bachelors degree & 2 years of related experience; or an advanced degree without experience
Software engineering experience delivering production systems (or equivalent portfolio)
Strong Python proficiency (required) with experience building production services (FastAPI preferred)
Frontend capability: Strong TypeScript/JavaScript fundamentals and experience with a modern framework (Svelte preferred; React/Vue acceptable)
Hands-on experience integrating LLM APIs including streaming and tool/function calling
Working understanding of RAG and vector search fundamentals
Experience deploying and operating applications in cloud environments (Azure strongly preferred)
Practical knowledge of REST APIs and integration patterns
Working understanding of OIDC/OAuth2 and web auth flows; Entra ID experience strongly preferred
Git proficiency (branching, PRs, code review) and comfort in Linux dev environments
Ability to operate independently: take ambiguous problems, ship solutions, and own them through deployment and iteration
Clear technical communication: can explain tradeoffs and designs to both technical and non-technical stakeholders

Preferred

Direct MCP development experience (clients/servers; OAuth 2.1)
Vector DB and search ops: Chroma/Pinecone/Weaviate/Qdrant/Azure AI Search, ingestion pipelines, indexing strategy
Infrastructure as Code: Terraform/Bicep
Strong CI/CD practices: quality gates, smoke tests, environment promotion
Caching/rate limiting/cost controls: Redis, batching, token/cost telemetry
Testing discipline: pytest/Vitest, Playwright, contract testing
On-prem or hybrid infrastructure exposure; model serving environments
Internal developer tooling experience (VS Code extensions, CLIs, internal portals like Open WebUI)
Deeper LLM/RAG evaluation and safety: prompt injection defenses, retrieval evaluation, regression tests, A/B testing, auditability
Exposure to fine-tuning concepts (LoRA/QLoRA) and agent-system failure modes/monitoring
Core ML fundamentals: supervised/unsupervised learning, bias/variance, overfitting, regularization, cross-validation, metrics
Deep learning basics: embeddings, attention/transformers at a conceptual level, training vs inference constraints
LLM-specific mechanics: tokenization, context windows, temperature/top-p, latency/cost tradeoffs, prompt vs fine-tune boundaries
RAG depth: chunking strategies, embedding model selection, retrieval evaluation (recall@k, MRR), rerankers, hybrid search, query rewriting
Evaluation discipline: offline eval harnesses, labeled test sets, prompt regression tests, A/B testing, human eval workflows
Safety/security: prompt injection, data exfiltration paths, least-privilege tool access, sandboxing, output validation
Agentic systems understanding: planning vs execution, tool selection, state management, failure modes, monitoring agent behavior

Benefits

Premium medical insurance, dental insurance, vision insurance, as well as income protection and a 401K retirement plan, so that you can feel secure in your health and financial future.
Unlimited Flextime and 10+ paid holidays so that you can embrace a healthy work-life balance.
A variety of healthy snacks, energizing espresso, and refreshing drinks to keep you fueled and focused throughout the day.

Company

Ampere

twittertwittertwitter
company-logo
A semiconductor design company leading the future of computing with an innovative approach to CPU design focused on high-performance, energy efficient AI compute.

H1B Sponsorship

Ampere has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (52)
2024 (50)
2023 (61)
2022 (56)
2021 (52)
2020 (30)

Funding

Current Stage
Late Stage
Total Funding
$813.8M
Key Investors
OracleThe Carlyle Group
2025-03-20Acquired
2022-04-01Secondary Market· $127.8M
2022-04-01Convertible Note· $300M

Leadership Team

leader-logo
Adam Young
Senior Principal Software Engineer
linkedin
leader-logo
Atiq Bajwa
Chief Architect
linkedin
Company data provided by crunchbase