
Toyota North America · 2 days ago

GenAI Software Developer

Toyota North America is currently hiring a GenAI Software Developer for a temporary assignment. The role involves building and maintaining RESTful APIs, integrating GenAI services, and utilizing AWS infrastructure to support AI solutions.

Manufacturing
Comp. & Benefits
No H1B · U.S. Citizen Only

Responsibilities

Build and maintain RESTful APIs with Python (FastAPI; OpenAI/Bedrock SDKs as clients), containerized and deployed on AWS ECS Fargate
Design clean contracts and versioned APIs; document with OpenAPI/Swagger
Integrate with AWS Bedrock and other GenAI services to enable RAG and knowledge-base queries
Work with vector databases (e.g., Pinecone, Weaviate, OpenSearch/Elasticsearch vector) for semantic search and retrieval
Implement robust API clients for AI endpoints, including auth, throttling, retries, and error handling
Configure API Gateway for secure routing, throttling, authentication/authorization
Use IaC (Terraform or AWS CloudFormation) for ECS/Fargate, API Gateway, IAM, networking
Utilize AWS services: S3, Lambda, OpenSearch/Elasticsearch, CloudWatch, Bedrock
Build CI/CD pipelines (GitHub Actions, Jenkins, or CodePipeline) for automated build/test/deploy; use GitHub/GitLab and artifact repos (e.g., Artifactory)
Write unit, integration, and end-to-end tests with pytest; automate regression tests with QA
Perform load/stress testing; analyze performance and reliability metrics
Implement centralized logging and metrics (CloudWatch, Dynatrace; Elasticsearch/OpenSearch if needed); set up SLIs/SLO-based alerts
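
As an illustration of the "retries, and error handling" responsibility listed above, here is a minimal Python sketch of a retry wrapper using exponential backoff with jitter. It is only a sketch under stated assumptions, not the employer's implementation: `TransientError`, `call_with_retries`, and every default value are hypothetical names and numbers chosen for the example, and a real client would map specific SDK or HTTP errors (such as throttling responses) onto the retryable case.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class TransientError(Exception):
    """Hypothetical marker for retryable failures (e.g. throttling or a 503)."""


def call_with_retries(
    request: Callable[[], T],
    max_attempts: int = 4,
    base_delay: float = 0.5,
    max_delay: float = 8.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `request`, retrying TransientError with capped exponential backoff."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return request()
        except TransientError:
            if attempt == max_attempts:
                raise  # out of attempts: surface the error to the caller
            # Backoff ceiling doubles each attempt, capped at max_delay.
            ceiling = min(max_delay, base_delay * 2 ** (attempt - 1))
            # Full jitter: sleep a random time in [0, ceiling] to spread retries.
            sleep(random.uniform(0, ceiling))
    raise AssertionError("unreachable")
```

The `sleep` parameter is injectable so that tests can record the delays instead of waiting; any non-transient exception propagates immediately without a retry.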

Qualifications

Python · AWS · GenAI · FastAPI · CI/CD · Vector databases · Analytical skills · Collaboration skills · Communication skills

Required

Strong proficiency in Python programming, with practical experience using FastAPI for API development
Expertise in prompt engineering to design, test, and refine prompts for LLMs
Experience building AI agents and conversational AI systems using CAG methodologies
Working knowledge of Retrieval-Augmented Generation (RAG) and its application in AI solutions
Hands-on experience with vector databases such as Pinecone, Weaviate, or similar platforms
Familiarity with scoring and ranking techniques for large language model outputs
Solid understanding of AWS cloud infrastructure components including IAM, Lambda, S3, and EC2
Excellent collaboration skills within agile, cross-functional teams
Strong analytical and problem-solving abilities
Effective communication skills to convey complex AI concepts clearly
Work Authorization: Green Card, US Citizen

Preferred

Preferred years of experience: 3 years

Benefits

Medical
Dental
Vision
401K

Company

Toyota North America

At Toyota, we’re known for making some of the highest quality vehicles on the road. But there is more to our story.

Funding

Current Stage
Late Stage
Total Funding
$4.5M
Key Investors
ARPA-E
2024-12-18 · Grant · $4.5M

Leadership Team

Tetsuo Ogawa
CEO
Brian Kursar
Group Vice President - Head of Enterprise AI
Company data provided by Crunchbase