Apply on Employer Site

Robert Half · 3 weeks ago

AI Engineer – Agentic & Generative AI

Richardson, TX

Full-time

Hybrid

Senior Level

$180K/yr - $200K/yr

5+ years exp

Robert Half is at the forefront of the Generative AI revolution, dedicated to shaping the future of artificial intelligence. They are seeking an AI Engineer to develop, deploy, and scale advanced AI applications that address real-world challenges, focusing on designing and building AI applications with cutting-edge technologies.

ConsultingHuman ResourcesRecruitingSecurity

No H1B

Hiring Manager

Alexandra Brice

Responsibilities

Evaluate and use LLMs and multimodal models from multiple providers (e.g., OpenAI, Google, Anthropic, etc.) for: Conversational assistants, task-based copilots, and AI agents

Summarization, content generation, document understanding, generative analytics

Basic multimodal use cases (text + image, text + document, and soon video/audio)

Design and implement agentic workflows (e.g., tool-calling, multi-step reasoning, multi-agent orchestration) using: LangChain, OpenAI Agents SDK, Google ADK or similar frameworks

Design and optimize prompts and system instructions to: Improve task completion, reliability, and latency

Minimize hallucinations and toxic/unsafe outputs

Implement structured outputs (JSON/JSON Schema)

Develop function/tool calling and prompts that help AI call them properly

Integrate safety/guardrail layers (e.g., content moderation APIs, Guardrails AI, Rebuff, custom policies) to keep conversations focused

Architect and implement RAG pipelines: Choose and configure vector databases (e.g., PGVector, Vertex AI Search, Pinecone, etc.)

Build ingestion pipelines for internal data (documents, tickets, logs, property data, etc.)

Implement knowledge retrieval process that draws from multiple sources and uses reranking to improve the response quality

Explore emerging retrieval techniques (semantic caching, knowledge graphs, long-context models, memory systems)

Build or integrate front-end experiences (React / Vue / Svelte / Web RTC) for AI agents and copilots

Develop back-end services to orchestrate AI calls using REST, gRPC, WebSockets, or MCP; ensure scalability and observability

Integrate with internal systems and data sources using secure APIs and data contracts

Design and maintain evaluation pipelines and benchmarks for LLM-based features: Offline metrics (accuracy, relevance, latency, cost)

Human-in-the-loop evaluations where needed

Use AI observability and tracing tools (e.g., LangSmith, OpenTelemetry, etc.) to monitor quality

Optimize for performance, reliability, latency, and cost through: Model selection and routing (e.g., small vs. large models, Google vs. OpenAI)

Prompt/token optimization and caching strategies

Collaborate with cross-functional teams (Product, Design, Domain Experts, Data Science, Platform Engineering) to define requirements and success metrics

Participate in architecture and design reviews; write clear technical documentation and runbooks

Contribute to shared libraries, templates, and best practices for AI development

Work in an Agile environment and own features from design through deployment and maintenance

Qualification

Generative AIPythonLLM APIsRAG architecturesJavaScript/TypeScriptSQLCloud platformsDockerKubernetesCommunicationCollaboration

Required

5+ years of total experience in Software Engineering and/or Data Science, with at least 2 years focused on Generative AI/LLMs

Degree in Computer Science, Machine Learning, Data Science, or related field, or equivalent practical experience

Strong proficiency in Python for AI/ML, data pipelines, and back-end services

JavaScript/TypeScript for front-end and/or Node services

SQL and experience working with relational databases and basic data modeling

Working with coding assistants like Windsurf, Cursor, Codex, etc

Proven experience building production-grade software: Writing clean, testable, maintainable code

Using CI/CD pipelines, code reviews, and Git workflows

Hands-on experience with at least one agentic/orchestration framework (OpenAI Agents SDK, Google ADK, LangChain, etc.)

LLM APIs and/or open-source models (e.g., via OpenAI, Google, Hugging Face, Ollama)

Vector embeddings, vector databases, and RAG architectures

Experience with one or more major cloud platforms (GCP, Azure, and AWS) and: Docker for containerization

Kubernetes or a managed container service (e.g., EKS, GKE, AKS)

Strong communication skills and ability to collaborate with both technical and non-technical stakeholders

Preferred

Experience with voice-enabled AI agents (STT, TTS, WebRTC, Twilio Voice, Socket.IO, VAPI)

Multimodal models (e.g., GPT models including Realtime, Gemini Pro Vision, etc.)

Orchestrating multiple models (routing, ensembles, fallback strategies)

Familiarity with AI experiment tracking and evaluation frameworks (e.g., OpenAI Evals, Langsmith Evals, etc.)

Feature stores, data versioning (e.g., Feast, DVC), and MLOps workflows

Browser automation software such as PlayWright

Background in AI security, privacy, and compliance (PII handling, SOC2, GDPR considerations)

A/B testing and online experimentation for AI features

Company

Robert Half

Glassdoor3.8

🔒 At Robert Half, we prioritize your security—if you believe you've encountered a scam or fraudulent recruiter, please report it immediately to https://www.roberthalf.com/us/en/fraud-alert.

Founded in 1948

Menlo Park, California, USA

10001+ employees

http://www.roberthalf.com/

Funding

Current Stage

Public Company

Total Funding

unknown

1978-01-13IPO

Leadership Team

Keith Waddell

President and CEO

Paul Gentzkow

President and Chief Executive Officer, Talent Solutions

Recent News

EIN Presswire

U.S. Hospitality Staffing Market Projected To Witness Massive Growth, 2025-2032 | Culinary Staffing, PeopleReady, HSS

2025-05-17

EIN Presswire

Human Resource Outsourcing Market Size Forecasted to Grow at 5.48% CAGR, Reaching USD 446.25 billion by 2034

2025-04-21

EIN Presswire

IT Staffing Market Expected to Reach USD 409.12 Billion, Growing at a CAGR of 5.44% by 2034

2025-04-14

Company data provided by crunchbase