Applied AI Architect - Austin, TX jobs in United States
cer-icon
Apply on Employer Site
company-logo

Trend Micro · 17 hours ago

Applied AI Architect - Austin, TX

Trend Micro, a global cybersecurity leader, helps make the world safe for exchanging digital information across enterprises, governments, and consumers. They are seeking an Applied AI Architect to lead the technical direction for model architecture selection, fine-tuning, and optimization, translating research into scalable solutions for cybersecurity.

Cloud SecurityCyber SecuritySecurityVirtualization
check
Culture & Values
badNo H1BnoteU.S. Citizen Onlynote

Responsibilities

Drive research-to-production of LLM/SLM systems — from design and fine-tuning to evaluation, deployment, and continual adaptation in enterprise agent workflows
Lead technical choices — determine when to apply context engineering, prompt tuning, continued pretraining, supervised fine-tuning, reasoning fine-tuning, LoRA, or RL
Architect high-performance inference and serving using vLLM, NVIDIA NIM, Triton, CUDA, or other optimized frameworks
Integrate reinforcement learning frameworks (veRL, SkyRL, PyTorch, Ray RLlib) to enhance reasoning, adaptability, and agent feedback loops
Develop and operationalize AI Ops pipelines — build benchmarks and metrics for model evaluation, observability, drift detection, and lifecycle automation
Advance agent interoperability using A2A (Agent-to-Agent) or MCP (Model Context Protocol) for large-scale coordination
Collaborate with cybersecurity researchers to embed threat reasoning, anomaly detection, and defensive logic directly into model behavior
Publish, document, and codify reusable AI blueprints for hybrid (cloud + on-prem) deployments and future research acceleration

Qualification

LLM/SLM production experienceGPU-accelerated inferencePython proficiencyAI Ops toolchainsContainerized AI microservicesResearch-driven mindsetData-oriented approachOwnership mentality

Required

Proven end-to-end experience bringing LLM/SLM research into production — from fine-tuning and inference optimization to evaluation and AI Ops integration
Excellent knowledge of at least one of the following: Deep understanding of data-model-infrastructure trade-offs and optimization under real business constraints
Hands-on experience fine-tuning LLMs using frameworks such as LLaMA Factory, NeMo, and PEFT (e.g., LoRA)
Strong knowledge of GPU-accelerated inference (ex: vLLM, NIM, Triton, CUDA, NCCL, PyTorch/XLA)
Familiarity with AI Ops toolchains (ex: Weights & Biases, MLflow, Ray Serve)
Proficiency in Python, and experience building containerized AI microservices (ex: Docker, Kubernetes, Ray)
3+ years of applied AI/ML research or engineering, including 2+ years in production-scale deployment

Preferred

Demonstrated success in building scalable infrastructure and launching LLM/SLM-based features and agent systems within enterprise platforms
Expertise in quantization, distillation, or GPU profiling to lower inference cost
Clear conceptual understanding of when to fine-tune vs prompt-engineer vs use RLHF — and evidence of having applied each effectively
Familiarity with agentic frameworks (LangChain, AWS Strands, AutoGen, etc)
Deep understanding of A2A/MCP protocols for interoperable multi-agent systems

Benefits

Comprehensive medical, dental and vision insurance
Life insurance
Short & Long Term Disability
Pre-partum, maternity, parental and medical leave
Mental Health Wellness Program
Adoption Assistance
Wellness Incentive
Pet Insurance
401(k) with company match
Paid Time Off
14 Annual Holidays
Tuition Assistance
Employee Resource Groups

Company

Trend Micro

company-logo
Trend Micro is an IT firm that offers cybersecurity solutions like cloud security, endpoint protection, and network threat detection.

Funding

Current Stage
Public Company
Total Funding
unknown
2000-08-17IPO

Leadership Team

leader-logo
Eva Chen
Co-Founder & CEO
leader-logo
Kevin Simzer
Chief Operating Officer
linkedin
Company data provided by crunchbase