Principal Solutions Engineering - AI Server/Rack Infrastructure jobs in United States
cer-icon
Apply on Employer Site
company-logo

Advanced Microdevices Pvt. Ltd. (India) · 1 day ago

Principal Solutions Engineering - AI Server/Rack Infrastructure

Advanced Micro Devices, Inc is a company focused on building products that accelerate next-generation computing experiences. They are seeking a Principal Member of Technical Staff to lead system design support, customer engagement, and engineering efforts for their AMD Instinct product line, with responsibilities including architecture optimization, debugging, and cross-functional alignment.

BiopharmaBiotechnologyIndustrialManufacturing
badNo H1Bnote

Responsibilities

System Architecture & Design Support
Solution Optimization: Partner deeply with customers to architect and optimize Rack-Scale AI solution deployments using AMD Instinct GPUs
Design Reviews: Provide support of design reviews for customer platform/rack designs; proactively flag areas for modification to improve quality, performance and competitive advantage
Bring-Up, Debug & Validation
Documentation & Best Practices: Deliver comprehensive technical documentation, best practices, and reference architectures to streamline the adoption and deployment of AMD AI platforms
Hands-on Engineering: Drive hands-on rack, platform, and component-level debug and validation. This includes complex stress testing, issue reproductions, and deep-dive root cause analysis
Issue Resolution: Lead customer issue resolution efforts, gathering diagnostics, managing critical escalations, and driving long-term process improvements to ensure customer success
System Firmware Debug & Deployment : Lead debug efforts for system firmware (BIOS, BMC) during initial bring-up and large-scale deployment phases. Ensure seamless integration between hardware, firmware, and software stacks, and resolve interaction issues in customer environments
End-Customer Debug & Sustaining: Own the technical support interface for end customers, provide high-level engineering for deployed fleets
Cross-Functional Alignment: Represent debug progress, technical insights, and status with clarity and impact at the leadership level, ensuring alignment and accountability across cross-functional teams
Roadmap Influence: Provide regular, detailed technical feedback from the field to directly influence AMD’s software and hardware roadmaps
Future Architecture: Drive future product architecture decisions by leveraging unique insights gained from deep customer execution engagement
Mentorship: Build a culture of ownership, accountability, and technical excellence within the team, while actively mentoring senior engineers and emerging technical leaders

Qualification

System ArchitectureHardware/Firmware DebugCustomer EngagementSystem FirmwareDebugging ToolsLeadershipMentorship

Required

Dynamic and experienced Principal Member of Technical Staff to own system design support, rack-level bring-up, and critical customer engagement for AMD Instinct product line
Act as the technical bridge between AMD's internal system architects, platform development teams, and OEM partners
Influence the design and architecture of AI solutions
Lead hands-on debug and validation efforts at customer locations
Drive engineering, root cause analysis, and influence future roadmaps based on field execution
System Architecture & Design Support
Partner deeply with customers to architect and optimize Rack-Scale AI solution deployments using AMD Instinct GPUs
Provide support of design reviews for customer platform/rack designs; proactively flag areas for modification to improve quality, performance and competitive advantage
Drive hands-on rack, platform, and component-level debug and validation
Lead customer issue resolution efforts, gathering diagnostics, managing critical escalations, and driving long-term process improvements to ensure customer success
Lead debug efforts for system firmware (BIOS, BMC) during initial bring-up and large-scale deployment phases
Ensure seamless integration between hardware, firmware, and software stacks, and resolve interaction issues in customer environments
Own the technical support interface for end customers, provide high-level engineering for deployed fleets
Represent debug progress, technical insights, and status with clarity and impact at the leadership level
Provide regular, detailed technical feedback from the field to directly influence AMD's software and hardware roadmaps
Drive future product architecture decisions by leveraging unique insights gained from deep customer execution engagement
Build a culture of ownership, accountability, and technical excellence within the team, while actively mentoring senior engineers and emerging technical leaders
Bachelors, Masters, or PhD in Electrical Engineering, Computer Engineering, or Computer Science

Preferred

Advanced experience in system architecture, hardware/firmware debug, and customer-facing engineering roles (HPC or AI/ML focus preferred)
Deep understanding of Server/Rack system architecture (x86, GPU, PCIe, Interconnects)
Strong proficiency in System Firmware (BIOS/UEFI, BMC/OpenBMC) debug, update flows, and deployment strategies
Experience with system bring-up and debugging tools (oscilloscopes, logic analyzers, ITP, JTAG)
Knowledge of power delivery, thermal management, and mechanical form factors in datacenter environments
Proven track record of leading technical teams through complex problem-solving scenarios and interacting with executive leadership
Ability to travel to customer, factory and company locations

Benefits

AMD benefits at a glance.

Company

Advanced Microdevices Pvt. Ltd. (India)

twittertwittertwitter
company-logo
Advanced Microdevices (mdi) is a leader in innovative membrane technologies.

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Nalini Kant Gupta
Founder & Managing Director
Company data provided by crunchbase