AMD · 18 hours ago
Principal Engineer Inference Stack
AMD is a company focused on building innovative products that accelerate next-generation computing experiences. They are seeking a strategic software engineering lead to improve the performance of key applications and benchmarks, working with a talented team and the latest technology.
AI InfrastructureArtificial Intelligence (AI)Cloud ComputingComputerEmbedded SystemsGPUHardwareSemiconductor
Responsibilities
Develop techniques for optimizing scale-up and scale-out inference
Develop methods and tooling to utilize dynamic resources in service of inference
Support proliferation of rocm ecosystem
Qualification
Required
Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent
Preferred
Expertise in the K8s ecosystem, especially as it pertains to large scale inference
Operational experience with atleast one of sglang, or vllm and with kserve, llm-d. Experience running inference as a service can be substituted in-lieu of experience with frameworks such as kserve or llm-d
Expertise with techniques used to optimize inference like distributed kv-cache, disaggregation, request scheduling etc
Ability to write high quality code with a keen attention to detail. Preferred languages are go and python
Experience with modern concurrent programming
Effective communicator with keen attention to detail
Prior experience roadmapping deeply technical areas is highly valuable
Benefits
AMD benefits at a glance.
Company
AMD
Advanced Micro Devices is a semiconductor company that designs and develops graphics units, processors, and media solutions.
Funding
Current Stage
Public CompanyTotal Funding
unknownKey Investors
OpenAIDaniel Loeb
2025-10-06Post Ipo Equity
2023-03-02Post Ipo Equity
2021-06-29Post Ipo Equity
Recent News
PCMag.com - Technology Product Reviews, News, Prices & Tips
2026-01-23
2026-01-23
Company data provided by crunchbase