Advanced Microdevices Pvt. Ltd. (India) · 9 hours ago
AI Models MAD - Model Automation and Dashboarding
Advanced Micro Devices, Inc. is a leading company focused on accelerating next-generation computing experiences. They are seeking a skilled software engineer to join their Model Automation and Dashboarding team, where the role involves building tools and infrastructure for automating the validation of AI models on AMD hardware, ensuring reliability and performance.
BiotechnologyIndustrialPharmaceuticalManufacturingBiopharma
Responsibilities
Automate functional and performance testing of AI models across ROCm-supported hardware using scalable tools and pipelines
Proficiency in Python and C++ with deep experience in performance tuning, debugging, and robust test design, ensuring reliable, maintainable, high-performance codebases
Develop tools for continuous benchmarking and regression tracking across hardware generations and ROCm releases
Build and maintain real-time dashboards that report relevant performance, accuracy, and reliability metrics for both internal and public users
Collaborate with teams like Deep Learning Models (DLM) and MADengine to support a wide range of models, including public and private/NDA workloads
Ensure out-of-box confidence for ROCm clients by validating model performance and functionality in standardized and reproducible environments
Contribute to the design of portable, easy-to-use Python interfaces that support multi-node profiling, distributed workloads, and containerized deployments
Support public-facing MAD GitHub repositories and Docker releases, enabling the community to run and validate models on ROCm
Qualification
Required
Strong technical expertise in Python and Linux-based systems
Passionate about quality assurance, benchmarking, and automation in the AI/ML space
Excellent problem-solving skills
Takes ownership in defining goals and delivering impactful solutions
Proficiency in Python and C++
Deep experience in performance tuning, debugging, and robust test design
Experience working with machine learning frameworks
Experience with performance dashboards or automation platforms
Automate functional and performance testing of AI models across ROCm-supported hardware using scalable tools and pipelines
Develop tools for continuous benchmarking and regression tracking across hardware generations and ROCm releases
Build and maintain real-time dashboards that report relevant performance, accuracy, and reliability metrics for both internal and public users
Collaborate with teams to support a wide range of models, including public and private/NDA workloads
Ensure out-of-box confidence for ROCm clients by validating model performance and functionality in standardized and reproducible environments
Contribute to the design of portable, easy-to-use Python interfaces that support multi-node profiling, distributed workloads, and containerized deployments
Support public-facing MAD GitHub repositories and Docker releases
Preferred
Strong Python development skills, with experience in test automation, CI/CD, and Linux scripting
Familiarity with AI frameworks (e.g., PyTorch, TensorFlow), model benchmarking, and ML model lifecycles
Strong experience with profiling tools, system monitoring, or regression tracking systems for deep learning models
Solid experience in performance dashboards, visualization tools (e.g., Grafana, Plotly), and metrics collection pipelines
Proficiency with version control (GitHub), testing strategies, code reviews, and collaborative software development
Strong written and verbal communication skills with a proactive approach to defining and driving development efforts
Benefits
AMD benefits at a glance.
Company
Advanced Microdevices Pvt. Ltd. (India)
Advanced Microdevices (mdi) is a leader in innovative membrane technologies.
Funding
Current Stage
Late StageLeadership Team
Nalini Kant Gupta
Founder & Managing Director
Recent News
2024-10-18
2024-10-16
Company data provided by crunchbase