Quadric · 1 month ago
AI Inference Engineer
Quadric is an innovative company specializing in neural processing unit architecture. The AI Inference Engineer will bridge AI/LLM models with Quadric's platforms, focusing on model porting, optimization for efficient inference, and performance benchmarking.
Computer · Hardware · Semiconductor
Responsibilities
Quantize, prune and convert models for deployment
Port models to the Quadric platform using the Quadric toolchain
Optimize inference deployment for latency and speed
Benchmark and profile model performance and accuracy
Develop tools to scale and speed up deployment
Improve the SDK and runtime
Provide technical support and documentation to customers and the developer community
Qualifications
Required
Bachelor's or Master's in Computer Science and/or Electrical Engineering
5+ years of experience in AI/LLM model inference and deployment frameworks/tools
Experience with model quantization (PTQ, QAT) and tools
Experience with model accuracy measures
Experience with model inference performance profiling
Experience with at least one of the following frameworks: ONNX Runtime, PyTorch, vLLM, Hugging Face Transformers, neural-compressor, llama.cpp
Proficiency in C/C++ and Python
Demonstrated strong problem-solving, debugging, and communication skills
Benefits
Health Care Plan (Medical, Dental & Vision)
Retirement Plan (401k, IRA)
Life Insurance (Basic, Voluntary & AD&D)
Paid Time Off (Vacation, Sick & Public Holidays)
Family Leave (Maternity, Paternity)
Short Term & Long Term Disability
Training & Development
Work From Home
Free Food & Snacks
Stock Option Plan
Company
Quadric
Quadric is a semiconductor IP licensor. Quadric's GPNPU processor blends machine learning inference performance with DSP programmability.
H1B Sponsorship
Quadric has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for your reference. (Data powered by the US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship (chart; the highlighted field is similar to this job)
Trends of Total Sponsorships: 2025 (1)
Funding
Current Stage: Growth Stage
Total Funding: $43.75M
Key Investors: Denso, NSITEXE, Pear VC
2022-12-15 · Series B · $5.5M
2022-12-15 · Debt Financing
2022-03-16 · Series B · $21M
Company data provided by Crunchbase