SIGN IN
Large Model Training Acceleration Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

OCBridge · 1 day ago

Large Model Training Acceleration Engineer

OCBridge is an AI platform engineering group focused on large-scale model training systems and performance acceleration. They are seeking an engineer specializing in large model training acceleration and distributed optimization to improve training efficiency, scalability, and performance for large generative and multimodal models across distributed compute environments.
ConsultingHuman ResourcesRecruiting
check
H1B Sponsor Likelynote

Responsibilities

Optimize large model training pipelines for performance and scalability
Design and improve distributed training systems
Implement and tune data, model, and pipeline parallelism strategies
Benchmark and profile training workloads to identify bottlenecks
Improve GPU utilization and training throughput
Collaborate with infrastructure and research teams on large-scale training systems
Build performance tooling and optimization frameworks for training acceleration

Qualification

Distributed training optimizationDeep learning systemsPythonC++CUDA optimizationPyTorchLarge model toolchainsBenchmarking toolsSoftware engineering skillsTransformer modelsDiffusion modelsMandarin Chinese

Required

Bachelor's, Master's, or PhD in Computer Science, AI, Electrical Engineering, or related field
3–10 years of experience in deep learning systems or large model training
Strong experience with distributed training optimization
Hands-on experience with parallel training methods: Data parallelism, Model parallelism, Pipeline parallelism
Strong software engineering skills in Python and C++
CUDA and GPU performance optimization experience
Experience with deep learning frameworks such as PyTorch
Experience with large model toolchains such as Megatron or DeepSpeed
Familiarity with transformer and diffusion models
Experience with benchmarking and profiling tools

Preferred

Experience in generative AI or computer vision training systems
Experience building large-scale training infrastructure
Experience with high-performance distributed compute environments
Mandarin Chinese proficiency

Benefits

Equity
Additional benefits may be included

Company

OCBridge

twittertwitter
company-logo
OCBridge is a leader in AI-powered recruitment, delivering talent with unmatched speed, accuracy, and scale.

H1B Sponsorship

OCBridge has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
2024 (3)
2023 (6)
2022 (2)
2021 (2)
2020 (2)

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Kirby Deng
Founder and CEO
linkedin
Company data provided by crunchbase