Senior Machine Learning Ops Engineer @ ByteDance | Jobright.ai
JOBSarrow
RecommendedLiked
0
Applied
0
Senior Machine Learning Ops Engineer jobs in Seattle, WAH1B Visa Sponsored Senior Machine Learning Ops Engineer jobs in Seattle, WA
74 applicants
company-logo

ByteDance · 2 days ago

Senior Machine Learning Ops Engineer

Wonder how qualified you are to the job?

ftfMaximize your interview chances
ContentData Mining
check
H1B Sponsorship
check
Comp. & Benefits

Insider Connection @ByteDance

Discover valuable connections within the company who might provide insights and potential referrals, giving your job application an inside edge.

Responsibilities

Responsible for ensuring our ML systems are operating and running efficiently for large model development, training, evaluation, and inference
Responsible for the stability of offline tasks/services in multi-data center, multi-region, and multi-cloud scenarios
Responsible for resource management and planning, cost and budget, including computing and storage resources
Responsible for global system disaster recovery, cluster machine governance, stability of business services, resource utilization improvement, and operation efficiency improvement
Build software tools, products, and systems to monitor and manage the ML infrastructure and services efficiently
Be part of the global team roster that ensures system and business on-call support

Qualification

Find out how your skills align with this job's requirements. If anything seems off, you can easily click on the tags to select or unselect skills to reflect your actual expertise.

Programming (Go/Python/Shell)Linux EnvironmentKubernetesContainersAbstractionWorkflowTechnical DocumentationProblem-SolvingLogical AnalysisDocumentationResponsibilityLearning AbilityCommunicationSelf-DriveTeam SpiritML Distributed SystemsGPU Servers

Required

Bachelor's degree or above, major in computer science, computer engineering or related
Strong proficiency in at least one programming language such as Go/Python/Shell in Linux environment
Strong hands-on experience with Kubernetes and containers skills, and have more than 2 years of relevant operation and maintenance experience
Possess excellent logical analysis ability, able to reasonably abstract and split business logic
Have good documentation principles and habits to be able to write and update workflow and technical documentation as required on time
Possess a strong sense of responsibility, good learning ability, communication ability and self-drive, good team spirit

Preferred

Engaged in the operation and maintenance of large-scale ML distributed systems
Experience in operation and maintenance of GPU servers

Company

ByteDance

company-logo
ByteDance is an internet technology company that operates creative content platforms.

H1B Sponsorship

ByteDance has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Trends of Total Sponsorships
2023 (502)
2022 (518)
2021 (510)
2020 (272)

Funding

Current Stage
Late Stage
Total Funding
$9.51B
Key Investors
G42Tiger Global ManagementGeneral Atlantic
2023-03-15Secondary Market· $100M
2020-12-11Private Equity· $2B
2020-03-30Secondary Market· Undisclosed

Leadership Team

leader-logo
Julie Gao
CFO
linkedin
leader-logo
Ahmed Hany
Principal Sales Engineer
linkedin
Company data provided by crunchbase
logo

Orion

Your AI Copilot