World Model / Action Policy Researcher jobs in United States
cer-icon
Apply on Employer Site
company-logo

Medal · 2 days ago

World Model / Action Policy Researcher

Medal is a leading platform for gaming moments, focused on advancing human-like intelligence in machines. The World Model / Action Policy Researcher will conduct cutting-edge research in deep learning and reinforcement learning, particularly in the context of gaming and simulation environments.

GamingOnline GamesVideo GamesVideo Streaming

Responsibilities

5+ years of experience in deep learning research or reinforcement learning, with a focus on embodied agents or simulation environments
Strong foundation in representation learning and generative modeling, particularly using architectures such as diffusion models, VAEs, and transformers applied to video
Experience with world models and predictive control — you understand how to train models that simulate dynamics and plan actions in learned environments
Proficiency in reinforcement learning (RL, model-based RL, or imitation learning) and the ability to design and evaluate policy networks
Programming fluency in Python and deep learning frameworks such as PyTorch
Strong experimental skills — comfort with large-scale training, evaluation pipelines, and managing complex datasets or simulations
Publications or open-source contributions in areas like world modeling, simulation learning, or agent policies are a strong plus
Ownership & scientific rigor: You see ideas through from concept to proof to deployment. You write clean, reproducible code and maintain a high bar for experimental validity
Performance and scaling mindset: You care about how research translates into production systems, with an understanding of compute efficiency, distributed training, and data bottlenecks
Curiosity-driven and result-oriented: You’re excited by open-ended problems, but you also know how to define measurable goals and ship impactful systems
Gaming & simulation passion: Interest in interactive environments, physics-based simulations, or gaming AI. Experience with Unity, Unreal Engine, or custom simulators is a plus

Qualification

Deep learning researchReinforcement learningRepresentation learningGenerative modelingPython programmingPyTorchWorld modelsPredictive controlExperimental skillsGaming passionCuriosity-drivenScientific rigorResult-orientedOwnership

Required

5+ years of experience in deep learning research or reinforcement learning, with a focus on embodied agents or simulation environments
Strong foundation in representation learning and generative modeling, particularly using architectures such as diffusion models, VAEs, and transformers applied to video
Experience with world models and predictive control — you understand how to train models that simulate dynamics and plan actions in learned environments
Proficiency in reinforcement learning (RL, model-based RL, or imitation learning) and the ability to design and evaluate policy networks
Programming fluency in Python and deep learning frameworks such as PyTorch
Strong experimental skills — comfort with large-scale training, evaluation pipelines, and managing complex datasets or simulations
Ownership & scientific rigor: You see ideas through from concept to proof to deployment. You write clean, reproducible code and maintain a high bar for experimental validity
Performance and scaling mindset: You care about how research translates into production systems, with an understanding of compute efficiency, distributed training, and data bottlenecks
Curiosity-driven and result-oriented: You're excited by open-ended problems, but you also know how to define measurable goals and ship impactful systems

Preferred

Publications or open-source contributions in areas like world modeling, simulation learning, or agent policies are a strong plus
Gaming & simulation passion: Interest in interactive environments, physics-based simulations, or gaming AI. Experience with Unity, Unreal Engine, or custom simulators is a plus

Benefits

Competitive salary
Equity options
Comprehensive health insurance
401k

Company

Medal

twittertwittertwitter
company-logo
Medal is a developer of a short-form gaming video clips platform to share digital game clips and highlights with gamers.

Funding

Current Stage
Growth Stage
Total Funding
$85.5M
Key Investors
OMERS VenturesMakers FundHorizons Ventures
2024-07-11Series Unknown· $13M
2021-12-14Series C· $45M
2020-01-01Series B· $15M

Leadership Team

leader-logo
Iggy Harmsen
Product
linkedin
J
Joshua Lipson
Chief Architect
linkedin
Company data provided by crunchbase