Skip to content

joshuayanggithub/coinrun

 
 

Repository files navigation

Open-Oasis — CoinRun World Model

An action-conditioned, video-generating world model trained on the videogame CoinRun (Procgen) gameplay. Adapted from 500M Oasis Minecraft World Model, but scaled down to 64×64 pixel-space generation, without need of interleaved training with VAE.

Given a single prompt frame and a sequence of actions, the model autoregressively generates future frames using DDIM diffusion, acting as a "simulator of the game world" aka World Model.

Architecture

Sample Rollouts

Videos from the largest trained 58 Model:

episode.1777341548.mp4

Scaling Laws

Scaling analysis

With limited compute, I scaled across 5 model sizes from 5M - 58M parameters

Train Loss Val Loss

Credits

About

scaling action-conditioned video generation on the 2d platformer game coinrun

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 96.9%
  • Shell 3.1%