MarioVGG is a text-to-video model that takes two inputs: the initial game state, given as a single image frame, and an action, given as a text prompt. It then outputs a video depicting that action being carried out in the game.
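The input/output contract described above can be sketched as a small Python interface. This is only an illustration under stated assumptions, not MarioVGG's actual API: the names `GenerationRequest`, `generate_video`, and `placeholder_model`, the nested-list frame representation, the frame count, and the example prompt are all hypothetical, and the placeholder simply repeats the input frame instead of running a real model.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical frame format: a grayscale image as rows of pixel values.
Frame = List[List[int]]


@dataclass(frozen=True)
class GenerationRequest:
    initial_frame: Frame  # single image showing the initial game state
    action_prompt: str    # the action to perform, expressed as text


def generate_video(
    request: GenerationRequest,
    model: Callable[[Frame, str], List[Frame]],
) -> List[Frame]:
    """Pass the frame and prompt to a text-to-video model and return its frames."""
    if not request.action_prompt.strip():
        raise ValueError("action_prompt must not be empty")
    return model(request.initial_frame, request.action_prompt)


def placeholder_model(frame: Frame, prompt: str, num_frames: int = 4) -> List[Frame]:
    """Stand-in for a real model: repeats the input frame and ignores the prompt."""
    return [[row[:] for row in frame] for _ in range(num_frames)]


# Illustrative call; the prompt text is an assumed example.
request = GenerationRequest(
    initial_frame=[[0, 255], [255, 0]],
    action_prompt="Mario jumps",
)
video = generate_video(request, placeholder_model)
```

Passing the model in as a callable keeps the contract (one frame plus one prompt in, a sequence of frames out) separate from any particular implementation, so a real text-to-video backend could be substituted for the placeholder.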