GVL Online Demo (0-shot)

GVL Live Demo

Upload a video, enter your Gemini API key and a task description, shuffle the frames, and then click "Get Response" to analyze them.

After receiving the response, click "Parse Response" to see the predicted task completion percentage for each frame. You can then toggle back to ground-truth (GT) order to examine the predicted value function as well as the captions.

Or click one of the examples below to try:

Folding Dress Example
Glass on Rack Example
Green numbers show ground truth frame order. Red numbers show shuffled frame order.
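
The shuffle, parse, and return-to-GT-order steps described above can be sketched roughly as follows. This is a minimal illustration, not the demo's actual implementation: the function names (`shuffle_frames`, `parse_percentages`, `to_gt_order`) are hypothetical, and the "Frame k: ... NN%" response format is an assumption about what the model returns.

```python
import random
import re


def shuffle_frames(num_frames, seed=0):
    """Return a permutation: shuffled position i shows GT frame order[i] (0-based)."""
    order = list(range(num_frames))
    random.Random(seed).shuffle(order)
    return order


def parse_percentages(response_text):
    """Extract predictions from lines like 'Frame 3: <caption> 40%'.

    The line format is an assumption; adapt the pattern to the real response.
    Keys are the 1-based frame numbers as shown to the model (shuffled order).
    """
    pattern = re.compile(r"Frame\s+(\d+)\s*:.*?(\d+(?:\.\d+)?)\s*%")
    return {int(k): float(v) for k, v in pattern.findall(response_text)}


def to_gt_order(order, shuffled_preds):
    """Map predictions keyed by shuffled position back to GT frame order."""
    values = [None] * len(order)
    for pos, gt_index in enumerate(order):
        # Shuffled positions are 1-based in the response, GT indices 0-based here.
        values[gt_index] = shuffled_preds.get(pos + 1)
    return values
```

Given the permutation used for shuffling, `to_gt_order(order, parse_percentages(text))` yields the predicted completion percentages in the original frame order, which is the sequence to inspect as the value function.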