Intro to WorldJen

WorldJen is a unified platform for evaluating video and world models. It helps researchers and builders run accurate, comparable benchmarks and see where a model excels and where it needs improvement.

What it does

  • Connect your GPU — Install a lightweight runner on your own machine or cloud instance. Your hardware stays under your control.
  • Pick models and dimensions — Connect models via Hugging Face (and other sources). Choose exactly which evaluation dimensions matter for your use case.
  • Get actionable results — See per-dimension scores, video-level breakdowns, and exports so you can act on the data.

Who it's for

WorldJen is built for teams working on video generation and world models who want:

  • Consistent, repeatable evaluations across runs and models
  • Clear metrics (motion quality, physics, prompt adherence, etc.) instead of a single opaque score
  • The ability to run evaluations on their own infrastructure

How it fits together

You create runs in the app by selecting a runner (your GPU worker), a model, and the dimensions to evaluate. The runner generates videos, and each video is then scored along the chosen dimensions. Results appear on the run page: summary stats, per-video scores, and optional exports (e.g. CSV).
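
To make the shape of a run's results concrete, here is a minimal Python sketch of how per-video scores could roll up into per-dimension summary stats and a CSV export. It is an illustration only, not WorldJen's actual API or export format: the names `VideoResult`, `summarize`, and `to_csv`, the dimension names, and the use of a plain mean as the summary statistic are all assumptions.

```python
import csv
import io
from dataclasses import dataclass
from statistics import mean


@dataclass
class VideoResult:
    """Scores for one generated video, keyed by dimension name (hypothetical shape)."""
    video_id: str
    scores: dict[str, float]


def summarize(results: list[VideoResult], dimensions: list[str]) -> dict[str, float]:
    """Average each chosen dimension across all videos in a run.

    A plain mean is an assumption; the real summary stats may differ.
    """
    return {dim: mean(r.scores[dim] for r in results) for dim in dimensions}


def to_csv(results: list[VideoResult], dimensions: list[str]) -> str:
    """Render per-video scores as CSV text, one row per video."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["video_id", *dimensions])
    for r in results:
        writer.writerow([r.video_id, *(r.scores[d] for d in dimensions)])
    return buf.getvalue()


if __name__ == "__main__":
    # Made-up scores for two videos, purely for illustration.
    dims = ["motion_quality", "physics"]
    run = [
        VideoResult("v1", {"motion_quality": 0.5, "physics": 0.25}),
        VideoResult("v2", {"motion_quality": 1.0, "physics": 0.75}),
    ]
    print(summarize(run, dims))
    print(to_csv(run, dims))
```

Keeping one score per dimension per video, rather than a single combined number, is what allows both the per-dimension summary and the video-level breakdown to be derived from the same data.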