Intro to WorldJen

WorldJen is a unified platform for evaluating video and world models. It helps researchers and builders run accurate, comparable benchmarks and see where their models excel and where they need improvement.

What it does

Connect your GPU — Install a lightweight runner on your own machine or cloud instance. Your hardware stays under your control.
Pick models and dimensions — Connect models via Hugging Face (and other sources). Choose exactly which evaluation dimensions matter for your use case.
Get actionable results — See per-dimension scores, video-level breakdowns, and exports so you can act on the data.

Who it's for

WorldJen is built for teams working on video generation and world models who want:

Consistent, repeatable evaluations across runs and models
Clear metrics (motion quality, physics, prompt adherence, etc.) instead of a single opaque score
The ability to run evaluations on their own infrastructure

How it fits together

You create runs in the app by selecting a runner (your GPU worker), a model, and the dimensions to evaluate. The runner generates videos, and each video is then scored along the chosen dimensions. Results appear on the run page: summary stats, per-video scores, and optional exports (e.g. CSV).
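
The flow above can be pictured with a small Python sketch. It is only an illustrative model of what a run's results might look like, not WorldJen's actual API or export format: the VideoScore class, the summarize and to_csv helpers, the dimension names, and the score values are all hypothetical.

```python
from __future__ import annotations

import csv
import io
from dataclasses import dataclass
from statistics import mean


@dataclass
class VideoScore:
    """Scores for one generated video (hypothetical structure)."""
    video_id: str
    scores: dict[str, float]  # dimension name -> score


def summarize(videos: list[VideoScore], dimensions: list[str]) -> dict[str, float]:
    """Summary stats: average each chosen dimension across the run's videos."""
    return {dim: mean(v.scores[dim] for v in videos) for dim in dimensions}


def to_csv(videos: list[VideoScore], dimensions: list[str]) -> str:
    """Per-video export: one row per video, one column per dimension."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["video_id", *dimensions])
    for v in videos:
        writer.writerow([v.video_id, *(v.scores[d] for d in dimensions)])
    return buf.getvalue()


# Made-up example data: two videos scored on two dimensions.
dimensions = ["motion_quality", "physics"]
videos = [
    VideoScore("vid_001", {"motion_quality": 0.5, "physics": 0.25}),
    VideoScore("vid_002", {"motion_quality": 1.0, "physics": 0.75}),
]

print(summarize(videos, dimensions))
print(to_csv(videos, dimensions))
```

Keeping one score per dimension for every video, rather than a single aggregate, is what makes both the per-dimension summary and the per-video breakdown possible from the same data.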
