Modelvideo1y ago

Pyramid Flow review

An open-source, MIT-licensed text- and image-to-video model that makes 768p, 24fps, 10-second clips — and runs on a single consumer GPU with offloading.

Maker
Peking University · Kuaishou
Launched
Oct 10, 2024
Pricing
open-source
Visit official site
Pyramid Flow — real sample output
A frame from Pyramid Flow's official 768p · 24fps · 10-second text-to-video sample (a snowy Tokyo street). Look at the pedestrians and you can see the model's characteristic warping. Source: pyramid-flow.github.io.
Firstlook

Our verdict

If you want to OWN a capable text-to-video model — weights on disk, MIT-licensed, running on a single consumer GPU — Pyramid Flow is still one of the most practical open options. Go in knowing it's a 2024 model: clips are short, motion warps under scrutiny, and newer open models have since raised the bar.

First look — our read from the docs and sources below; not yet hands-on tested.

Pyramid Flow arrived in October 2024 with an unusually honest pitch: not "the best video model," but the most ownable one. The weights are MIT-licensed and sitting on Hugging Face, the training method is published, and — the part that actually matters for most people — it runs on a single consumer GPU instead of a data-center rack.

That last point is the whole story. Plenty of "open" video models are open the way a supercar is purchasable: technically yes, practically no. Pyramid Flow's team trained it in roughly 20,700 A100-hours and tuned it so that, with CPU offloading, it fits in under 12 GB of VRAM (under 8 GB if you're patient). If you have a recent gaming card, you can generate 768p, 24fps, up-to-10-second clips on your own machine — no API key, no per-second meter running.

Four frames sampled across one Pyramid Flow 10-second clip
Four frames across one official 10-second sample. The camera move and falling snow hold together; the people are where the seams show. Source: pyramid-flow.github.io.

So how good is it? This is a first look — I haven't run my own batch on local hardware yet, so I'm not putting a number on it — but the official samples tell an honest story. Wide, slow, atmospheric shots (the snowy Tokyo street above, fireworks, churning water) look genuinely good. Ask for people moving, fast action, or fine detail, and the 2024-ness shows: limbs warp, faces smear, and ten seconds is about as far as coherence stretches.

Who it's for

If you want to learn how video diffusion actually works, build a pipeline you control, or generate atmospheric B-roll for free, Pyramid Flow is one of the best on-ramps that exists. The license is clean, the hardware bar is genuinely low, and the community has already wired it into ComfyUI.

Who should skip it

If you need broadcast-ready, character-consistent clips today, you'll be happier renting a hosted model like Kling — you give up ownership, but you get a real jump in polish. And if you won't touch Python or ComfyUI, start with the Hugging Face Space demo before committing to a local install.

Eighteen months on, Pyramid Flow is no longer the frontier — newer open models like HunyuanVideo and Mochi push quality higher. But for "a real video model I can keep, on hardware I already own," it remains a remarkably sensible answer.

Provider

Providerhuggingfacerain1011/pyramid-flow-miniflux· MIT

Specs & key facts

What it doesText-to-video + image-to-video[src]
Max video768p · 24 fps · up to 10s (miniFLUX)[src]
Faster preset384p · 24 fps · 5s[src]
Still imageUp to 1024p (miniFLUX)[src]
VariantsminiFLUX (FLUX-based) · SD3-based[src]
VRAM with offloadingUnder 12 GB (CPU offload) · under 8 GB (sequential offload)[src]
Training compute~20,700 A100 GPU-hours[src]
LicenseMIT (commercial use allowed)[src]

Capabilities

Text-to-videoYes
Image-to-videoYes
First-party hosted APINo — self-host
ComfyUI supportCommunity
Commercial useYes (MIT)
Audio / soundNo

How to use it

  1. 1Download the miniFLUX checkpoint from Hugging Face (rain1011/pyramid-flow-miniflux).
  2. 2Install the repo's inference code — generation runs through the PyramidDiTForVideoGeneration class.
  3. 3Run the included Jupyter demo (video_generation_demo.ipynb) or launch the Gradio app (app.py).
  4. 4Tight on VRAM? Enable CPU offloading (under 12 GB) or sequential offload (under 8 GB), trading speed for memory.
  5. 5No GPU at all? Try the official Hugging Face Space demo before installing anything.

Pricing

Open source

Free

MIT-licensed weights on Hugging Face. You only pay for the GPU you run it on — your own card, or rented cloud compute.

There is no first-party hosted API or paid tier. Pyramid Flow is a model you download and run, not a subscription.

Pros & cons

Pros

  • Genuinely open: MIT-licensed, weights on Hugging Face, commercial use allowed.
  • Runs on a single consumer GPU — under 8 GB VRAM with sequential offloading.
  • Does both text-to-video and image-to-video at 768p / 24fps, up to 10 seconds.
  • Trained for a fraction of the usual cost (~20.7k A100-hours), with the method published.

Cons

  • No first-party hosted API — you (or a rented GPU) do the running.
  • It's a 2024 model: motion and fine detail warp, especially on people and fast action.
  • Setup is code-first; the smoothest paths are the HF Space or a community ComfyUI node.
  • Newer open models (and hosted ones like Kling) now look noticeably cleaner.

Alternatives

FAQ

Sources

Sources

  1. 1.Capabilities, resolution/fps/duration, VRAM, how to run, MIT licensehttps://github.com/jy0205/Pyramid-FlowVerified 2026-06-26
  2. 2.miniFLUX checkpoint (text-to-video, FLUX-based), pipeline taghttps://huggingface.co/rain1011/pyramid-flow-minifluxVerified 2026-06-26
  3. 3.SD3-based checkpoint varianthttps://huggingface.co/rain1011/pyramid-flow-sd3Verified 2026-06-26
  4. 4.Pyramidal flow-matching method + ~20.7k A100-hour training figurehttps://arxiv.org/abs/2410.05954Verified 2026-06-26
  5. 5.Affiliation (Peking University + Kuaishou + BUPT); Oct 10 2024 open-source releasehttps://venturebeat.com/ai/new-high-quality-ai-video-generator-pyramid-flow-launches-and-its-fully-open-sourceVerified 2026-06-26
  6. 6.Official generated sample outputshttps://pyramid-flow.github.ioVerified 2026-06-26

More coverage

News & first-looks about this release. Coming soon.
Head-to-head comparisons. Coming soon.