Compatibility Matrix¶
This page summarizes common model + optimization combinations.
For the canonical, code-level list of model IDs recognized by
VideoGenerator.from_pretrained(...), see the registrations in
fastvideo/registry.py (register_configs(...) entries).
Note
The full STA integration in fastvideo/ is archived from main and kept
in sta_do_not_delete:
https://github.com/hao-ai-lab/FastVideo/tree/sta_do_not_delete
We do this because we believe VSA is strictly better than STA for the
actively maintained main inference path.
The symbols used have the following meanings:
- ✅ = Full compatibility
- ❌ = No compatibility
- ⭕ = Does not apply to this model
Models x Optimization¶
The HuggingFace Model ID can be passed directly to
from_pretrained(). FastVideo then uses model-specific default settings for
pipeline initialization and sampling.
| Model Name | HuggingFace Model ID | Resolutions | TeaCache | Sliding Tile Attn (Legacy Branch) | Sage Attn | VSA | BSA |
|---|---|---|---|---|---|---|---|
| FastWan2.1 T2V 1.3B | FastVideo/FastWan2.1-T2V-1.3B-Diffusers |
480P | ⭕ | ⭕ | ⭕ | ✅ | ⭕ |
| FastWan2.2 TI2V 5B Full Attn* | FastVideo/FastWan2.2-TI2V-5B-FullAttn-Diffusers |
720P | ⭕ | ⭕ | ⭕ | ✅ | ⭕ |
| Wan2.2 TI2V 5B | Wan-AI/Wan2.2-TI2V-5B-Diffusers |
720P | ⭕ | ⭕ | ✅ | ⭕ | ⭕ |
| Wan2.2 T2V A14B | Wan-AI/Wan2.2-T2V-A14B-Diffusers |
480P 720P |
❌ | ❌ | ✅ | ⭕ | ⭕ |
| Wan2.2 I2V A14B | Wan-AI/Wan2.2-I2V-A14B-Diffusers |
480P 720P |
❌ | ❌ | ✅ | ⭕ | ⭕ |
| HunyuanVideo | hunyuanvideo-community/HunyuanVideo |
720px1280p 544px960p |
❌ | ✅ | ✅ | ⭕ | ⭕ |
| FastHunyuan | FastVideo/FastHunyuan-diffusers |
720px1280p 544px960p |
❌ | ✅ | ✅ | ⭕ | ⭕ |
| Wan2.1 T2V 1.3B | Wan-AI/Wan2.1-T2V-1.3B-Diffusers |
480P | ✅ | ✅* | ✅ | ⭕ | ⭕ |
| Wan2.1 T2V 14B | Wan-AI/Wan2.1-T2V-14B-Diffusers |
480P, 720P | ✅ | ✅* | ✅ | ⭕ | ⭕ |
| Wan2.1 I2V 480P | Wan-AI/Wan2.1-I2V-14B-480P-Diffusers |
480P | ✅ | ✅* | ✅ | ⭕ | ⭕ |
| Wan2.1 I2V 720P | Wan-AI/Wan2.1-I2V-14B-720P-Diffusers |
720P | ✅ | ✅ | ✅ | ⭕ | ⭕ |
| TurboWan2.1 T2V 1.3B | loayrashid/TurboWan2.1-T2V-1.3B-Diffusers |
480P | ⭕ | ⭕ | ⭕ | ⭕ | ⭕ |
| TurboWan2.1 T2V 14B | loayrashid/TurboWan2.1-T2V-14B-Diffusers |
480P, 720P | ⭕ | ⭕ | ⭕ | ⭕ | ⭕ |
| TurboWan2.2 I2V A14B | loayrashid/TurboWan2.2-I2V-A14B-Diffusers |
480P 720P |
⭕ | ⭕ | ⭕ | ⭕ | ⭕ |
| LongCat T2V 13.6B | See note** | 480P 720P |
❌ | ❌ | ❌ | ⭕ | ✅ |
| Matrix Game 2.0 Base | FastVideo/Matrix-Game-2.0-Base-Diffusers |
352x640 | ⭕ | ⭕ | ⭕ | ⭕ | ⭕ |
| Matrix Game 2.0 GTA | FastVideo/Matrix-Game-2.0-GTA-Diffusers |
352x640 | ⭕ | ⭕ | ⭕ | ⭕ | ⭕ |
| Matrix Game 2.0 TempleRun | FastVideo/Matrix-Game-2.0-TempleRun-Diffusers |
352x640 | ⭕ | ⭕ | ⭕ | ⭕ | ⭕ |
Note: Wan2.2 TI2V 5B has some quality issues when performing I2V generation. We are working on fixing this issue.
Sliding Tile Attn (Legacy Branch) entries refer to the archived
sta_do_not_delete branch workflow, not active main inference wiring.
Canonical Supported IDs¶
The authoritative source for model-ID recognition is
fastvideo/registry.py. If a model ID is registered there, FastVideo can
resolve default pipeline and sampling configuration for it.
Special requirements¶
Sliding Tile Attention¶
- Full STA pipeline usage is on the archived branch: https://github.com/hao-ai-lab/FastVideo/tree/sta_do_not_delete
- STA currently requires Hopper GPUs (H100s).
TurboWan2.1 (TurboDiffusion)¶
- Uses TurboDiffusionPipeline with RCM scheduler for 1-4 step generation
- Requires SLA attention backend:
export FASTVIDEO_ATTENTION_BACKEND=SLA_ATTN - Uses
guidance_scale=1.0(no classifier-free guidance)
Matrix Game 2.0¶
- Image-to-video game world models with keyboard/mouse control input
- Three variants available: Base (universal), GTA, and TempleRun
- Each variant has different keyboard dimensions for control inputs