stable_audio
Config for the Stable Audio Open 1.0 DiT.
Note: the Stable Audio (SA) pipeline bypasses the standard ComposedPipelineBase
component loader because the published HF repo ships a single monolithic
model.safetensors, with no Diffusers-style model_index.json or
per-subfolder layout. The arch fields and param_names_mapping here
document the architecture and key remap so that the conventions used by
the rest of the DiT family (FSDP shard conditions, supported
attention backends, future loader integrations) also apply to SA. They are
not currently consumed by fastvideo/models/loader/fsdp_load.py for SA.
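
As a rough illustration of what a key remap of this kind does, the sketch below applies a regex-based mapping to checkpoint key names. It is only a sketch: EXAMPLE_MAPPING, remap_keys, and the key names are hypothetical and chosen for illustration; they are not the actual Stable Audio mapping or a FastVideo API.

```python
import re

# Hypothetical remap table: regex pattern -> replacement.
# These patterns are illustrative only, not the real SA param_names_mapping.
EXAMPLE_MAPPING = {
    r"^model\.model\.(.*)$": r"\1",
    r"^(.*)\.to_qkv\.(weight|bias)$": r"\1.qkv_proj.\2",
}


def remap_keys(keys, mapping):
    """Return {old_key: new_key}, applying every pattern in insertion order."""
    remapped = {}
    for key in keys:
        new_key = key
        for pattern, repl in mapping.items():
            # re.sub leaves the key unchanged when the pattern does not match.
            new_key = re.sub(pattern, repl, new_key)
        remapped[key] = new_key
    return remapped


# Made-up checkpoint keys standing in for entries of a monolithic state dict.
keys = [
    "model.model.transformer.layers.0.self_attn.to_qkv.weight",
    "conditioner.embedder.weight",
]
result = remap_keys(keys, EXAMPLE_MAPPING)
for old, new in result.items():
    print(f"{old} -> {new}")
```

Keys that match no pattern pass through unchanged, so a table like this can be checked against a checkpoint's key list before any loader consumes it.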