kl_divergence
Modules
fastvideo.eval.metrics.audio.kl_divergence.metric
PaSST KL divergence KL(gt || pred) on AudioSet-527 logits.
A 1:1 port of av_bench.metrics.kl.compute_kl. The primary score is
the softmax variant, which video-to-audio (V2A) papers report as "MKL"
or "KL_PaSST"; the sigmoid variant is exposed in details["kl_sigmoid"].
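To make the two variants concrete, here is a minimal pure-Python sketch of KL(gt || pred) computed from two logit vectors. It is not the ported implementation: the helper names, the `EPS` smoothing constant, and the choice to apply the same sum of `p * log(p / q)` to unnormalized sigmoid probabilities are all assumptions made for illustration. The real metric works on PaSST's 527-class AudioSet logits; the three-class vectors below are toy values.

```python
import math

EPS = 1e-6  # assumed smoothing constant; not taken from the source


def softmax(logits):
    """Normalize logits into a probability distribution over classes."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(logits):
    """Independent per-class probabilities (they do not sum to 1)."""
    out = []
    for x in logits:
        if x >= 0:
            out.append(1.0 / (1.0 + math.exp(-x)))
        else:
            e = math.exp(x)
            out.append(e / (1.0 + e))
    return out


def kl_sum(p, q, eps=EPS):
    """Sum over classes of p * log(p / q); eps guards against log(0)."""
    return sum(
        pi * (math.log(pi + eps) - math.log(qi + eps))
        for pi, qi in zip(p, q)
    )


def kl_softmax(gt_logits, pred_logits):
    """Hypothetical stand-in for the primary (softmax) score."""
    return kl_sum(softmax(gt_logits), softmax(pred_logits))


def kl_sigmoid(gt_logits, pred_logits):
    """Hypothetical stand-in for the details["kl_sigmoid"] variant."""
    return kl_sum(sigmoid(gt_logits), sigmoid(pred_logits))


gt = [2.0, 0.0, -1.0]    # toy ground-truth logits
pred = [0.0, 2.0, -1.0]  # toy logits for the generated audio
print(kl_softmax(gt, pred), kl_sigmoid(gt, pred))
```

Note the argument order: the ground-truth distribution is the first argument of the divergence, so swapping `gt` and `pred` generally gives a different value.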
Classes
fastvideo.eval.metrics.audio.kl_divergence.metric.KLDivergenceMetric
Bases: BaseMetric
PaSST KL divergence KL(gt || pred) on AudioSet-527 logits.
Computed per sample. Each sample must provide sample["audio"] (the
generated audio) and sample["reference_audio"] (the ground truth).
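As a rough illustration of that input contract, the sketch below builds a sample and checks that both required keys are present. Only the two key names come from the text above. The `missing_keys` helper is hypothetical, and the file-path values are placeholders, since the expected value type (path, array, or tensor) is not stated here.

```python
REQUIRED_KEYS = ("audio", "reference_audio")


def missing_keys(sample):
    """Return the required keys that are absent from a sample dict."""
    return [key for key in REQUIRED_KEYS if key not in sample]


# Placeholder values: the real value type is an assumption.
sample = {
    "audio": "generated.wav",               # generated audio
    "reference_audio": "ground_truth.wav",  # ground-truth audio
}

print(missing_keys(sample))
```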