imagebind_score
¶
Modules¶
fastvideo.eval.metrics.audio.imagebind_score.metric
¶
ImageBind audio↔video cosine similarity (IB-Score).
Per-sample. Reads sample["video_path"] (or sample["video"].source
for a :class:Video wrapper) and sample["audio"]; ImageBind decodes
its own clips so the path is required, not the pool-decoded tensor.
Classes¶
fastvideo.eval.metrics.audio.imagebind_score.metric.ImageBindScoreMetric
¶
Bases: BaseMetric
ImageBind audio↔video cosine similarity, per-sample.