VBench Subject Consistency — DINO ViT-B/16 temporal feature similarity.
Measures how well the main subject maintains its appearance throughout
the video. DINO features are extracted for every frame, and each frame
is compared by cosine similarity with both its preceding frame and the
first frame.
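
The scoring step described above can be sketched as follows. This is a minimal illustration, not this module's actual code: the function name `subject_consistency` is hypothetical, the per-frame DINO features are assumed to be precomputed and passed in as a NumPy array, and both the clamping of negative similarities to zero and the equal-weight averaging of the two similarities are assumptions about how the per-frame values are combined.

```python
import numpy as np


def subject_consistency(features: np.ndarray) -> float:
    """Score temporal consistency from per-frame embeddings.

    `features` is a (num_frames, dim) array of precomputed DINO
    features (hypothetical input; feature extraction is not shown).
    """
    if features.ndim != 2 or features.shape[0] < 2:
        raise ValueError("expected a (num_frames, dim) array with >= 2 frames")

    # L2-normalise rows so that a dot product equals cosine similarity.
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.clip(norms, 1e-12, None)

    # Similarity of each frame (from the second onward) to its predecessor.
    sim_prev = np.sum(unit[1:] * unit[:-1], axis=1)
    # Similarity of each frame (from the second onward) to the first frame.
    sim_first = unit[1:] @ unit[0]

    # Assumption: negative similarities are clamped to zero, and the two
    # similarities are averaged with equal weight per frame.
    sim_prev = np.maximum(sim_prev, 0.0)
    sim_first = np.maximum(sim_first, 0.0)
    per_frame = (sim_prev + sim_first) / 2.0

    # Final score: mean over all frames except the first.
    return float(per_frame.mean())
```

Under these assumptions, a video whose frames all share the same feature vector scores 1.0, and the score drops as frames drift away from either their predecessor or the opening frame.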
Classes
Functions