metric ¶ VBench Aesthetic Quality — CLIP ViT-L/14 + LAION aesthetic predictor. Encodes frames through CLIP, passes L2-normalized features through a linear aesthetic head (768 → 1), and averages scores / 10. Classes¶ Functions¶