metric

VBench Color: GRiT dense captioning for color accuracy. The metric detects the target object with GRiT in each frame and checks whether the expected color keyword appears in that object's caption. Frames in which the object is not detected do not count toward the score.

Score = frames_with_correct_color / frames_with_object_detected
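The scoring rule can be illustrated with a minimal sketch. It is not the module's actual code and does not call GRiT: it assumes the per-frame captions for the target object have already been produced, with `None` standing for a frame where the object was not detected. The names `color_score` and `frame_captions` are hypothetical. Whole-word, case-insensitive keyword matching and returning `None` when the object is never detected are also assumptions, not documented behavior.

```python
import re
from typing import Optional, Sequence


def color_score(
    frame_captions: Sequence[Optional[str]],
    expected_color: str,
) -> Optional[float]:
    """Fraction of object-bearing frames whose caption mentions the color.

    frame_captions: one entry per frame; the caption of the detected target
        object, or None if the object was not detected (assumed input format).
    expected_color: color keyword to look for, e.g. "red".
    """
    # Whole-word, case-insensitive match (an assumption of this sketch).
    pattern = re.compile(rf"\b{re.escape(expected_color)}\b", re.IGNORECASE)

    # Denominator: only frames where the object was detected.
    detected = [caption for caption in frame_captions if caption is not None]
    if not detected:
        # No detections: the ratio is undefined, so return None (assumption).
        return None

    # Numerator: detected frames whose caption contains the color keyword.
    correct = sum(1 for caption in detected if pattern.search(caption))
    return correct / len(detected)


captions = [
    "a red car on the road",
    None,                      # object not detected in this frame
    "a blue car",
    "Red car parked",
    "a dark car at night",
]
# 2 of the 4 frames with a detection mention "red", so the score is 0.5.
print(color_score(captions, "red"))
```

Keeping undetected frames out of the denominator means the score measures color correctness only where the object is visible; it says nothing about how often the object was found.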