Language-assisted Vision Model Debugger

Vision models with high overall accuracy often exhibit systematic errors on some important subsets of data, posing potential serious safety concerns. Diagnosing such bugs of vision models is gaining increased attention, but traditional diagnostic methods require labor-intensive data collection and attributes annotation.

BibTex: