-
vHeat: Building Vision Models upon Heat Conduction
A fundamental problem in learning robust and expressive visual representations lies in efficiently estimating the spatial relationships of visual semantics throughout the entire... -
Language-assisted Vision Model Debugger
Vision models with high overall accuracy often exhibit systematic errors on some important subsets of data, posing potential serious safety concerns. Diagnosing such bugs of...