-
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Alpha-CLIP is an enhanced version of CLIP with an auxiliary alpha channel to suggest attentive regions and fine-tuned with constructed millions of RGBA region-text pairs. -
Google Scaned Dataset (GSO)
The Deitke dataset contains 30 objects chosen by SyncDreamer [17] and rendered 16 views with uniformly distributed camera poses and environment lighting for each object.