C-BEV: Contrastive Bird’s Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation

The CVUSA and CVACT datasets are used for cross-view geolocalization. The VIGOR dataset is used for cross-view image retrieval and 3-DoF pose estimation.

BibTeX: