C-BEV: Contrastive Bird’s Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation
The CVUSA and CVACT datasets are used for cross-view geolocalization. The VIGOR dataset is used for cross-view image retrieval and 3-DoF pose estimation.
BibTex: