-
Bag-of-visual-words and spatial extensions for land-use classification
Bag-of-visual-words and spatial extensions for land-use classification -
Deep semantic understanding of high resolution remote sensing image
Deep semantic understanding of high resolution remote sensing image -
Exploring models and data for remote sensing image caption generation
Exploring models and data for remote sensing image caption generation -
Remote Sensing Image Captioning
Remote Sensing Image Captioning Dataset (RSICD) and UCM-captions dataset for remote sensing image captioning -
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Alpha-CLIP is an enhanced version of CLIP with an auxiliary alpha channel to suggest attentive regions and fine-tuned with constructed millions of RGBA region-text pairs. -
Microsoft COCO
The Microsoft COCO dataset was used for training and evaluating the CNNs because it has become a standard benchmark for testing algorithms aimed at scene understanding and...