Generalizable Entity Grounding via Assistance of Large Language Model

The GELLA framework leverages a large language model to ground entities with long captions.

Group: Image Captioning
Format: JSON