Semantic Image-Text-Classes

This dataset was introduced in the paper "Understanding, Categorizing and Predicting Semantic Image-Text Relations".

If you use this dataset in your work, please cite:

@inproceedings{otto2019understanding,
  title={Understanding, Categorizing and Predicting Semantic Image-Text Relations},
  author={Otto, Christian and Springstein, Matthias and Anand, Avishek and Ewerth, Ralph},
  booktitle={In Proceedings of ACM International Conference on Multimedia Retrieval (ICMR 2019)},
  year={2019}
}

The training archive is split into several parts. To reassemble the full tar file, run the following command on the command line:

cat train.tar.part* > train_concat.tar

Then extract the archive with:

tar -xf train_concat.tar
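
On systems without cat and tar, the two commands above can be approximated in Python. This is a minimal sketch, not part of the dataset's tooling: the function names concat_parts and extract_tar are hypothetical, and it assumes the part files sort into the correct order by name, as they do for the shell glob.

```python
import shutil
import tarfile
from pathlib import Path


def concat_parts(part_dir, pattern="train.tar.part*", out_name="train_concat.tar"):
    """Join split archive parts in sorted name order, like `cat part* > out`."""
    part_dir = Path(part_dir)
    parts = sorted(part_dir.glob(pattern))
    if not parts:
        raise FileNotFoundError(f"no files matching {pattern!r} in {part_dir}")
    out_path = part_dir / out_name
    with open(out_path, "wb") as out:
        for part in parts:
            with open(part, "rb") as src:
                # Stream each part so large archives never sit fully in memory.
                shutil.copyfileobj(src, out)
    return out_path


def extract_tar(tar_path, dest_dir):
    """Unpack the archive into dest_dir, like `tar -xf` run inside dest_dir."""
    dest_dir = Path(dest_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    with tarfile.open(tar_path) as archive:
        if hasattr(tarfile, "data_filter"):
            # Newer Python versions offer a safer extraction filter.
            archive.extractall(dest_dir, filter="data")
        else:
            archive.extractall(dest_dir)
```

For example, extract_tar(concat_parts("."), ".") reassembles and unpacks the archive in the current directory.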

The JSONL files contain one metadata record per line, with the following fields:

id, origin, CMI, SC, STAT, ITClass, text, tagged text, image_path
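
The records can be read with the Python standard library. The sketch below assumes each line is a JSON object whose keys match the field names listed above; the exact key spellings (for example, whether "tagged text" contains a space or an underscore) should be checked against the actual files. The helper names read_jsonl and count_by_field are hypothetical.

```python
import json
from collections import Counter


def read_jsonl(path):
    """Yield one dict per non-empty line of a JSON Lines file."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


def count_by_field(records, field="ITClass"):
    """Count records per value of `field`.

    Records that lack the key are counted under None.
    """
    return Counter(record.get(field) for record in records)
```

Calling count_by_field(read_jsonl("train.jsonl")) would then give the number of samples per image-text class, where train.jsonl stands in for whichever JSONL file ships with the dataset.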

License Information:

This dataset is composed of various open access sources as described in the paper. We thank all the original authors for their work.

Cite this as

Christian Otto, Matthias Springstein, Avishek Anand, Ralph Ewerth (2019). Dataset: Semantic Image-Text-Classes. https://doi.org/10.25835/0010577

DOI retrieved: April 25, 2019

Additional Info

Imported on: October 14, 2021
Last update: August 4, 2023
License: CC-BY-NC-3.0
Source: https://data.uni-hannover.de/dataset/image-text-classes
Version: 1.0
Author: Christian Otto
More Authors: Matthias Springstein, Avishek Anand, Ralph Ewerth
Author Email: Christian Otto
Maintainer: Christian Otto
Maintainer Email: Christian Otto
Source Creation: 23 April, 2019, 10:18 AM (UTC+0000)
Source Modified: 20 January, 2022, 14:14 PM (UTC+0000)