- Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A S...
  The dataset for the paper titled Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey
- text2fabric
  A comprehensive, large-scale public dataset relating the visual appearance of fabrics to natural language.
- Localizing moments in video with natural language
  Localizing moments in video with natural language
- PhotoBot: Reference-Guided Interactive Photography via Natural Language
  PhotoBot is a framework for fully automated photo acquisition based on an interplay between high-level human language guidance and a robot photographer.
- Chinese Poetry
  The Chinese Poetry dataset is a dataset of Chinese poems used for language modeling.
- Penn Treebank
  The Penn Treebank dataset contains one million words of 1989 Wall Street Journal material annotated in Treebank II style, with 42k sentences of varying lengths.