Multimodal Convolutional Neural Networks for Matching Image and Sentence

Multimodal convolutional neural networks for matching image and sentence

BibTex: