-
Bengali Medical Corpus
A comprehensive 46-hour Bengali medical corpus encompassing disease names, symptoms, and symptom severity. -
VisualGenome datasets
The VisualGenome datasets containing Bengali, Hindi, and Malayalam sentences for fine-tuning. -
Bengali Hate Speech Dataset
The Bengali Hate Speech Dataset is a large-scale dataset for hate speech detection in the Bengali language. It contains 8,087 labelled examples, categorized into political,... -
Multimodal Hate Speech Detection in Bengali
Multimodal hate speech detection dataset for Bengali language -
Nayadiganta Dataset
Nayadiganta dataset is used as independent test set. -
Bengali and Hindi News Articles
Bengali dataset consists of articles from online public news portals such as Prothom-Alo, BDNews24 and Nayadiganta. The articles encompass domains such as politics,... -
BanglaLekhaImageCaptions dataset
The BanglaLekhaImageCaptions dataset is a modified version of the dataset introduced in [24]. It contains 9,154 images with two captions for each image. -
Bengali word segmentation
Bengali handwritten word segmentation dataset -
Bengali OCR
Bengali handwritten character recognition dataset -
BanglaWritting
Bengali handwritten word images dataset