Skip to main content
Exit comparison
Removed
Added

Datasets Comparison

Version 1

BanglaLekhaImageCaptions

Published:13 May 2019|Version 1|DOI:10.17632/rxxch9vw59.1
Contributors:Nafees Mansoor, Abrar Hasin Kamal, Nabeel Mohammed, Sifat Momen, Md Matiur Rahman

Description

This dataset consists of images and annotations in Bengali. The images are human annotated in Bengali by two adult native Bengali speakers. All popular image captioning datasets have a predominant western cultural bias with the annotations done in English. Using such datasets to train an image captioning system assumes that a good English to target language translation system exists and that the original dataset had elements of the target culture. Both these assumptions are false, leading to the need of a culturally relevant dataset in Bengali, to generate appropriate image captions of images relevant to the Bangladeshi and wider subcontinental context. The dataset presented consists of 9,154 images.

Categories

Artificial Intelligence, Computer Vision, Natural Language Processing, Machine Learning, Bengali Language, Bangladesh, Image Analysis

Licence

Creative Commons Attribution 4.0 International

Version 2

BanglaLekhaImageCaptions

Published:28 July 2019|Version 2|DOI:10.17632/rxxch9vw59.2
Contributors:Nafees Mansoor, Abrar Hasin Kamal, Nabeel Mohammed, Sifat Momen, Md Matiur Rahman

Description

This dataset consists of images and annotations in Bengali. The images are human annotated in Bengali by two adult native Bengali speakers. All popular image captioning datasets have a predominant western cultural bias with the annotations done in English. Using such datasets to train an image captioning system assumes that a good English to target language translation system exists and that the original dataset had elements of the target culture. Both these assumptions are false, leading to the need of a culturally relevant dataset in Bengali, to generate appropriate image captions of images relevant to the Bangladeshi and wider subcontinental context. The dataset presented consists of 9,154 images.

Categories

Artificial Intelligence, Computer Vision, Natural Language Processing, Machine Learning, Bengali Language, Bangladesh, Image Analysis

Licence

Creative Commons Attribution 4.0 International