BanglaLekhaImageCaptions

Name: BanglaLekhaImageCaptions
Creator: Nafees Mansoor
Published: 2019-07-28T03:33:29.635Z
Keywords: Artificial Intelligence, Computer Vision, Natural Language Processing, Machine Learning, Bengali Language, Bangladesh, Image Analysis

Mansoor, Nafees; Kamal, Abrar Hasin; Mohammed, Nabeel; Momen, Sifat; Rahman, Md Matiur

doi:10.17632/rxxch9vw59.2

Published: 28 July 2019 | Version 2 | DOI: 10.17632/rxxch9vw59.2

Contributors:

Nafees Mansoor, Abrar Hasin Kamal, Nabeel Mohammed, Sifat Momen, Md Matiur Rahman

Description

This dataset consists of images and their annotations in Bengali. The images were annotated by hand in Bengali by two adult native Bengali speakers. All popular image captioning datasets carry a predominantly Western cultural bias, and their annotations are in English. Training an image captioning system on such datasets assumes that a good English-to-target-language translation system exists and that the original dataset contains elements of the target culture. Both assumptions are false, which creates the need for a culturally relevant dataset in Bengali that supports generating appropriate captions for images relevant to the Bangladeshi and wider subcontinental context. The dataset consists of 9,154 images.
