Datasets Comparison
Version 1
BanglaLekhaImageCaptions
Description
This dataset consists of images and their annotations in Bengali. The images were annotated by two adult native Bengali speakers. All popular image captioning datasets have a predominantly Western cultural bias, with annotations written in English. Using such datasets to train an image captioning system assumes that a good English-to-target-language translation system exists and that the original dataset contains elements of the target culture. Both assumptions are false, which creates the need for a culturally relevant dataset in Bengali to generate appropriate captions for images relevant to the Bangladeshi and wider subcontinental context. The dataset consists of 9,154 images.
Categories
Artificial Intelligence, Computer Vision, Natural Language Processing, Machine Learning, Bengali Language, Bangladesh, Image Analysis
Licence
Creative Commons Attribution 4.0 International
Version 2
BanglaLekhaImageCaptions
Description
This dataset consists of images and their annotations in Bengali. The images were annotated by two adult native Bengali speakers. All popular image captioning datasets have a predominantly Western cultural bias, with annotations written in English. Using such datasets to train an image captioning system assumes that a good English-to-target-language translation system exists and that the original dataset contains elements of the target culture. Both assumptions are false, which creates the need for a culturally relevant dataset in Bengali to generate appropriate captions for images relevant to the Bangladeshi and wider subcontinental context. The dataset consists of 9,154 images.
Categories
Artificial Intelligence, Computer Vision, Natural Language Processing, Machine Learning, Bengali Language, Bangladesh, Image Analysis
Licence
Creative Commons Attribution 4.0 International