MatriVasha: Bangla Handwritten Compound Character Dataset and Recognition

Published: 8 June 2021| Version 1 | DOI: 10.17632/v39pc2g2wp.1
Contributors:
,
,
,

Description

Cite this dataset as: Ferdous J., Karmaker S., Rabby A.K.M.S.A., Hossain S. (2021) MatriVasha: A Multipurpose Comprehensive Database for Bangla Handwritten Compound Characters. In: Tavares J.M.R.S., Chakrabarti S., Bhattacharya A., Ghatak S. (eds) Emerging Technologies in Data Mining and Information Security. Lecture Notes in Networks and Systems, vol 164. Springer, Singapore. https://doi.org/10.1007/978-981-15-9774-9_74 MatriVasha the largest dataset of handwritten Bangla compound characters for research on handwritten Bangla compound character recognition. The proposed dataset contains 120 different types of compound characters that consist of 306,464‬ images written where 152,950 male and 153,514 female handwritten Bangla compound characters. This dataset can be used for other issues such as gender, age, district base handwriting research because the sample was collected that included district authenticity, age group, and an equal number of men and women.

Files

Steps to reproduce

Unzip the image and use it. The class map can be found in the metadata.csv. Cite this dataset: Ferdous J., Karmaker S., Rabby A.K.M.S.A., Hossain S. (2021) MatriVasha: A Multipurpose Comprehensive Database for Bangla Handwritten Compound Characters. In: Tavares J.M.R.S., Chakrabarti S., Bhattacharya A., Ghatak S. (eds) Emerging Technologies in Data Mining and Information Security. Lecture Notes in Networks and Systems, vol 164. Springer, Singapore. https://doi.org/10.1007/978-981-15-9774-9_74