BanglaTense: A Large-Scale Dataset of Bangla Sentences Categorized by Tense: Past, Present, and Future

Name: BanglaTense: A Large-Scale Dataset of Bangla Sentences Categorized by Tense: Past, Present, and Future
Creator: Umme Ayman
Published: 2025-01-15T17:16:22.502Z
Keywords: Natural Language Processing, Machine Learning, Sentence Processing

Ayman, Umme; Bijoy, Md Hasan Imam; Mithu, Md. Monarul Islam

doi:10.17632/39w5khrg87.4

Published: 15 January 2025 | Version 4 | DOI: 10.17632/39w5khrg87.4

Description

The BanglaTense dataset is a collection of Bangla (Bengali) sentences, each labeled with one of three tenses: past, present, or future. It contains 17,819 annotated sentences: 5,629 in the past tense, 6,101 in the present tense, and 6,089 in the future tense. The dataset is intended to support research and development in natural language processing (NLP) and computational linguistics for Bangla, a language widely spoken in Bangladesh and parts of India. Potential applications include tense detection, text classification, language modeling, and educational tools, as well as temporal analysis of Bangla text more broadly.
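As a quick sanity check on the figures above, the following minimal Python sketch recomputes the total and the per-class shares from the three counts stated in the description. It does not read the dataset files, whose names and layout are not given here. The dictionary keys and variable names are illustrative assumptions, not field names from the dataset.

```python
# Per-class sentence counts, copied from the dataset description.
# The keys ("past", "present", "future") are illustrative labels only.
counts = {"past": 5629, "present": 6101, "future": 6089}

total = sum(counts.values())

# Share of each tense class in the full dataset.
shares = {tense: n / total for tense, n in counts.items()}

# Accuracy of a trivial classifier that always predicts the largest class;
# a useful lower bound when evaluating a tense-detection model.
majority_tense = max(counts, key=counts.get)
majority_baseline = counts[majority_tense] / total

for tense, share in shares.items():
    print(f"{tense}: {counts[tense]} sentences ({share:.1%})")
print(f"total: {total}")
print(f"majority-class baseline ({majority_tense}): {majority_baseline:.1%}")
```

The three classes each hold roughly a third of the sentences, so the classes are close to balanced and a majority-class baseline sits only slightly above one third.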

Institutions

Daffodil International University
