EnglishTense: A large scale English texts dataset categorized into three categories: Past, Present, Future tenses.

Published: 5 August 2024| Version 1 | DOI: 10.17632/jnb2xp9m4r.1
Contributors:
,
,

Description

he EnglishTense dataset is a comprehensive collection of English sentences meticulously categorized based on their tense: Past, Present, and Future. The dataset comprises a total of 13,316 annotated sentences with three distinct tense types: 4,621 in the present tense, 3,851 in the past tense, and 4,844 in the future tense. This dataset is designed to facilitate research and development in natural language processing (NLP) and computational linguistics, particularly for English, a widely spoken language in the world. With applications spanning tense detection, text classification, language modeling, and educational tools, EnglishTense is a valuable resource for the NLP community, facilitating advancements in temporal analysis and robust NLP model development.

Files

Institutions

Daffodil International University

Categories

Data Science, Natural Language Processing, Sentence Processing

Licence