EnglishTense: A large scale English texts dataset categorized into three categories: Past, Present, Future tenses.

Name: EnglishTense: A large scale English texts dataset categorized into three categories: Past, Present, Future tenses.
Creator: Umme Ayman ayman
Published: 2024-08-05T19:13:50.929Z
Keywords: Data Science, Natural Language Processing, Sentence Processing

ayman, Umme Ayman; Rahman, Md. Hafizur; Islam, Md. Shafiqul

doi:10.17632/jnb2xp9m4r.1

EnglishTense: A large scale English texts dataset categorized into three categories: Past, Present, Future tenses.

Published: 5 August 2024| Version 1 | DOI: 10.17632/jnb2xp9m4r.1

Contributors:

,

Description

he EnglishTense dataset is a comprehensive collection of English sentences meticulously categorized based on their tense: Past, Present, and Future. The dataset comprises a total of 13,316 annotated sentences with three distinct tense types: 4,621 in the present tense, 3,851 in the past tense, and 4,844 in the future tense. This dataset is designed to facilitate research and development in natural language processing (NLP) and computational linguistics, particularly for English, a widely spoken language in the world. With applications spanning tense detection, text classification, language modeling, and educational tools, EnglishTense is a valuable resource for the NLP community, facilitating advancements in temporal analysis and robust NLP model development.

Files

Institutions

Daffodil International University

EnglishTense: A large scale English texts dataset categorized into three categories: Past, Present, Future tenses.

Description

Files

Institutions

Categories

Licence