BenCoref: A Dataset of Nominal Phrase and Pronominal Reference Annotations in Bengali

Published: 8 June 2021| Version 3 | DOI: 10.17632/c59rssj3t4.3
Contributors:
,
, Mohammad Mamun Or Rashid,

Description

This dataset contains 3622 coreference annotations forming 356 coreference clusters in 31630 tokens. The dataset is divided into 69 documents. The text in the documents come accross in 2 categories: short story and novel. The short stories and novels originate from classic Bengali literature.

Files

Institutions

  • North South University

Categories

Natural Language Processing

Licence