BenCoref: A Dataset of Nominal Phrases and Pronominal Reference Annotations

Published: 9 June 2021| Version 4 | DOI: 10.17632/c59rssj3t4.4
Contributors:
,
, Mohammad Mamun Or Rashid,

Description

This dataset contains 3622 coreference annotations forming 356 coreference clusters in 31630 tokens. The dataset is divided into 69 documents. The text in the documents originate from classic Bengali literature and comes across in 2 categories: short story and novel.

Files

Institutions

  • North South University

Categories

Natural Language Processing

Licence