ontonotes-conll2012

Published: 14 March 2022| Version 2 | DOI: 10.17632/zmycy7t9h9.2
Contributor:
Frank Xavier

Description

** Please cite the original paper and distribution but not this distribution. ** Extended version of OntoNotesV5.0 used in CoNLL2012, includes extension version of v4 English, Arabic, Chinese (test data of 3 langs are v9), and v12 English This data is processed based on the official script (v4) https://conll.cemantix.org/2012/data.html (v12) https://cemantix.org/data/ontonotes.html (v12 script) https://github.com/yuchenlin/OntoNotes-5.0-NER-BIO directory structure (only top level directories showed here, refer to https://conll.cemantix.org/2012/data.html for the lower-level ones) conll-2012 ├── v12 │   └── data │   ├── conll-2012-test │   │   └── data │   │   └── english │   ├── development │   │   └── data │   │   └── english │   ├── test │   │   └── data │   │   └── english │   └── train │   └── data │   └── english └── v4 └── data ├── development │   └── data │   ├── arabic │   ├── chinese │   └── english ├── test │   └── data │   ├── arabic │   ├── chinese │   └── english └── train └── data ├── arabic ├── chinese └── english

Files

Categories

Natural Language Processing

Licence