Three Chinese military-related news websites used for named entity recognition

Published: 26 December 2018| Version 1 | DOI: 10.17632/7j9hkwtnr7.1
Contributor:
Jianguo Xu

Description

there are 106230 sentences from May 7, 2016 to September 6, 2018 which are divided into two dataset, existing dataset and new dataset.

Files

Steps to reproduce

Including a few fundamental steps, including data collecting, data cleaning, word embeddings building, and data tagging.

Categories

Infometrics

Licence