Three Chinese military-related news websites used for named entity recognition
Published: 26 December 2018 | Version 1 | DOI: 10.17632/7j9hkwtnr7.1
Contributor:
Jianguo Xu
Description
The data comprise 106,230 sentences spanning May 7, 2016 to September 6, 2018, divided into two datasets: an existing dataset and a new dataset.
Steps to reproduce
The dataset was built in four fundamental steps: data collection, data cleaning, word-embedding construction, and data tagging.
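The cleaning and tagging steps might look roughly like the sketch below. It is an illustration only, not the contributor's actual pipeline: the function names `clean` and `bio_tag`, the character-level BIO scheme, the `GAZETTEER` entries, and the labels `WEAPON` and `ORG` are all assumptions, since the record does not state the tag set or tooling. Data collection and word-embedding construction are left out because they depend on the source websites and on a third-party embedding library.

```python
import re

# Hypothetical gazetteer (entity string -> label); not part of the dataset.
GAZETTEER = {"辽宁舰": "WEAPON", "海军": "ORG"}


def clean(raw_texts):
    """Strip HTML tags and whitespace, split into sentences, drop duplicates."""
    seen, sentences = set(), []
    for text in raw_texts:
        text = re.sub(r"<[^>]+>", "", text)   # remove HTML tags
        text = re.sub(r"\s+", "", text)       # remove all whitespace
        # A sentence is a run of characters ending at 。！ or ？ (if present).
        for sent in re.findall(r"[^。！？]+[。！？]?", text):
            if sent not in seen:
                seen.add(sent)
                sentences.append(sent)
    return sentences


def bio_tag(sentence, gazetteer=GAZETTEER):
    """Assign a character-level BIO tag by exact gazetteer lookup."""
    tags = ["O"] * len(sentence)
    # Try longer names first so they win over any shorter substring entries.
    for name in sorted(gazetteer, key=len, reverse=True):
        start = sentence.find(name)
        while start != -1:
            end = start + len(name)
            # Only tag spans that have not already been claimed.
            if all(t == "O" for t in tags[start:end]):
                tags[start] = "B-" + gazetteer[name]
                for i in range(start + 1, end):
                    tags[i] = "I-" + gazetteer[name]
            start = sentence.find(name, end)
    return list(zip(sentence, tags))


if __name__ == "__main__":
    pages = ["<p>辽宁舰 出海。</p>", "辽宁舰出海。海军训练。"]
    for sent in clean(pages):
        print(bio_tag(sent))
```

A gazetteer lookup is only one way to produce initial tags; manual annotation or a trained tagger would slot into the same place as `bio_tag`.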
Categories
Infometrics