Three Chinese military-related news websites used for named entity recognition

Published: 26-12-2018 | Version 1 | DOI: 10.17632/7j9hkwtnr7.1
Jianguo Xu


There are 106,230 sentences, dated from May 7, 2016 to September 6, 2018, which are divided into two datasets: an existing dataset and a new dataset.


Steps to reproduce

The data were prepared in a few fundamental steps: data collection, data cleaning, word embedding construction, and data tagging.
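
The cleaning and tagging steps might look roughly like the sketch below. It is an illustration only, not the procedure used to build this dataset. The cleaning rules, the character-level BIO scheme, the lexicon-based tagger, and the labels WEAPON and LOC are all assumptions, and the function names are hypothetical. Data collection and word embedding construction are left out because they depend on the source websites and on a training library that the record does not name.

```python
import re


def clean_sentence(text):
    """Remove HTML tags and all whitespace (assumed cleaning rules)."""
    text = re.sub(r"<[^>]+>", "", text)
    return re.sub(r"\s+", "", text)


def split_sentences(text):
    """Split after Chinese sentence-final punctuation, keeping the punctuation."""
    parts = re.split(r"(?<=[。！？])", text)
    return [p for p in parts if p]


def bio_tag(sentence, lexicon):
    """Assign character-level BIO tags by longest-match lookup.

    `lexicon` is a hypothetical dict mapping entity strings to labels.
    """
    tags = ["O"] * len(sentence)
    entries = sorted(lexicon, key=len, reverse=True)  # try longest entities first
    i = 0
    while i < len(sentence):
        for entity in entries:
            if sentence.startswith(entity, i):
                label = lexicon[entity]
                tags[i] = "B-" + label
                for j in range(i + 1, i + len(entity)):
                    tags[j] = "I-" + label
                i += len(entity)
                break
        else:
            i += 1  # no entity starts here; leave the tag as "O"
    return tags


# Hypothetical input and lexicon, for illustration only.
raw = "<p>辽宁舰 抵达青岛。</p>"
lexicon = {"辽宁舰": "WEAPON", "青岛": "LOC"}

sentences = split_sentences(clean_sentence(raw))
tagged = [list(zip(s, bio_tag(s, lexicon))) for s in sentences]
```

Each element of `tagged` is a list of (character, tag) pairs for one sentence, which is a common input layout for sequence-labeling models. A real tagging pass would more likely rely on manual annotation or a trained model than on a fixed lexicon.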