Three Chinese military-related news websites used for named entity recognition
Published: 26 December 2018 | Version 1 | DOI: 10.17632/7j9hkwtnr7.1
Contributor:
Jianguo Xu
Description
The data comprise 106,230 sentences published between May 7, 2016 and September 6, 2018, divided into two datasets: an existing dataset and a new dataset.
Files
Steps to reproduce
The procedure consists of four basic steps: data collection, data cleaning, word-embedding construction, and data tagging.
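As an illustration only, the cleaning and tagging steps might look like the Python sketch below. It is not the contributor's actual pipeline: the character-level BIO scheme, the gazetteer lookup, the entity types (EQUIP, LOC), and the function names clean and bio_tag are all assumptions made for the example. Data collection and word-embedding construction are not shown.

```python
import re
from typing import Dict, List, Tuple

# Hypothetical gazetteer mapping entity strings to entity types.
# These entries and type names are illustrative, not taken from the dataset.
GAZETTEER: Dict[str, str] = {"辽宁舰": "EQUIP", "南海": "LOC"}


def clean(sentence: str) -> str:
    """Remove HTML tags and all whitespace (an assumed cleaning rule)."""
    without_tags = re.sub(r"<[^>]+>", "", sentence)
    return re.sub(r"\s+", "", without_tags)


def bio_tag(sentence: str, gazetteer: Dict[str, str]) -> List[Tuple[str, str]]:
    """Assign a character-level BIO tag to every character of the sentence."""
    tags = ["O"] * len(sentence)
    # Try longer entities first so they take priority over shorter overlaps.
    for entity in sorted(gazetteer, key=len, reverse=True):
        label = gazetteer[entity]
        start = sentence.find(entity)
        while start != -1:
            end = start + len(entity)
            # Tag the span only if none of its characters is already tagged.
            if all(tag == "O" for tag in tags[start:end]):
                tags[start] = "B-" + label
                for i in range(start + 1, end):
                    tags[i] = "I-" + label
            start = sentence.find(entity, end)
    return list(zip(sentence, tags))


if __name__ == "__main__":
    raw = "<p>辽宁舰 驶入 南海</p>"
    for char, tag in bio_tag(clean(raw), GAZETTEER):
        print(char, tag)
```

A gazetteer lookup is used here only because it keeps the sketch short; tagging in practice may be manual or model-assisted.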
Categories
Infometrics