Three Chinese military-related news websites used for named entity recognition
Published: 26-12-2018 | Version 1 | DOI: 10.17632/7j9hkwtnr7.1
Contributor:
Description
The data comprise 106,230 sentences dated from May 7, 2016 to September 6, 2018, divided into two datasets: an existing dataset and a new dataset.
Steps to reproduce
The procedure involves a few fundamental steps, including data collection, data cleaning, word embedding construction, and data tagging.
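The cleaning and tagging steps can be illustrated with a minimal Python sketch. It is not the contributors' actual pipeline: the lexicon, the tag names (EQUIP, ORG), the character-level BIO scheme, and the helper names `clean`, `split_sentences`, and `bio_tag` are all assumptions made for illustration. Data collection and word embedding construction are left out.

```python
import re

# Hypothetical entity lexicon; the dataset's real tag set is not stated here.
LEXICON = {"辽宁舰": "EQUIP", "海军": "ORG"}


def clean(text):
    """Strip HTML tags, then remove all whitespace (assumed cleaning rules)."""
    text = re.sub(r"<[^>]+>", "", text)
    return re.sub(r"\s+", "", text)


def split_sentences(text):
    """Split on Chinese sentence-ending punctuation, keeping the punctuation."""
    return re.findall(r"[^。！？]+[。！？]?", text)


def bio_tag(sentence, lexicon):
    """Assign character-level BIO tags by dictionary lookup, longest term first."""
    tags = ["O"] * len(sentence)
    for term in sorted(lexicon, key=len, reverse=True):
        start = sentence.find(term)
        while start != -1:
            end = start + len(term)
            # Only tag spans that no longer term has already claimed.
            if all(t == "O" for t in tags[start:end]):
                tags[start] = "B-" + lexicon[term]
                for i in range(start + 1, end):
                    tags[i] = "I-" + lexicon[term]
            start = sentence.find(term, end)
    return list(zip(sentence, tags))


if __name__ == "__main__":
    raw = "<p>辽宁舰 隶属海军。</p><p>训练开始。</p>"
    for sentence in split_sentences(clean(raw)):
        for char, tag in bio_tag(sentence, LEXICON):
            print(char, tag)
        print()
```

Dictionary lookup is used here only because it keeps the sketch self-contained; manual annotation or a trained tagger could fill the same role.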