Dataset of ‘A Literature Mining Method of Fusing Text and Table Extraction in Materials Science’
We propose a named entity recognition model for materials science text, called SciBERT-Fasttext-BiLSTM-CRF (SFBC). We used this model to identify named entities in the text of the stainless steel scientific literature, and this dataset shares the frequency of occurrence of selected entities in that literature between 2012 and 2021. By analysing these frequencies, researchers can identify the main research trends in stainless steel materials over that decade.
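As a rough illustration of how such frequency data might be explored, the sketch below ranks entities by the change in their frequency between two years. The record layout `(year, entity, frequency)`, the entity names, the counts, and the helper names `frequency_by_entity` and `change_between` are all assumptions made for this example; they are not taken from the dataset files.

```python
from collections import defaultdict

# Hypothetical records in the form (year, entity, frequency).
# These values are invented for illustration only.
records = [
    (2012, "corrosion resistance", 10),
    (2012, "austenite", 7),
    (2021, "corrosion resistance", 25),
    (2021, "austenite", 6),
    (2021, "additive manufacturing", 12),
]


def frequency_by_entity(rows):
    """Build {entity: {year: frequency}}, summing duplicate (year, entity) rows."""
    table = defaultdict(dict)
    for year, entity, freq in rows:
        table[entity][year] = table[entity].get(year, 0) + freq
    return dict(table)


def change_between(table, start, end):
    """Return {entity: freq(end) - freq(start)}; a missing year counts as 0."""
    return {
        entity: years.get(end, 0) - years.get(start, 0)
        for entity, years in table.items()
    }


table = frequency_by_entity(records)
growth = change_between(table, 2012, 2021)

# Entities with the largest increase come first.
ranked = sorted(growth.items(), key=lambda kv: kv[1], reverse=True)
for entity, delta in ranked:
    print(f"{entity}: {delta:+d}")
```

The difference between two years is only one possible trend measure. A per-year share of all entity mentions may be more robust if the number of papers grows over the period.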
National Key Research and Development Program of China
Natural Science Foundation of Shanghai