TP-NER: A named entity recognition dataset of target and precursor named entities for high-temperature steel
Published: 16 December 2022| Version 1 | DOI: 10.17632/5zng6khy9h.1
Contributors:
M Saef Ullah Miah, Junaida Sulaiman, , Talha Bin SarwarDescription
This dataset contains target and precursor-named entity data for high-temperature steel-related texts from published papers in IOB format. This dataset has been annotated and verified by domain experts. This dataset has 249304 rows and 3 columns. The sentence ID is in the first column, the tokens are in the second column, and the target or precursor tag is in the third column. This dataset contains data for five annotated documents among 25 documents.
Files
Steps to reproduce
1. Collect data from renowned publishers and get by the highest citation 2. Annotate by the domain expert 3. Validate by the domain expert
Institutions
Universiti Malaysia Pahang
Categories
Materials Science, Computational Materials Science, Natural Language Processing