TP-NER: A named entity recognition dataset of target and precursor named entities for high-temperature steel

Published: 16 December 2022| Version 1 | DOI: 10.17632/5zng6khy9h.1
Contributors:
M Saef Ullah Miah, Junaida Sulaiman,
, Talha Bin Sarwar

Description

This dataset contains target and precursor-named entity data for high-temperature steel-related texts from published papers in IOB format. This dataset has been annotated and verified by domain experts. This dataset has 249304 rows and 3 columns. The sentence ID is in the first column, the tokens are in the second column, and the target or precursor tag is in the third column. This dataset contains data for five annotated documents among 25 documents.

Files

Steps to reproduce

1. Collect data from renowned publishers and get by the highest citation 2. Annotate by the domain expert 3. Validate by the domain expert

Institutions

Universiti Malaysia Pahang

Categories

Materials Science, Computational Materials Science, Natural Language Processing

Licence