Data for: Extractive Summarization of Clinical Trial Descriptions

Published: 13 June 2019 | Version 1 | DOI: 10.17632/gg58kc7zy7.1
Contributor:
Christian Gulden

Description

This archive contains the summarization corpus produced by the filtering stages (trials-final.csv), the ROUGE scores for the generated summaries (rouge-results-parsed.csv), the data and results of the human evaluation (evaluation/ subfolder), and the code used to generate the corpus (extract.r, filter.r, and determine_similarity_threshold.r). The summaries were generated using the summarize_all.py script.
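Because the description does not list the columns of the two CSV files, a first step after unpacking the archive might be to check their headers. The following is a minimal sketch, not part of the archive: the helper name preview_csv is hypothetical, and it assumes the files are comma-delimited, UTF-8 encoded, and located in the current working directory.

```python
import csv
from pathlib import Path


def preview_csv(path, n_rows=5):
    """Return (header, first n_rows data rows) of a CSV file.

    Hypothetical helper: makes no assumption about column names,
    but does assume comma delimiters and UTF-8 encoding.
    """
    with Path(path).open(newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader, [])  # empty list if the file has no rows
        rows = [row for _, row in zip(range(n_rows), reader)]
    return header, rows


if __name__ == "__main__":
    # File names come from the description; their location is an assumption.
    for name in ("trials-final.csv", "rouge-results-parsed.csv"):
        if Path(name).exists():
            header, rows = preview_csv(name)
            print(name, "columns:", header, "rows previewed:", len(rows))
        else:
            print(name, "not found in the current directory")
```

If the files turn out to use a different delimiter or encoding, the delimiter argument of csv.reader and the encoding argument of open would need to be adjusted accordingly.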

Categories

Medical Informatics, Natural Language Processing, Text Mining

License