Radiological Exam Scheduling Times in Public Healthcare (Aragon, Spain): A Cleaned Dataset from IACS and SERAM
Description
This dataset provides information about radiological procedures carried out in public hospitals in the Spanish region of Aragón during a representative month of scheduled (non-emergency) activity. It has been compiled from two main sources: - The SERAM catalog (Spanish Society of Medical Radiology) from 2004 and 2009, which offers standardized estimated durations for a wide variety of radiological procedures. - Anonymized hospital scheduling records extracted from the Hospital Information System (HIS), provided by the Aragonese Institute of Health Sciences (IACS). The resulting dataset is a merged and cleaned version of the SERAM standards and the IACS hospital logs. The original SERAM data was extracted from PDFs and converted to structured format (Excel), followed by a meticulous cleaning process. However, it is important to clarify several aspects regarding data quality and interpretation: The so-called “actual duration” (duracion_real) values do not reflect the real execution time of radiological procedures. Instead, they refer to agenda slot durations allocated in the hospital's scheduling system. In other words, they represent planned time blocks, not the actual time a patient enters or exits the radiology room. To obtain real-time clinical execution data, it would be necessary to access the Radiology Information System (RIS), which logs precise timestamps of image acquisition, patient preparation, and post-procedure cleaning. Such access was not available. Therefore, these timing values may be overestimated or generalized, and do not account for operational variability inherent to clinical settings. Despite these limitations, the dataset remains valuable for several use cases: - As a foundation for synthetic data generation. - For workflow modeling, bottleneck analysis, or resource planning studies, especially when complemented with assumptions or artificial enhancements. - As a baseline structure for healthcare simulation frameworks. Its utility lies primarily in simulation, modeling, and educational applications for data analysis, rather than precise clinical auditing. So even though this dataset does not fit perfectly fine-grained clinical efficiency analyses, it is a representation and a structured approximation from radiological studies in the real world. The dataset contains 103.204 rows from different radiology deparments. It is fully anonymized and contains no personal or sensitive information. All fields, including column names are in spanish languaje but a translation and explanation of all fields is given. Field metadata descriptions is given below in the uploaded file "metadata_description.txt"
Files
Institutions
Categories
Funding
Spanish State Research Agency
PID2020-113037RB-I00