Datasets and R Markdown files for the article "Survey on critical results management in Brazilian clinical laboratories: Profiling practices through multivariate analysis, prioritization, and a 'New Statistics' approach" submitted to Clinica Chimica Acta
Description
This repository contains supplementary materials related to the study "๐๐ฎ๐ซ๐ฏ๐๐ฒ ๐จ๐ง ๐๐ซ๐ข๐ญ๐ข๐๐๐ฅ ๐ซ๐๐ฌ๐ฎ๐ฅ๐ญ๐ฌ ๐ฆ๐๐ง๐๐ ๐๐ฆ๐๐ง๐ญ ๐ข๐ง ๐๐ซ๐๐ณ๐ข๐ฅ๐ข๐๐ง ๐๐ฅ๐ข๐ง๐ข๐๐๐ฅ ๐ฅ๐๐๐จ๐ซ๐๐ญ๐จ๐ซ๐ข๐๐ฌ: ๐๐ซ๐จ๐๐ข๐ฅ๐ข๐ง๐ ๐ฉ๐ซ๐๐๐ญ๐ข๐๐๐ฌ ๐ญ๐ก๐ซ๐จ๐ฎ๐ ๐ก ๐ฆ๐ฎ๐ฅ๐ญ๐ข๐ฏ๐๐ซ๐ข๐๐ญ๐ ๐๐ง๐๐ฅ๐ฒ๐ฌ๐ข๐ฌ, ๐ฉ๐ซ๐ข๐จ๐ซ๐ข๐ญ๐ข๐ณ๐๐ญ๐ข๐จ๐ง, ๐๐ง๐ ๐ '๐๐๐ฐ ๐๐ญ๐๐ญ๐ข๐ฌ๐ญ๐ข๐๐ฌ' ๐๐ฉ๐ฉ๐ซ๐จ๐๐๐ก". The dataset, figures, exported results, and analysis scripts are included to ensure full transparency and reproducibility of the research findings. ๐ ๐จ๐ฅ๐๐๐ซ ๐๐ญ๐ซ๐ฎ๐๐ญ๐ฎ๐ซ๐ 1_๐๐๐ญ๐๐ฌ๐๐ญ/ This folder contains the dataset used in the study, formatted for direct use in the Feature Priorizer R Markdown script. 2_๐ ๐ข๐ ๐ฎ๐ซ๐๐ฌ/ All figures generated by the Feature Priorizer are stored here in 600 DPI resolution, ensuring high-quality graphics for publication and analysis. 3_๐๐ฑ๐ฉ๐จ๐ซ๐ญ๐๐/ This folder contains the exported results, including statistical outputs, tables, and processed datasets derived from the analyses. 4_๐๐ฎ๐ฉ๐ฉ๐ฅ๐๐ฆ๐๐ง๐ญ๐๐ซ๐ฒ_๐ ๐ข๐ฅ๐๐ฌ/ This folder contains auxiliary files used in generating the Feature Priorizer HTML report, ensuring an enhanced visual presentation and incorporating dynamic statistical quotes. โ ๐ฌ๐ญ๐ฒ๐ฅ๐๐ฌ.๐๐ฌ๐ฌ: Defines the formatting of the HTML report, ensuring a consistent visual presentation. logo.html, logo.png, logo.txt โ Files related to the project's visual identity. โ ๐๐๐ข๐๐ง๐๐_๐๐ญ๐๐ญ๐ฌ_๐๐๐๐ฅ๐๐๐ญ๐ข๐จ๐ง๐ฌ.๐ฃ๐ฉ๐๐ : An image displayed in the HTML report, complementing the section on statistical and scientific reflections. โ ๐ฌ๐ญ๐๐ญ๐ช๐ฎ๐จ๐ญ๐_๐๐ฒ๐๐ฅ๐_๐ฌ๐ญ๐๐ญ๐.๐ซ๐๐ฌ: An RDS file that stores the state of the statistical quotes cycle. This file is dynamically updated to prevent repetitions, ensuring that the quotes presented in the report change with each execution. 5_๐ ๐๐๐ญ๐ฎ๐ซ๐ ๐๐ซ๐ข๐จ๐ซ๐ข๐ณ๐๐ซ โ ๐ ๐๐๐ซ๐ค๐๐จ๐ฐ๐ง ๐๐๐ซ๐ข๐ฉ๐ญ The "Feature Priorizer" is an R Markdown-based analytical pipeline (Script_Feature_Prioritizer.Rmd) developed to perform the full multivariate analysis workflow presented in the study. The script integrates: A) Dimensionality reduction (Logistic PCA) B) Unsupervised clustering (K-Means) C) Feature prioritization using the Nihans Index and Pareto Analysis D) Statistical and practical significance assessment (Chi-square test, Cohen's h) E) Automated report generation in HTML format, including figures and tables 6_๐ ๐ข๐ฅ๐๐ฌ ๐๐๐ฅ๐๐ญ๐๐ ๐ญ๐จ ๐ญ๐ก๐ ๐ ๐๐๐ญ๐ฎ๐ซ๐ ๐๐ซ๐ข๐จ๐ซ๐ข๐ณ๐๐ซ โ ๐๐๐ซ๐ข๐ฉ๐ญ_๐ ๐๐๐ญ๐ฎ๐ซ๐_๐๐ซ๐ข๐จ๐ซ๐ข๐ญ๐ข๐ณ๐๐ซ.๐๐ฆ๐: The R Markdown script that executes the entire analytical pipeline โ ๐๐๐ซ๐ข๐ฉ๐ญ_๐ ๐๐๐ญ๐ฎ๐ซ๐_๐๐ซ๐ข๐จ๐ซ๐ข๐ญ๐ข๐ณ๐๐ซ.๐ก๐ญ๐ฆ๐ฅ: The automatically generated HTML report containing all results, figures, and statistical summaries โ ๐๐ง๐ฌ๐ญ๐๐ฅ๐ฅ_๐ฉ๐๐๐ค๐๐ ๐๐ฌ.๐๐ฆ๐: A helper script that installs all necessary R packages for running the Feature Priorizer
Files
Steps to reproduce
To reproduce this study using the "Feature Priorizer" R Markdown tool, researchers should follow these steps: ๐) ๐๐๐ญ๐ ๐๐จ๐ฅ๐ฅ๐๐๐ญ๐ข๐จ๐ง: Use the questionnaire provided in the study to collect responses from laboratories. Ensure that responses are recorded consistently to facilitate processing. ๐) ๐๐๐ญ๐ ๐๐ซ๐๐ง๐ฌ๐๐จ๐ซ๐ฆ๐๐ญ๐ข๐จ๐ง & ๐ ๐๐๐ญ๐ฎ๐ซ๐ ๐๐ง๐ ๐ข๐ง๐๐๐ซ๐ข๐ง๐ (๐๐๐ญ๐ ๐ฉ๐ซ๐๐ฉ๐๐ซ๐๐ญ๐ข๐จ๐ง): Convert the collected responses into structured features. In this study, 60 features were created, but additional features may be defined depending on the analytical context. Each feature should be encoded as a binary variable (Yes/No format) following the methodology applied in this study. To enhance interpretability and standardization, we recommend naming each feature using Lexical Blendsโformed by merging parts of two or more wordsโfollowing the approach used in this study. This helps create intuitive and meaningful labels for each feature. ๐) ๐๐๐ญ๐๐ฌ๐๐ญ ๐ ๐จ๐ซ๐ฆ๐๐ญ๐ญ๐ข๐ง๐ & ๐๐ซ๐ ๐๐ง๐ข๐ณ๐๐ญ๐ข๐จ๐ง: Save the dataset as an Excel file (.xlsx format) and place it inside the "1_Dataset" folder. Then, define: ๐.๐) The file name of the XLSX dataset; ๐.๐) The worksheet name (spreadsheet tab) within the file. ๐) ๐๐ข๐ฉ๐๐ฅ๐ข๐ง๐ ๐๐จ๐ซ ๐๐ฎ๐ง๐ง๐ข๐ง๐ ๐ญ๐ก๐ "๐ ๐๐๐ญ๐ฎ๐ซ๐ ๐๐ซ๐ข๐จ๐ซ๐ข๐ณ๐๐ซ" ๐ ๐๐๐ซ๐ค๐๐จ๐ฐ๐ง ๐๐จ๐จ๐ฅ ๐.๐) ๐๐ฉ๐๐ง ๐ญ๐ก๐ ๐ ๐๐ซ๐จ๐ฃ๐๐๐ญ: Locate and open the Project_Critical_Results.Rproj file. This will launch RStudio with the correct working directory. ๐.๐) ๐๐ง๐ฌ๐ญ๐๐ฅ๐ฅ ๐๐๐ช๐ฎ๐ข๐ซ๐๐ ๐๐๐๐ค๐๐ ๐๐ฌ: Open Install_packages.Rmd in RStudio; Click on "Knit" to install all required R packages. ๐.๐) ๐๐ฑ๐๐๐ฎ๐ญ๐ ๐ญ๐ก๐ "๐ ๐๐๐ญ๐ฎ๐ซ๐ ๐๐ซ๐ข๐จ๐ซ๐ข๐ณ๐๐ซ" ๐๐๐ซ๐ข๐ฉ๐ญ: Open Script_Feature_Prioritizer.Rmd in RStudio; Click on "Knit" to execute the script and generate the output report.