KneeAI Reproducibility Package for Uncertainty-Aware Knee Osteoarthritis Severity Assessment

Published: 22 June 2026| Version 4 | DOI: 10.17632/cgjjbw8hsf.4
Contributors:
Kevin Alejandro Yepez Chafla, Emmily Villacreses Martínez

Description

This repository provides the reproducibility package for the study “Uncertainty-Aware Clinical Decision Support for Knee Osteoarthritis Severity Assessment from Radiographs Using a Hybrid KL 5-to-3 Strategy.” It includes manifests, split-integrity audits, notebooks, figures, tables, statistical outputs, and documentation for an uncertainty-aware knee osteoarthritis (KOA) severity-assessment research prototype. The analyzed data were derived from the public Kaggle Knee Osteoarthritis Dataset with Severity Grading. The final manifest contains 8,260 radiographs from 4,130 corrected identifiers, split at the corrected-identifier/laterality-pair level into 5,282 training, 1,322 validation, and 1,656 independent test images, with no corrected-identifier overlap. This should not be interpreted as definitive patient-level partitioning because curated clinical patient identifiers and metadata were not available in the public mirror. The model uses EfficientNetB3 with internal KL 0–4 supervision and post-hoc KL 5-to-3 aggregation: KL-0/1 = Non-OA, KL-2/3 = Mild–Moderate OA, and KL-4 = Severe OA. The software is a research prototype only, not a clinically validated diagnostic device.

Files

Steps to reproduce

1. Load the dataset and ensure patient-wise separation between training, validation, and test sets. 2. Resize all images to 300×300 pixels. 3. Apply normalization using EfficientNet preprocessing (ImageNet standardization). 4. Use data augmentation (rotation, zoom, shifts) during training. 5. Train an EfficientNetB3 model using transfer learning. 6. Apply staged training: initial warm-up (frozen backbone) followed by fine-tuning. 7. Use Bayesian optimization (Optuna) to tune learning rate, dropout, and L2 regularization. 8. Evaluate both 5-class and 3-class classification schemes. 9. Validate using accuracy, F1-score, AUC, and confusion matrix.

Institutions

Categories

Radiology, Artificial Intelligence, Biomedical Engineering, Medical Imaging, Machine Learning

Licence