Dataset: The Death Spiral of Open Source Projects: A Post-Mortem Analysis of Pull Request Workflow Dynamics
Description
This dataset is the replication package for the research article "The Death Spiral of Open Source Projects: A Post-Mortem Analysis of Pull Request Workflow Dynamics" (accepted for publication in the Journal of Systems and Software). It contains a large-scale, mined dataset of nearly 4 million human-driven pull requests (PRs) and over 6.3 million comments across 3,472 GitHub repositories. The data is structured to facilitate post-mortem analysis of Open Source Software (OSS) project mortality. It contrasts the micro-level workflow dynamics of an Inactive Cohort (abandoned projects) against a 1:1 structurally matched Active Cohort (successful projects).
Steps to reproduce
All required steps are described in the manuscript. If you use this dataset in your study, please cite the paper, available at https://doi.org/10.1016/j.jss.2026.112942.
Institutions
- Guru Nanak Dev University, Punjab, Amritsar