Air Pollution Dataset from Queen Elizabeth Olympic Park Development (2013, 2016, 2019)
Description
This dataset was compiled for the study “Public Infrastructure Impact on Air Quality: Empirical Evidence from Queen Elizabeth Olympic Park Development.” It includes annual concentrations of NO₂, NOₓ, PM₁₀, and PM₂.₅ for the years 2013, 2016, and 2019, measured in micrograms per cubic metre (μg/m³) at a high spatial resolution of 20 metres. Data were sourced from the Greater London Authority. Supplementary variables include average annual rainfall index (Met Office), annual traffic volume by road segment (GLA), and population density by ward (GLA). The dataset was used to estimate the causal effects of infrastructure development on air quality using a Regression Discontinuity Design (RDD).
Files
Steps to reproduce
To reproduce the results, load the pollution-specific dataset (e.g. NO2_complete.Rdata) in R and follow the included script. The analysis includes: (1) covariate balance tests via Welch’s t-test, (2) McCrary density test for continuity of the running variable, (3) sharp Regression Discontinuity Design (RDD) estimation using felm(), (4) multicollinearity diagnostics using VIF, (5) placebo tests with shifted cutoff points, and (6) polynomial sensitivity analysis. Output tables and visualizations are generated using stargazer, ggplot2, and rdplotdensity. The same code structure is used for NO₂, NOₓ, PM₁₀, and PM₂.₅ datasets, which are provided in separate scripts.
Institutions
Categories
Funding
Ministry of Finance of the Republic of Indonesia