A dataset for fault detection and diagnosis of an air handling unit from a real industrial facility

Published: 10 March 2023| Version 2 | DOI: 10.17632/8x62ntvrg7.2
Michael Ahern,
Dominic O' Sullivan,
Ken Bruton


This dataset was collected for the purpose of applying fault detection and diagnosis (FDD) techniques to real data from an industrial facility. The data for an air handling unit (AHU) is extracted from a building management system (BMS) and aligned with the Project Haystack naming convention. This dataset differs from other publicly available datasets in three main ways. Firstly, the dataset does not contain fault detection ground truth. The lack of labelled datasets in the industrial setting is a significant limitation to the application of FDD techniques found in the literature. Secondly, the granularity of this dataset is much less than other publicly available datasets. Due to data storage constraints, data is recorded at 15-min intervals rather than at 1-min or 5-min intervals. Thirdly, the dataset contains a myriad of data issues. For example, there are missing features, missing time intervals, and inaccurate data. Therefore, we hope this dataset will encourage the development of robust FDD techniques that are more suitable for real world applications.



Science Foundation Ireland, University College Cork National University of Ireland University College Cork Boole Library


Heating Ventilation Air Conditioning Control System, Fault


Science Foundation Ireland