Large-Scale Curated Multivariate Time Series Anomaly Detection Dataset for Laptop Performance Metrics

Published: 7 July 2025| Version 1 | DOI: 10.17632/97jn6xrs84.1
Contributors:
Veena More,
,

Description

High-quality multivariate time-series datasets are significantly less accessible compared to more common data types such as images or text, due to the resource-intensive process of continuous monitoring, precise annotation, and long-term observation. This paper introduces a cost-effective solution in the form of a large-scale, curated dataset specifically designed for anomaly detection in computing systems’ performance metrics. The dataset encompasses 45 GB of multivariate time-series data collected from 66 systems, capturing key performance indicators such as CPU usage, memory consumption, disk I/O, system load, and power consumption across diverse hardware configurations and real-world usage scenarios. Annotated anomalies, including performance degradation and resource inefficiencies, provide a reliable benchmark and ground truth for evaluating anomaly detection models. By addressing the accessibility challenges associated with time-series data, this resource facilitates advancements in machine learning applications, including anomaly detection, predictive maintenance, and system optimisation. Its comprehensive and practical design makes it a foundational asset for researchers and practitioners dedicated to developing reliable and efficient computing systems.

Files

Institutions

  • Akkamahadevi Women's University

Categories

Time Series Analysis, Time Series Prediction, Multivariate Analysis, Time Series, Time Series Modeling

Licence