MFCCs Feature Scaling Images for Multi-class Human Action Analysis : A Benchmark Dataset

Name: MFCCs Feature Scaling Images for Multi-class Human Action Analysis : A Benchmark Dataset
Creator: Muhammad Bilal Shaikh
Published: 2023-07-25T22:00:36.303Z
Keywords: Computer Vision Representation, Benchmarking, Multimodality, Action Recognition

Shaikh, Muhammad Bilal; Chai, Douglas; Islam, Syed Mohammed Shamsul; Akhtar, Naveed

doi:10.17632/6d8v9jmvgm.1

MFCCs Feature Scaling Images for Multi-class Human Action Analysis : A Benchmark Dataset

Published: 25 July 2023| Version 1 | DOI: 10.17632/6d8v9jmvgm.1

Contributors:

,

Description

his dataset comprises an array of Mel Frequency Cepstral Coefficients (MFCCs) that have undergone feature scaling, representing a variety of human actions. Feature scaling, or data normalization, is a preprocessing technique used to standardize the range of features in the dataset. For MFCCs, this process helps ensure all coefficients contribute equally to the learning process, preventing features with larger scales from overshadowing those with smaller scales. In this dataset, the audio signals correspond to diverse human actions such as walking, running, jumping, and dancing. The MFCCs are calculated via a series of signal processing stages, which capture key characteristics of the audio signal in a manner that closely aligns with human auditory perception. The coefficients are then standardized or scaled using methods such as MinMax Scaling or Standardization, thereby normalizing their range. Each normalized MFCC vector corresponds to a segment of the audio signal. The dataset is meticulously designed for tasks including human action recognition, classification, segmentation, and detection based on auditory cues. It serves as an essential resource for training and evaluating machine learning models focused on interpreting human actions from audio signals. This dataset proves particularly beneficial for researchers and practitioners in fields such as signal processing, computer vision, and machine learning, who aim to craft algorithms for human action analysis leveraging audio signals.

Files

Steps to reproduce

https://github.com/mbilalshaikh/pymaivar/blob/main/README.md

Institutions

Edith Cowan University, University of Western Australia

Funding

Higher Education Commission

PM/HRDI-UESTPs/UETs-I/Phase-1/Batch-VI/2018

Office of National Intelligence, Government of Australia

NIPG-2021–001

MFCCs Feature Scaling Images for Multi-class Human Action Analysis : A Benchmark Dataset

Description

Files

Steps to reproduce

Institutions

Categories

Funding

Related Links

Licence