A Dataset for Voice-Based Human Identity Recognition

Published: 31 March 2022 | Version 2 | DOI: 10.17632/zw4p4p7sdh.2
Contributor:
Baha' Alsaify

Description

This dataset is divided into two main sub-datasets: samePhrase and differentPhrase. Each speaker has the same label in both sub-datasets. In the samePhrase sub-dataset, each speaker repeats the sentence “Machine Learning 1, 2, 3, 4, 5, 6, 7, 8, 9, 10” ten times, and each sample is between seven and ten seconds long. In the differentPhrase sub-dataset, each speaker contributed ten different samples, each a phrase selected randomly from sources such as books, song lyrics, or one-line texts. No sample in the differentPhrase sub-dataset exceeds ten seconds.
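Because each speaker carries the same label in both sub-datasets, samples can be grouped per speaker across samePhrase and differentPhrase. The following is a minimal Python sketch of that grouping, not part of the dataset itself. The folder layout (`<root>/<sub-dataset>/<speaker label>/<sample>.wav`), the PCM WAV file format, and the helper names `index_dataset` and `duration_seconds` are assumptions made for illustration; adjust them to match the files as actually distributed.

```python
import wave
from collections import defaultdict
from pathlib import Path

# Sub-dataset names as given in the description.
SUBSETS = ("samePhrase", "differentPhrase")


def index_dataset(root):
    """Map each speaker label to its sample paths in both sub-datasets.

    Assumed (hypothetical) layout: <root>/<subset>/<speaker_label>/<sample>.wav
    """
    index = defaultdict(lambda: {name: [] for name in SUBSETS})
    for subset in SUBSETS:
        for path in sorted(Path(root, subset).glob("*/*.wav")):
            # The parent folder name is taken to be the speaker label.
            index[path.parent.name][subset].append(path)
    return dict(index)


def duration_seconds(path):
    """Return the length of a PCM WAV file in seconds (assumes WAV input)."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()
```

With such an index, a quick sanity check is to confirm that every speaker has ten samples in each sub-dataset and that no differentPhrase sample is longer than ten seconds, as the description states.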

Institutions

Jordan University of Science and Technology

Categories

Machine Learning, Human Identification, Human Voice

License