A Comprehensive Central Kurdish Sound Dataset for Robust Speech-to-Text Transformation

Published: 19 April 2024 | Version 3 | DOI: 10.17632/gft65z43hs.3

Description

This dataset supports research on Speech Recognition Technology (SRT) for Central Kurdish. Its speakers span a wide range of ages, from adolescents to individuals in their fifties. The raw collection comprises 1,739,089 entries, and a careful curation process yielded a total of 1,683 hours of data. The result allows language acquisition patterns to be examined across different age cohorts within the Central Kurdish linguistic domain.

Institutions

University of Halabja

Categories

Speaker Recognition, Kurd, Deep Learning, Speech Synthesis
