H-Voice: Fake voice histograms (Imitation+DeepVoice)

Published: 31 Jan 2020 | Version 4 | DOI: 10.17632/k47yd3m28w.4
Contributor(s):

Description of this data

This data set consists of (6672) histograms of original voice recordings and fake voice recordings obtained by Imitation [1, 2] and Deep Voice [3]. The histograms provided in this dataset can be used to train a machine learning system to classify original and fake voice recordings obtained with the imitation and Deep Voice algorithms.

Each directory has the following composition:
-- corrupted images have been fixed --
Training_fake: 2088 histograms of fake voice recordings (2016 with Imitation and with 72 Deep Voice)
Training_original: 2020 histograms of original voice recordings
Validation_fake: 864 histograms of fake voice recordings (all with Imitation)
Validation_original: 864 histograms of original voice recordings
External_test1: 760 histograms (380 original + 380 fake with Imitation)
External_test2: 76 histograms (4 original + 72 fake with Deep Voice)

References:
[1] DM Ballesteros L, JM Moreno A. Highly transparent steganography model of speech signals using Efficient Wavelet Masking. Expert Systems with Applications 39 (10), 2012, 9141-9149, https://doi.org/10.1016/j.eswa.2012.02.066
[2] DM Ballesteros L, JM Moreno A. On the ability of adaptation of speech signals and data hiding, Expert Systems with Applications 39 (16), 2012, 12574-12579, https://doi.org/10.1016/j.eswa.2012.05.027
[3] S.O. Arik, M. Chrzanowski, A. Coates, G. Diamos, A. Gibiansky, Y. Kang, X. Li, J. Miller, A. Ng, J. Raiman, S. Sengupta, M. Shoeybi. Deep Voice: Real-time Neural Text-to-Speech. 2017. https://arxiv.org/abs/1702.07825

Experiment data files

  • External_test1
    Cite
  • External_test2
    Cite
  • Training_fake
    Cite
  • Training_original
    Cite
  • Validation_fake
    Cite
  • Validation_original
    Cite

Steps to reproduce

The histograms provided in this dataset can be used to train an machine learning system to classify original and fake voice recordings obtained with the Imitation and Deep Voice algorithms. A detailed description of this dasaset has been submitted to the journal Data in Brief, with the title "A dataset of histograms of original and fake voice recordings (H-Voice)".

Related links

Latest version

  • Version 4

    2020-01-31

    Published: 2020-01-31

    DOI: 10.17632/k47yd3m28w.4

    Cite this dataset

    Ballesteros L, Dora Maria; Rodriguez, Yohanna Patricia; Renza, Diego (2020), “H-Voice: Fake voice histograms (Imitation+DeepVoice)”, Mendeley Data, v4 http://dx.doi.org/10.17632/k47yd3m28w.4

Statistics

Views: 8362
Downloads: 7241

Previous versions

Compare to version

Institutions

Universidad Militar Nueva Granada

Categories

Computer Vision, Speech Processing, Machine Learning

Licence

CC BY 4.0 Learn more

The files associated with this dataset are licensed under a Creative Commons Attribution 4.0 International licence.

What does this mean?
You can share, copy and modify this dataset so long as you give appropriate credit, provide a link to the CC BY license, and indicate if changes were made, but you may not do so in a way that suggests the rights holder has endorsed you or your use of the dataset. Note that further permission may be required for any content within the dataset that is identified as belonging to a third party.

Report