Bat Echolocation Call Analysis with Deep Learning Models

Name: Bat Echolocation Call Analysis with Deep Learning Models
Creator: Sercan Alipek
Published: 2024-06-26T14:59:40.599Z
Keywords: Animal Acoustics, Supervised Learning, Unsupervised Learning, Clustering, Biological Classification, Data Analysis, Image Classification, Classification System, Bioacoustics, Sound Detection Algorithm, Echolocation Behavior, Data Analysis in Ecology, Convolutional Neural Network, Deep Learning, Pattern Recognition Clustering Process, Autoencoder, Cluster Analysis

Alipek, Sercan; Maelzer, Moritz; Moll, Jochen

doi:10.17632/9x2g6dsbtv.2

Bat Echolocation Call Analysis with Deep Learning Models

Published: 26 June 2024| Version 2 | DOI: 10.17632/9x2g6dsbtv.2

Contributors:

,

Description

This code project contains a multitude of scripts often required for understanding bat echolocation call data. Basic scripts in the data preparation are provided for: - cleaning audio data - processing audio data into linear spectrograms or Mel-Frequency-Cepstral-Coefficients (MFCC) - creating unique IDs for each pair of image and corresponding label The optimization core contains the following functionalities: - basic but highly customizable training and validation functions for neural network models - performance metrics to evaluate models for various tasks, binary and multi-class, respectively - logger classes to track various values and parameters The analysis part is divided into an unsupervised learning approach and a supervised learning approach. First, with the help of unsupervised methods like autoencoders and clustering algorithms like UMAP, the datasets can be analysed for general geometric properties. The convolutional autoencoder is used to learn an efficient representation of the data and feed the UMAP clustering algorithm with highly compressed latent feature vectors for faster convergence. The clustering can reveal similarities between specific classes then and guide the further analysis with supervised learning methods by adjusting the composition of classes and dataset sources before deciding to train a supervised neural network model. Dataset sources encompass meta data like year/time of measurement and location/position of measurement devices. The used dataset is confidential and this code project is hand-tailored to this dataset. Nevertheless, all concepts used in this project are applicable to most animal call datasets that provide labeled audio data. Thus, this code project can be a helpful template for future research in animal acoustic analysis and specifically for bat echolocation call data.

Files

Steps to reproduce

Readme file is provided. See "README.md" in project folder for more information.

Institutions

Goethe-Universitat Frankfurt am Main Campus Riedberg, Universitat Siegen Department Maschinenbau, Goethe-Universitat Frankfurt am Main

Bat Echolocation Call Analysis with Deep Learning Models

Description

Files

Steps to reproduce

Institutions

Categories

Licence