Audio and Visual Perception Dataset

Published: 16 December 2019 | Version 2 | DOI: 10.17632/3hrb6gsgyn.2
Contributors:
Deepank Verma

Description

This dataset consists of 150 audio clips (10 seconds each) and 200 images collected on local streets within a study area in the city of Mumbai, India. The photographs were taken with a point-and-shoot camera, while the audio clips were recorded with a Zoom H4N Pro recorder and Roland CS-10 EM binaural microphones. The audio clips and images were collected in mixed urbanscapes, which include scenes of high- and mid-rise buildings, slum-like dense settlements, open and green streets, street markets, and empty streets. A perception survey was conducted with these clips: 73 respondents (40 male, 33 female; mean age = 32.11, std-dev = 6.12) were each sent a folder containing 24 images and 18 audio clips randomly sampled from the larger dataset. The respondents were urban planning practitioners and researchers employed in various cities in India. Each respondent viewed each image on a PC monitor and listened to each audio clip through headphones, then rated it on each of the attributes using a Likert scale from 1 (not at all) to 7 (very much).
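The per-respondent sampling described above can be illustrated with a minimal Python sketch. It is not the author's procedure: it assumes items are drawn without replacement within a folder and independently across respondents, and the integer item IDs, the seed, and the `build_folders` helper are hypothetical names introduced only for illustration.

```python
import random

# Sizes taken from the description above.
N_IMAGES, N_AUDIO = 200, 150
IMAGES_PER_FOLDER, AUDIO_PER_FOLDER = 24, 18
N_RESPONDENTS = 73


def build_folders(seed=0):
    """Draw one folder of items per respondent (hypothetical helper).

    Assumption: sampling is without replacement inside a folder and
    independent between respondents; the dataset page does not say.
    """
    rng = random.Random(seed)
    image_ids = list(range(1, N_IMAGES + 1))   # placeholder IDs, not real file names
    audio_ids = list(range(1, N_AUDIO + 1))
    folders = []
    for respondent in range(1, N_RESPONDENTS + 1):
        folders.append({
            "respondent": respondent,
            "images": sorted(rng.sample(image_ids, IMAGES_PER_FOLDER)),
            "audio": sorted(rng.sample(audio_ids, AUDIO_PER_FOLDER)),
        })
    return folders


folders = build_folders()
```

Under these assumptions, the 73 folders hold 73 × 24 = 1752 image slots and 73 × 18 = 1314 audio slots, so each image and each audio clip would be rated by roughly 8.8 respondents on average.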

Files

Institutions

Indian Institute of Technology Bombay

Categories

Urban Planning, Image Acquisition, Perception, Urban Design, Soundscapes, Urban Analysis

Licence