Single and Multi-Label Environmental Sound Classification Using Convolutional Neural Networks

dc.contributor.authorAlvarez-Buylla Puente, Santiago
dc.contributor.departmentChalmers tekniska högskola / Institutionen för arkitektur och samhällsbyggnadstekniksv
dc.contributor.departmentChalmers University of Technology / Department of Architecture and Civil Engineeringen
dc.date.accessioned2019-07-03T14:49:22Z
dc.date.available2019-07-03T14:49:22Z
dc.date.issued2018
dc.description.abstractArtificial neural networks are computational systems made up of simple processing units that have a natural propensity for storing experiential knowledge and making it available for use. In the recent years this technology has seen an exponential growth in the fields of image recognition, natural language processing or speech recognition. However, there is a dearth of research on environmental sound analysis. In combination with IoT and wireless sensor networks, artificial neural networks could help to characterize and therefore better address noise issues present in urban environments. This master thesis investigates the theory and construction of artificial neural net-works for single-label and multi-label multiclass classification of environmental sounds like dog bark, street music or jackhammer. Evaluation to di˙erent cor-ruptions of the sounds are studied, as well as methods to increase robustness to these variations. A convolutional neural network arquitecture is proposed for both tasks. The in-puts to the networks are time-frequency patches extracted from the computed mel-spectrogram of the signals. Dropout and weight decay regularization methods are applied and the cross-entropy loss is optimized using Adam algorithm. Results show that these systems are very sensitive to noise and level corruptions of the inputs. Techniques like data augmentation and amplitude scaling are needed to avoid these issues. Results to the multi-label classification task show that it is still possible for a neural network to learn in a complicated mixed environment. However there is still room for improvement regarding prediction accuracy. Since no previous benchmarks are available for comparison, this study sets the stage for the multi-label classification task using UrbanSound8K dataset.
dc.identifier.urihttps://hdl.handle.net/20.500.12380/255604
dc.language.isoeng
dc.setspec.uppsokTechnology
dc.subjectAkustik
dc.subjectBuilding Futures
dc.subjectAcoustics
dc.subjectBuilding Futures
dc.titleSingle and Multi-Label Environmental Sound Classification Using Convolutional Neural Networks
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster Thesisen
dc.type.uppsokH
local.programmeSound and vibration (MPSOV), MSc
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
255604.pdf
Storlek:
4.26 MB
Format:
Adobe Portable Document Format
Beskrivning:
Fulltext