Audio classification using braided convolutional neural networks

Ajmera, Pawan K.

Please use this identifier to cite or link to this item: http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/9687

Title:	Audio classification using braided convolutional neural networks
Authors:	Ajmera, Pawan K.
Keywords:	EEE; Convolutional neural networks (CNNs); GSCv1; Neural networks
Issue Date:	Sep-2020
Publisher:	IET
Abstract:	Convolutional neural networks (CNNs) work surprisingly well and have helped drastically advance the state of the art in image classification. This unprecedented success motivated the application of CNNs to the domain of auditory data. Recent publications suggest hidden Markov models and deep neural networks for audio classification. This study aims to achieve audio classification by representing audio as spectrogram images and then using a CNN-based architecture for classification. It presents an innovative strategy for a CNN-based neural architecture that learns a sparse representation imitating the receptive neurons in the primary auditory cortex in mammals. The feasibility of the proposed architecture is assessed for audio classification tasks on standard benchmark datasets: the Google Speech Commands datasets (GSCv1 and GSCv2) and the UrbanSound8K dataset (US8K). The proposed CNN architecture, referred to as the braided convolutional neural network, achieves 97.15%, 95% and 91.9% average recognition accuracy on the GSCv1, GSCv2 and US8K datasets, respectively, outperforming other deep learning architectures.
URI:	https://ietresearch.onlinelibrary.wiley.com/doi/full/10.1049/iet-spr.2019.0381 http://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/9687
Appears in Collections:	Department of Electrical and Electronics Engineering
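
The abstract describes representing audio as spectrogram images before passing them to a CNN. The record does not give the preprocessing details, so the following is only a minimal sketch of that first step under stated assumptions: the function name log_spectrogram, the 16 kHz sample rate, the 400-sample Hann window and the 160-sample hop are illustrative choices, not values taken from the paper.

```python
import numpy as np

def log_spectrogram(signal, frame_len=400, hop=160, eps=1e-10):
    """Return a (frames, frequency bins) log-magnitude spectrogram.

    frame_len and hop are assumed values (25 ms and 10 ms at 16 kHz),
    not parameters reported in the abstract.
    """
    signal = np.asarray(signal, dtype=np.float64)
    # Zero-pad very short clips so that at least one frame exists.
    if signal.size < frame_len:
        signal = np.pad(signal, (0, frame_len - signal.size))
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        signal[i * hop: i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # Magnitude of the one-sided FFT of each frame, then log compression.
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    return np.log(magnitude + eps)

# Synthetic one-second 1 kHz tone at an assumed 16 kHz sample rate.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)
spec = log_spectrogram(tone)
```

The resulting 2-D array can be treated like a single-channel image and fed to any image classifier; the braided architecture itself is not specified in this record and is not reproduced here.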
