Please use this identifier to cite or link to this item: https://elib.vku.udn.vn/handle/123456789/4287
Title: Data Augmentation Methods for Cross-Device Acoustic Scene Classification
Authors: Dang, An
Vu, Toan
Keywords: TAU Urban Acoustic Scene 2020 Mobile dataset, featuring audio scenes recorded by multiple devices
Neural network (DNN) methods have improved the accuracy of acoustic scene classification (ASC)
Issue Date: Nov-2024
Publisher: Springer Nature
Abstract: Recent advances in deep neural network (DNN) methods have improved the accuracy of acoustic scene classification (ASC). However, these DNN systems have struggled to classify audio scenes across domains, and when faced with domain imbalance in ASC datasets. In this study, we propose an ASC system that addresses these issues using two data augmentation methods. The first method, MixStyleFreq, reduces device mismatch problems by combining the frequency-wise means and standard deviations of convolutional feature maps from different audio scenes. The second method, Spectrum Normalization Augmentation (SpecNormAug), generates additional data for minority devices based on majority devices, improving the representation of minority devices and reducing bias in DNNs toward dominant devices. Our model is built on the efficient MobileNetV2 network, suitable for ASC applications on devices with limited computational capacity. We evaluate our methods on the TAU Urban Acoustic Scene 2020 Mobile dataset, featuring audio scenes recorded by multiple devices. Our approaches significantly improve generalization performance for ASC tasks compared to other data augmentation methods and achieve competitive results compared to state-of-the-art methods.
Description: Lecture Notes in Networks and Systems (LNNS,volume 882); The 13th Conference on Information Technology and Its Applications (CITA 2024) ; pp: 283-294.
URI: https://elib.vku.udn.vn/handle/123456789/4287
https://doi.org/10.1007/978-3-031-74127-2_24
ISBN: 978-3-031-74126-5
Appears in Collections:CITA 2024 (International)

Files in This Item:

 Sign in to read



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.