Please use this identifier to cite or link to this item: https://elib.vku.udn.vn/handle/123456789/4287
Full metadata record
DC FieldValueLanguage
dc.contributor.authorDang, An-
dc.contributor.authorVu, Toan-
dc.date.accessioned2024-12-06T06:59:30Z-
dc.date.available2024-12-06T06:59:30Z-
dc.date.issued2024-11-
dc.identifier.isbn978-3-031-74126-5-
dc.identifier.urihttps://elib.vku.udn.vn/handle/123456789/4287-
dc.identifier.urihttps://doi.org/10.1007/978-3-031-74127-2_24-
dc.descriptionLecture Notes in Networks and Systems (LNNS,volume 882); The 13th Conference on Information Technology and Its Applications (CITA 2024) ; pp: 283-294.vi_VN
dc.description.abstractRecent advances in deep neural network (DNN) methods have improved the accuracy of acoustic scene classification (ASC). However, these DNN systems have struggled to classify audio scenes across domains, and when faced with domain imbalance in ASC datasets. In this study, we propose an ASC system that addresses these issues using two data augmentation methods. The first method, MixStyleFreq, reduces device mismatch problems by combining the frequency-wise means and standard deviations of convolutional feature maps from different audio scenes. The second method, Spectrum Normalization Augmentation (SpecNormAug), generates additional data for minority devices based on majority devices, improving the representation of minority devices and reducing bias in DNNs toward dominant devices. Our model is built on the efficient MobileNetV2 network, suitable for ASC applications on devices with limited computational capacity. We evaluate our methods on the TAU Urban Acoustic Scene 2020 Mobile dataset, featuring audio scenes recorded by multiple devices. Our approaches significantly improve generalization performance for ASC tasks compared to other data augmentation methods and achieve competitive results compared to state-of-the-art methods.vi_VN
dc.language.isoenvi_VN
dc.publisherSpringer Naturevi_VN
dc.subjectTAU Urban Acoustic Scene 2020 Mobile dataset, featuring audio scenes recorded by multiple devicesvi_VN
dc.subjectNeural network (DNN) methods have improved the accuracy of acoustic scene classification (ASC)vi_VN
dc.titleData Augmentation Methods for Cross-Device Acoustic Scene Classificationvi_VN
dc.typeWorking Papervi_VN
Appears in Collections:CITA 2024 (International)

Files in This Item:

 Sign in to read



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.