Please use this identifier to cite or link to this item:
https://elib.vku.udn.vn/handle/123456789/4287
Title: | Data Augmentation Methods for Cross-Device Acoustic Scene Classification |
Authors: | Dang, An Vu, Toan |
Keywords: | TAU Urban Acoustic Scene 2020 Mobile dataset, featuring audio scenes recorded by multiple devices; Neural network (DNN) methods have improved the accuracy of acoustic scene classification (ASC) |
Publication year: | 2024 |
Publisher: | Springer Nature |
Abstract: | Recent advances in deep neural network (DNN) methods have improved the accuracy of acoustic scene classification (ASC). However, these DNN systems have struggled to classify audio scenes across domains, and when faced with domain imbalance in ASC datasets. In this study, we propose an ASC system that addresses these issues using two data augmentation methods. The first method, MixStyleFreq, reduces device mismatch problems by combining the frequency-wise means and standard deviations of convolutional feature maps from different audio scenes. The second method, Spectrum Normalization Augmentation (SpecNormAug), generates additional data for minority devices based on majority devices, improving the representation of minority devices and reducing bias in DNNs toward dominant devices. Our model is built on the efficient MobileNetV2 network, suitable for ASC applications on devices with limited computational capacity. We evaluate our methods on the TAU Urban Acoustic Scene 2020 Mobile dataset, featuring audio scenes recorded by multiple devices. Our approaches significantly improve generalization performance for ASC tasks compared to other data augmentation methods and achieve competitive results compared to state-of-the-art methods. |
Description: | Lecture Notes in Networks and Systems (LNNS, volume 882); The 13th Conference on Information Technology and Its Applications (CITA 2024); pp. 283-294. |
Identifier: | https://elib.vku.udn.vn/handle/123456789/4287 https://doi.org/10.1007/978-3-031-74127-2_24 |
ISBN: | 978-3-031-74126-5 |
Collection: | CITA 2024 (International) |
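The abstract describes MixStyleFreq as combining the frequency-wise means and standard deviations of convolutional feature maps from different audio scenes. The NumPy sketch below illustrates that general idea only; it is not taken from the paper. It assumes feature maps shaped (batch, channels, frequency, time), statistics reduced over the channel and time axes, and a Beta-distributed mixing weight as in the original MixStyle technique. The function name `mix_style_freq` and the parameters `alpha` and `eps` are hypothetical, and the authors' exact formulation may differ.

```python
import numpy as np


def mix_style_freq(x, alpha=0.3, eps=1e-6, rng=None):
    """Mix frequency-wise statistics between samples of a batch.

    x: array of shape (batch, channels, freq, time).
    alpha: Beta distribution parameter (assumed value, not from the paper).
    """
    rng = np.random.default_rng() if rng is None else rng
    batch = x.shape[0]

    # Per-sample, per-frequency-bin mean and standard deviation,
    # reduced over the channel and time axes (an assumption).
    mu = x.mean(axis=(1, 3), keepdims=True)
    sigma = np.sqrt(x.var(axis=(1, 3), keepdims=True) + eps)

    # Remove each sample's own frequency-wise statistics.
    x_norm = (x - mu) / sigma

    # One mixing weight per sample, and a random partner sample.
    lam = rng.beta(alpha, alpha, size=(batch, 1, 1, 1))
    perm = rng.permutation(batch)

    # Interpolate statistics between each sample and its partner.
    mu_mix = lam * mu + (1.0 - lam) * mu[perm]
    sigma_mix = lam * sigma + (1.0 - lam) * sigma[perm]

    # Re-apply the mixed statistics to the normalized features.
    return x_norm * sigma_mix + mu_mix


# Usage with random stand-in feature maps (not real audio features).
rng = np.random.default_rng(0)
features = rng.normal(size=(4, 8, 16, 32))
augmented = mix_style_freq(features, rng=rng)
```

In this sketch the content of each feature map (its normalized pattern) is preserved, while its per-frequency statistics, which tend to carry device characteristics, are blended with those of another sample in the batch. Such mixing would typically be applied only during training.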
Use of materials in the Digital Library must comply with copyright law.