Please use this identifier to cite or link to this item:
https://elib.vku.udn.vn/handle/123456789/6216| Nhan đề: | Exploring Dynamic Parameters for Vietnamese Gender-Independent ASR |
| Tác giả: | Sotheara, Leang Eric, Castelli Dominique, Vaufreydaz Sethserey, Sam |
| Từ khoá: | Speech dynamics Acoustic gesture Gender-independent automatic speech recognition Tonal and low-resource language |
| Năm xuất bản: | thá-2026 |
| Nhà xuất bản: | Springer Nature |
| Tóm tắt: | The dynamic characteristics of speech signal provide temporal information and play an important role in enhancing Automatic Speech Recognition (ASR). In this work, we characterized the acoustic transitions in a ratio plane of Spectral Subband Centroid Frequencies (SSCFs) using polar parameters to capture the dynamic characteristics of the speech and minimize spectral variation. These dynamic parameters were combined with Mel-Frequency Cepstral Coefficients (MFCCs) in Vietnamese ASR to capture more detailed spectral information. The SSCF0 was used as a pseudo-feature for the fundamental frequency (F0) to describe the tonal information robustly. The findings showed that the proposed parameters significantly reduce word error rates and exhibit greater gender independence than the baseline MFCCs. |
| Mô tả: | Lecture Notes in Networks and Systems (LNNS,volume 1581); The 14th Conference on Information Technology and Its Applications (CITA 2025) ; pp: 247-258 |
| Định danh: | https://doi.org/10.1007/978-3-032-00972-2_19 https://elib.vku.udn.vn/handle/123456789/6216 |
| ISBN: | 978-3-032-00971-5 (p) 978-3-032-00972-2 (e) |
| Bộ sưu tập: | CITA 2025 (International) |
Khi sử dụng các tài liệu trong Thư viện số phải tuân thủ Luật bản quyền.