Please use this identifier to cite or link to this item:
https://elib.vku.udn.vn/handle/123456789/6216
| Title: | Exploring Dynamic Parameters for Vietnamese Gender-Independent ASR |
| Authors: | Sotheara, Leang; Eric, Castelli; Dominique, Vaufreydaz; Sethserey, Sam |
| Keywords: | Speech dynamics; Acoustic gesture; Gender-independent automatic speech recognition; Tonal and low-resource language |
| Issue Date: | Jan-2026 |
| Publisher: | Springer Nature |
| Abstract: | The dynamic characteristics of the speech signal provide temporal information and play an important role in enhancing Automatic Speech Recognition (ASR). In this work, we characterized the acoustic transitions in a ratio plane of Spectral Subband Centroid Frequencies (SSCFs) using polar parameters to capture the dynamic characteristics of speech and minimize spectral variation. These dynamic parameters were combined with Mel-Frequency Cepstral Coefficients (MFCCs) in Vietnamese ASR to capture more detailed spectral information. SSCF0 was used as a pseudo-feature for the fundamental frequency (F0) to describe the tonal information robustly. The findings showed that the proposed parameters significantly reduced word error rates and exhibited greater gender independence than the baseline MFCCs. |
| Description: | Lecture Notes in Networks and Systems (LNNS, volume 1581); The 14th Conference on Information Technology and Its Applications (CITA 2025); pp. 247-258 |
| URI: | https://doi.org/10.1007/978-3-032-00972-2_19; https://elib.vku.udn.vn/handle/123456789/6216 |
| ISBN: | 978-3-032-00971-5 (p); 978-3-032-00972-2 (e) |
| Appears in Collections: | CITA 2025 (International) |
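The feature pipeline summarized in the abstract (subband centroids, a ratio plane, and polar parameters of the transitions) can be illustrated with a rough Python sketch. This is not the authors' implementation: the record does not give the subband edges, which centroid ratios span the plane, or how the polar parameters are defined. The band edges passed in, the choice of SSCF1/SSCF2 and SSCF2/SSCF3 as axes, and the helper names `sscf`, `ratio_plane`, and `polar_transition` are all assumptions made for illustration.

```python
import numpy as np


def sscf(frame, sample_rate, band_edges_hz, gamma=1.0):
    """Spectral subband centroid frequency (Hz) of each subband for one frame.

    band_edges_hz is a list of (low, high) pairs; the values are an
    assumption of this sketch, not taken from the paper.
    """
    windowed = frame * np.hamming(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    centroids = []
    for low, high in band_edges_hz:
        mask = (freqs >= low) & (freqs < high)
        weights = power[mask] ** gamma
        total = weights.sum()
        if total > 0:
            # Power-weighted mean frequency inside the subband.
            centroids.append((freqs[mask] * weights).sum() / total)
        else:
            # No energy in the band: fall back to the band midpoint.
            centroids.append(0.5 * (low + high))
    return np.array(centroids)


def ratio_plane(sscf_frames):
    """Map per-frame centroids of shape (n_frames, 3) to a 2-D ratio plane.

    Assumed axes: x = SSCF1/SSCF2, y = SSCF2/SSCF3 (hypothetical choice).
    """
    s = np.asarray(sscf_frames, dtype=float)
    return np.column_stack((s[:, 0] / s[:, 1], s[:, 1] / s[:, 2]))


def polar_transition(points):
    """Radius and angle of the frame-to-frame displacement in a 2-D plane."""
    deltas = np.diff(np.asarray(points, dtype=float), axis=0)
    radius = np.hypot(deltas[:, 0], deltas[:, 1])
    angle = np.arctan2(deltas[:, 1], deltas[:, 0])
    return radius, angle
```

In such a setup, the radius and angle for each frame pair would be appended to that frame's MFCC vector before training the recognizer; how the paper actually combines them is not stated in this record.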