Please use this identifier to cite or link to this item: https://elib.vku.udn.vn/handle/123456789/6216
Title: Exploring Dynamic Parameters for Vietnamese Gender-Independent ASR
Authors: Sotheara, Leang
Eric, Castelli
Dominique, Vaufreydaz
Sethserey, Sam
Keywords: Speech dynamics
Acoustic gesture
Gender-independent automatic speech recognition
Tonal and low-resource language
Issue Date: Jan-2026
Publisher: Springer Nature
Abstract: The dynamic characteristics of speech signal provide temporal information and play an important role in enhancing Automatic Speech Recognition (ASR). In this work, we characterized the acoustic transitions in a ratio plane of Spectral Subband Centroid Frequencies (SSCFs) using polar parameters to capture the dynamic characteristics of the speech and minimize spectral variation. These dynamic parameters were combined with Mel-Frequency Cepstral Coefficients (MFCCs) in Vietnamese ASR to capture more detailed spectral information. The SSCF0 was used as a pseudo-feature for the fundamental frequency (F0) to describe the tonal information robustly. The findings showed that the proposed parameters significantly reduce word error rates and exhibit greater gender independence than the baseline MFCCs.
Description: Lecture Notes in Networks and Systems (LNNS,volume 1581); The 14th Conference on Information Technology and Its Applications (CITA 2025) ; pp: 247-258
URI: https://doi.org/10.1007/978-3-032-00972-2_19
https://elib.vku.udn.vn/handle/123456789/6216
ISBN: 978-3-032-00971-5 (p)
978-3-032-00972-2 (e)
Appears in Collections:CITA 2025 (International)

Files in This Item:

 Sign in to read



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.