Please use this identifier to cite or link to this item: https://elib.vku.udn.vn/handle/123456789/6216
Nhan đề: Exploring Dynamic Parameters for Vietnamese Gender-Independent ASR
Tác giả: Sotheara, Leang
Eric, Castelli
Dominique, Vaufreydaz
Sethserey, Sam
Từ khoá: Speech dynamics
Acoustic gesture
Gender-independent automatic speech recognition
Tonal and low-resource language
Năm xuất bản: thá-2026
Nhà xuất bản: Springer Nature
Tóm tắt: The dynamic characteristics of speech signal provide temporal information and play an important role in enhancing Automatic Speech Recognition (ASR). In this work, we characterized the acoustic transitions in a ratio plane of Spectral Subband Centroid Frequencies (SSCFs) using polar parameters to capture the dynamic characteristics of the speech and minimize spectral variation. These dynamic parameters were combined with Mel-Frequency Cepstral Coefficients (MFCCs) in Vietnamese ASR to capture more detailed spectral information. The SSCF0 was used as a pseudo-feature for the fundamental frequency (F0) to describe the tonal information robustly. The findings showed that the proposed parameters significantly reduce word error rates and exhibit greater gender independence than the baseline MFCCs.
Mô tả: Lecture Notes in Networks and Systems (LNNS,volume 1581); The 14th Conference on Information Technology and Its Applications (CITA 2025) ; pp: 247-258
Định danh: https://doi.org/10.1007/978-3-032-00972-2_19
https://elib.vku.udn.vn/handle/123456789/6216
ISBN: 978-3-032-00971-5 (p)
978-3-032-00972-2 (e)
Bộ sưu tập: CITA 2025 (International)

Các tập tin trong tài liệu này:

 Đăng nhập để xem toàn văn



Khi sử dụng các tài liệu trong Thư viện số phải tuân thủ Luật bản quyền.