Time-Domain Target Speaker Extraction with Parallel Intra and Inter-framework

Ha, Minh Tan; Le, Dinh Nguyen; Dang, An

Please use this identifier to cite or link to this item: https://elib.vku.udn.vn/handle/123456789/6223

Title:	Time-Domain Target Speaker Extraction with Parallel Intra and Inter-framework
Authors:	Ha, Minh Tan; Le, Dinh Nguyen; Dang, An
Keywords:	Target speaker extraction; Informed talker extraction; Time-domain talker extraction; Parallel intra- and inter-framework; Deep learning; End-to-end deep neural network
Year of publication:	2026
Publisher:	Springer Nature
Abstract:	Speaker extraction isolates a specific speaker’s voice from a blend of other speakers using supplementary information. This paper proposes a time-domain speaker extraction method using a parallel intra- and inter-framework (TSEPII). An efficient intra- and inter-architecture converts the mixed utterance into multi-scale embedding coefficients. Additionally, we incorporate parallel architectures to achieve more stability than previous single architectures. The architecture comprises four main components: the auxiliary encoder (the talker encoding block), the extraction encoder (the utterance encoding block), the talker extraction block, and the extraction decoder (the utterance decoding block). In particular, processing the raw voice in the time domain preserves important information. The utterance encoding block transforms the mixed voice into multi-scale embedding values, while the talker encoding block learns a talker embedding feature that represents the target talker. The talker extraction block plays a central role: it takes the multi-scale embedding values and the talker embedding feature as input and estimates the time-domain mask for the system. Finally, the utterance decoding block reconstructs the utterance of the target talker. Experiments show that TSEPII achieves state-of-the-art performance and is competitive with current methods.
Description:	Lecture Notes in Networks and Systems (LNNS, volume 1581); The 14th Conference on Information Technology and Its Applications (CITA 2025); pp. 147-158
Identifier:	https://doi.org/10.1007/978-3-032-00972-2_12; https://elib.vku.udn.vn/handle/123456789/6223
ISBN:	978-3-032-00971-5 (print); 978-3-032-00972-2 (electronic)
Appears in collections:	CITA 2025 (International)
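
The four blocks named in the abstract (utterance encoder, talker encoder, talker extraction block, utterance decoder) can be illustrated with a minimal NumPy sketch of mask-based time-domain extraction. This is not the authors' implementation: the weights are random and untrained, the window lengths, hop, and feature sizes are assumed values, every function name is hypothetical, and a single sigmoid layer stands in for the parallel intra- and inter-framework, which is not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)
# Assumed sizes for illustration only (not taken from the paper).
WINS, HOP, N, D = (20, 80, 160), 10, 64, 32

def frame(x, win, hop):
    """Slice a 1-D signal into overlapping frames of length `win`."""
    n = 1 + (len(x) - win) // hop
    idx = np.arange(win)[None, :] + hop * np.arange(n)[:, None]
    return x[idx]                                   # (n_frames, win)

def encode(x, bases):
    """Multi-scale encoder: one linear basis + ReLU per window length."""
    feats = []
    for win, basis in zip(WINS, bases):
        xp = np.pad(x, (0, win - WINS[0]))          # equal frame count per scale
        feats.append(np.maximum(frame(xp, win, HOP) @ basis, 0.0))
    return np.concatenate(feats, axis=1)            # (n_frames, 3 * N)

def overlap_add(frames, hop):
    """Rebuild a waveform by summing overlapping frames."""
    n, win = frames.shape
    out = np.zeros(hop * (n - 1) + win)
    for i in range(n):
        out[i * hop:i * hop + win] += frames[i]
    return out

# Random, untrained parameters (hypothetical stand-ins for learned layers).
enc_bases = [rng.standard_normal((w, N)) / np.sqrt(w) for w in WINS]
spk_proj = rng.standard_normal((3 * N, D)) / np.sqrt(3 * N)
mask_proj = rng.standard_normal((3 * N + D, 3 * N)) / np.sqrt(3 * N + D)
dec_basis = rng.standard_normal((N, WINS[0])) / np.sqrt(N)

def extract(mix, ref):
    feats = encode(mix, enc_bases)                      # utterance encoding block
    emb = encode(ref, enc_bases).mean(axis=0) @ spk_proj  # talker encoding block
    joint = np.concatenate([feats, np.tile(emb, (len(feats), 1))], axis=1)
    mask = 1.0 / (1.0 + np.exp(-(joint @ mask_proj)))   # talker extraction block
    masked = feats * mask
    # Utterance decoding block; only the shortest scale is decoded for brevity.
    return overlap_add(masked[:, :N] @ dec_basis, HOP), mask

mix = rng.standard_normal(8000)   # stand-in for a mixed utterance
ref = rng.standard_normal(4000)   # stand-in for the target talker's reference
est, mask = extract(mix, ref)
print(est.shape, mask.shape)
```

With these assumed sizes the estimate has the same number of samples as the mixture, and the mask holds one value in [0, 1] per frame and encoder channel.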

