Vui lòng dùng định danh này để trích dẫn hoặc liên kết đến tài liệu này: https://elib.vku.udn.vn/handle/123456789/6197
Toàn bộ biểu ghi siêu dữ liệu
Trường DCGiá trị Ngôn ngữ
dc.contributor.authorNguyen, Xuan Thang-
dc.contributor.authorNguyen, Thanh Vinh-
dc.contributor.authorNguyen, Thuy Duong-
dc.contributor.authorHoang, Tran Huy Son-
dc.contributor.authorNguyen, Gia Bao-
dc.contributor.authorNguyen, Thị Ngoc Thao-
dc.date.accessioned2026-01-19T09:37:31Z-
dc.date.available2026-01-19T09:37:31Z-
dc.date.issued2026-01-
dc.identifier.isbn978-3-032-00971-5 (p)-
dc.identifier.isbn978-3-032-00972-2 (e)-
dc.identifier.urihttps://doi.org/10.1007/978-3-032-00972-2_38-
dc.identifier.urihttps://elib.vku.udn.vn/handle/123456789/6197-
dc.descriptionLecture Notes in Networks and Systems (LNNS,volume 1581); The 14th Conference on Information Technology and Its Applications (CITA 2025) ; pp: 519-531vi_VN
dc.description.abstractRetrieval Augmented Generation (RAG) is a popular approach that enhances the accuracy of Large Language Models (LLMs) by leveraging a knowledge base. It is rapidly becoming integral tools across various applications. However, as the use of RAG continues to expand, so do the challenges associated with their deployment, particularly in terms of data privacy. As a part of RAG pipeline, user query and all retrieved documents should be sent as a prompt to the LLM providers, leaving them open to privacy hazards such data leaks or illegal access. This study presents RLPT, a framework designed to enhance user privacy in RAG. It achieves this by identifying and eliminating sensitive information from user inputs before sending them to the LLM. The RLPT framework utilizes a local LLM to rapidly identify sensitive information in user input and subsequently replaces it with distinctive placeholders. These placeholders are used to indicate and hide the actual sensitive data, ensuring that the LLM does not capture the original sensitive information during prompt processing. The framework is evaluated using a dataset consisting of 4000 synthesized context documents. The results indicate that it is capable of accurately detecting and filtering privacy and sensitive information, achieving a high accuracy rate of 88,7%.vi_VN
dc.language.isoenvi_VN
dc.publisherSpringer Naturevi_VN
dc.subjectRetrieval-augmented generationvi_VN
dc.subjectLarge language modelvi_VN
dc.subjectPrivacy protectionsvi_VN
dc.subjectData anonymizationvi_VN
dc.titlePreserving User Privacy in Retrieval Augmented Generation: A Novel Approach Using Local Placeholder Taggingvi_VN
dc.typeWorking Papervi_VN
Bộ sưu tập: CITA 2025 (International)

Các tập tin trong tài liệu này:

 Đăng nhập để xem toàn văn



Khi sử dụng các tài liệu trong Thư viện số phải tuân thủ Luật bản quyền.