Please use this identifier to cite or link to this item: https://elib.vku.udn.vn/handle/123456789/4279
Full metadata record
DC FieldValueLanguage
dc.contributor.authorNguyen, Anh D-
dc.contributor.authorPham, Dang H-
dc.contributor.authorNguyen, Duc M-
dc.contributor.authorNguyen, Hoa N-
dc.date.accessioned2024-12-04T09:01:27Z-
dc.date.available2024-12-04T09:01:27Z-
dc.date.issued2024-11-
dc.identifier.isbn978-3-031-74126-5-
dc.identifier.urihttps://elib.vku.udn.vn/handle/123456789/4279-
dc.identifier.urihttps://doi.org/10.1007/978-3-031-74127-2_16-
dc.descriptionLecture Notes in Networks and Systems (LNNS,volume 882); The 13th Conference on Information Technology and Its Applications (CITA 2024) ; pp: 184-196.vi_VN
dc.description.abstractThe goal of text-based person search is to use a textual description query to find the required individual within an image gallery. There are two major obstacles to this task: (i) how to bridge the gap between the textual and visual modalities in the feature space and (ii) how to improve the image representation by removing the focus of the model on unnecessary regions from the image as the same way we do on text input. To address these challenges, we propose cMDTPS, a comprehensive masked modality modeling method with improved similarity distribution matching loss. The proposed method consists of two components: (i) an improved cross-modality alignment loss that minimizes the distance between the distributions of the same person in different modalities, and (ii) a comprehensive masked modality modeling method that helps the model focus on important parts of inputs. We evaluate our proposed method on three popular benchmarks: CUHK-PEDES, ICFG-PEDES, and RSTPReid, and show that it outperforms the state-of-the-art methods.vi_VN
dc.language.isoenvi_VN
dc.publisherSpringer Naturevi_VN
dc.subjectComprehensive Masked Modality Modelingvi_VN
dc.subjectText-based Person Searchvi_VN
dc.titlecMDTPS: Comprehensive Masked Modality Modeling with Improved Similarity Distribution Matching Loss for Text-based Person Searchvi_VN
dc.typeWorking Papervi_VN
Appears in Collections:CITA 2024 (International)

Files in This Item:

 Sign in to read



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.