Please use this identifier to cite or link to this item: https://elib.vku.udn.vn/handle/123456789/4002
Title: Dual-Transformer Framework for Vietnamese Image-to-Poem Generation
Authors: Nguyen, Minh Chau
Dang, Van Thin
Nguyen, Vinh Tiep
Truong, Quoc Truong
Nguyen, Thanh Son
Keywords: Language model
Text generation
Poem generation
Image-to-text
Vietnamese language
Transformer architectures
Issue Date: Jul-2024
Publisher: Vietnam-Korea University of Information and Communication Technology
Series/Report no.: CITA;
Abstract: The Image-to-Poem is one of the novelty tasks in Artificial Intelligence and has received researchers' attention in recent years. This task requires the system to automatically generate a poem which is a creative content and specific structure in an aesthetically pleasing manner based on the input's image. However, most existing methods still have problems with topic inconsistency and irregularity to tackle this task. Moreover, the lack of benchmark datasets is a big problem because of the different points of view of a person which leads to difficulty in creating Image-to-Poem datasets, especially the low-resource languages. Therefore, in this paper, we present a Visual-68Poem dataset for the Vietnamese Image-to-Poem task with six-eight poems with a variety of content and context. In addition, we propose a Dual-Transformer architecture, including a component to extract the main objects and concept keywords from the image and a language model to generate a six-eight poem. Specifically, the generated poems must be similar to the context of the image and are complied with the rules of the structural genre of poetry. Experimental results on our standard dataset show that our proposed models consistently achieve competitive performance over other models on different measure scores. We release our dataset and code to facilitate future work on this task.
Description: Proceedings of the 13th International Conference on Information Technology and Its Applications (CITA 2024); pp: 2-13.
URI: https://elib.vku.udn.vn/handle/123456789/4002
ISBN: 978-604-80-9774-5
Appears in Collections:CITA 2024 (Proceeding - Vol 2)

Files in This Item:

 Sign in to read



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.