Please use this identifier to cite or link to this item: https://elib.vku.udn.vn/handle/123456789/2733
Title: Extending OCR Model for Font and Style Classification
Authors: Vu, Dinh Nguyen
Nguyen, Tien Dong
Dang, Minh Tuan
Ninh, Thi Anh Ngoc
Nguyen, Vu Son Lam
Nguyen, Viet Anh
Nguyen, Hoang Dang
Keywords: Font Classification
Style Classification
OCR
Issue Date: Jul-2023
Publisher: Springer Nature
Abstract: Font and style classification aims to recognize which font and which style the characters in the input image belong to. The con- junction of font and style classification with traditional OCR systems is important in the reconstruction of visually-rich documents. However, the current text recognition systems have yet to take into account these tasks and focus solely on the recognition of characters from input images. The separation of these tasks makes the document reconstruction systems computationally expensive. In this paper, we propose a new approach that extends the current text recognition model to include font and style classification. We also present a dataset comprising input images and corresponding characters, fonts, and styles in Vietnamese. We evaluate the effectiveness of this extension on multiple recent OCR models, including VST [10], CRNN [8], ViSTR [1], TROCR [7], SVTR [4] . Our results demonstrate that our extension achieves decent accuracy rates of 98.1% and 90% for font and style classification, respectively. Moreover, our extension can even boost the performance of the original OCR models.
Description: Lecture Notes in Networks and Systems (LNNS, volume 734); CITA: Conference on Information Technology and its Applications; pp: 193-204.
URI: https://link.springer.com/chapter/10.1007/978-3-031-36886-8_16
http://elib.vku.udn.vn/handle/123456789/2733
ISBN: 978-3-031-36886-8
Appears in Collections:CITA 2023 (International)

Files in This Item:

 Sign in to read



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.