Title: A Study on Vietnamese Semantic Analysis using BERT-Based PreTrained Language Model
Authors: Pham, Vu Thu Nguyet
Ha, Thi Minh Phuong
Keywords: Natural Language Processing
Sentiment Analysis
Deep Learning
Issue Date: Jul-2022
Publisher: Da Nang Publishing House
Abstract: Sentiment analysis, in which machine learning models are trained to classify text by the polarity of the opinion it expresses, is one of the most significant NLP tasks. Many proposed models have achieved state-of-the-art results for sentiment analysis on English corpora. However, the task has received little investigation for Vietnamese corpora, which has limited research on Vietnamese. In this paper, we propose a sentiment analysis technique for Vietnamese that uses the PhoBERT pretrained model. PhoBERT is a Vietnamese pretrained language model based on RoBERTa, a robustly optimized variant of the well-known BERT model. Our technique performs well on the given dataset, with an AUC score of 86%. This is expected to provide groundwork for future research on Vietnamese, a language with limited resources.
Description: The 11th Conference on Information Technology and its Applications; Poster; pp. 30-37.
Appears in Collections: CITA 2022
