Please use this identifier to cite or link to this item:
https://elib.vku.udn.vn/handle/123456789/2701
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Tran, Duy | - |
dc.contributor.author | Le, Thang | - |
dc.contributor.author | Tran, Khoa | - |
dc.contributor.author | Le, Hoang | - |
dc.contributor.author | Do, Cuong | - |
dc.contributor.author | Ha, Thanh | - |
dc.date.accessioned | 2023-09-25T08:17:10Z | - |
dc.date.available | 2023-09-25T08:17:10Z | - |
dc.date.issued | 2023-06 | - |
dc.identifier.isbn | 978-604-80-8083-9 | - |
dc.identifier.uri | http://elib.vku.udn.vn/handle/123456789/2701 | - |
dc.description | Proceeding of The 12th Conference on Information Technology and It's Applications (CITA 2023); pp: 45-52. | vi_VN |
dc.description.abstract | In addition to its use in the realm of plastic surgery and aesthetics, Facial Beauty Prediction technology also has applications in other areas, such as advertising and social media, where it can be used to optimize marketing strategies and help individuals enhance their online presence. There are applications in other areas as well, such as advertising and social media. This study introduces an effective approach to evaluate human face beauty using a transformer-based architecture. While Convolutional Neural Network (CNN) is a conventional method for this task, our experimental results demonstrate that our Vision Transformer (ViT) based model outperforms the other two effective baselines, VGGNet and ResNet50, in evaluating human face beauty on the widely-used benchmark dataset SCUT-FPB 5500. Our ViT-based model demonstrates superior performance in Mean Absolute Error (MAE) and Mean Squared Error (MSE) compared to VGG16 and ResNet-50, despite employing a simple data pipeline without any data augmentation. Our study suggests that transformer-based architectures offer a more effective means of evaluating human beauty and open new avenues for further research in this field. | vi_VN |
dc.language.iso | en | vi_VN |
dc.publisher | Vietnam-Korea University of Information and Communication Technology | vi_VN |
dc.relation.ispartofseries | CITA; | - |
dc.subject | Artificial Intelligence | vi_VN |
dc.subject | Vision Transformer | vi_VN |
dc.subject | Facial features extraction | vi_VN |
dc.subject | VGG-16 | vi_VN |
dc.subject | Resnet-50 | vi_VN |
dc.subject | ViT-based-16-2k | vi_VN |
dc.title | Facial Beauty Prediction with Vision Transformer | vi_VN |
dc.type | Working Paper | vi_VN |
Appears in Collections: | CITA 2023 (National) |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.