Vui lòng dùng định danh này để trích dẫn hoặc liên kết đến tài liệu này: https://elib.vku.udn.vn/handle/123456789/6214
Toàn bộ biểu ghi siêu dữ liệu
Trường DCGiá trị Ngôn ngữ
dc.contributor.authorNguyen, Quoc Vuong-
dc.contributor.authorLe, Tang Phu Quy-
dc.contributor.authorPham, Van Nam-
dc.contributor.authorTon, That Ron-
dc.contributor.authorPhung, Anh Sang-
dc.contributor.authorTruong, The Quoc Dung-
dc.contributor.authorNguyen, Ngoc Xuan Quynh-
dc.contributor.authorNguyen, Huu Nhat Minh-
dc.date.accessioned2026-01-20T02:23:01Z-
dc.date.available2026-01-20T02:23:01Z-
dc.date.issued2026-01-
dc.identifier.isbn978-3-032-00971-5 (p)-
dc.identifier.isbn978-3-032-00972-2 (e)-
dc.identifier.urihttps://doi.org/10.1007/978-3-032-00972-2_21-
dc.identifier.urihttps://elib.vku.udn.vn/handle/123456789/6214-
dc.descriptionLecture Notes in Networks and Systems (LNNS,volume 1581); The 14th Conference on Information Technology and Its Applications (CITA 2025) ; pp: 273-285vi_VN
dc.description.abstractThe proliferation of abusive websites, particularly those facilitating phishing, fraud has emerged as a critical cybersecurity threat. Detecting these abusive websites efficiently remains a crucial challenge, necessitating sophisticated feature engineering and advanced machine learning techniques. In this paper, we present a comprehensive comparative study of domain-based and content-based approaches for abusive website detection with two datasets such as Vietnamese abusive websites and international phising datasets. Through extensive evaluation, we demonstrate that the integration of multiple feature types significantly enhances the detection accuracy. In particular, hosting-related features exhibit strong independent predictive capability, while machine learning models that take advantage of these features continue to achieve robust performance. Although extracted features contribute substantially to high-accuracy detection, our findings indicate that source code analysis is the most effective method for identifying abusive websites. In particular, language models, such as Phishlang, excel at capturing the textual patterns within website source code, achieving outstanding performance with an accuracy of 0.98 and an F1-score of 0.97.vi_VN
dc.language.isoenvi_VN
dc.publisherSpringer Naturevi_VN
dc.subjectAbusive website detectionvi_VN
dc.subjectMachine learningvi_VN
dc.subjectLanguage modelvi_VN
dc.subjectFeature engineeringvi_VN
dc.titleA Comparative Study on Domain and Content-Based Approaches for Abusive Website Detectionvi_VN
dc.typeWorking Papervi_VN
Bộ sưu tập: CITA 2025 (International)

Các tập tin trong tài liệu này:

 Đăng nhập để xem toàn văn



Khi sử dụng các tài liệu trong Thư viện số phải tuân thủ Luật bản quyền.