Please use this identifier to cite or link to this item:
Full metadata record
DC FieldValueLanguage
dc.contributor.authorLe, Hoang Minh Quang-
dc.contributor.authorLe, Trung Kien-
dc.contributor.authorPham, Van Truong-
dc.contributor.authorTran, Thi Thao-
dc.descriptionLecture Notes in Networks and Systems (LNNS, volume 734); CITA: Conference on Information Technology and its Applications; pp: 169-180.vi_VN
dc.description.abstractPreviously, Multi-Layer Perceptrons (MLPs) were primarily used in image classification tasks. The emergence of the MLP-Mixer architecture has demonstrated the continued efficacy of MLPs in other visual tasks. To obtain superior results, it is imperative to have pre-trained weights from large datasets, and the Cross-Location (Token Mix) operation must be adaptively modified to suit the specific task at hand. Inspired by this, we proposed AMG-Mixer, an MLP-based architecture for image segmentation. In particular, recognizing the importance of positional information, we proposed AxialMBconv Token Mix utilizing Axial Attention. Additionally, to reduce Axial Attention’s receptive field constraints, we proposed Multi-scale Multi-axis MLP Gated (MS-MAMG) block which employs Multi-Axis MLP. The proposed AMG-Mixer architecture outperformed State-of-the-Art (SOTA) methods on benchmark datasets including GLaS, Data Science Bowl 2018, and Skin Lesion Segmentation ISIC 2018, even without pre-training. The proposed AMG-Mixer architecture has been confirmed effective and high performing in our study. The code is available at
dc.publisherSpringer Naturevi_VN
dc.subjectimage segmentationvi_VN
dc.subjectAxial Attentionvi_VN
dc.subjectMulti-axis MLPvi_VN
dc.titleAMG-Mixer: A Multi-Axis Attention MLP-Mixer Architecture for Biomedical Image Segmentationvi_VN
dc.typeWorking Papervi_VN
Appears in Collections:CITA 2023 (International)

Files in This Item:

 Sign in to read

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.