Please use this identifier to cite or link to this item:
https://elib.vku.udn.vn/handle/123456789/4293
Title: | Multi-Scale Convolutions Meet Group Attention for Dense Prediction Tasks |
Authors: | Vo, Xuan Thuy Nguyen, Duy Linh Priadana, Adri Choi, Jehwan Hyun Jo, Kang |
Keywords: | Multi-Scale Convolutions Meet Group Attention for Dense Prediction Tasks On ImageNet-1K image classification, the proposed method achieves 77.6% Top-1 accuracy at 0.7 GFLOPs, surpassing other methods under similar computational costs |
Issue Date: | Nov-2024 |
Publisher: | Springer Nature |
Abstract: | Self-attention can capture long-range dependencies from input sequences without inductive biases, resulting in quadratic complexity. When transferring Vision Transformers to dense prediction tasks, the models suffer huge computational costs. Recent methods have drawn sparse attention to approximate attention regions and injected convolution into self-attention layers. Motivated by this line of research, this paper introduces group attention that has linear complexity with input resolution while modeling global context features. Group attention shares information across channels, and convolution is spatial sharing. Both operations are complementary, and multi-scale convolution can capture multiple views of the input. Merging multi-scale convolution into group attention layers can help improve feature representation and modeling abilities. To verify the effectiveness of the proposed method, extensive experiments are conducted on benchmark datasets for various vision tasks. On ImageNet-1K image classification, the proposed method achieves 77.6% Top-1 accuracy at 0.7 GFLOPs, surpassing other methods under similar computational costs. When transferring pre-trained model on ImageNet-1K to dense prediction tasks, the proposed method attains consistent improvements across visual tasks. |
Description: | Lecture Notes in Networks and Systems (LNNS,volume 882); The 13th Conference on Information Technology and Its Applications (CITA 2024) ; pp: 360-371. |
URI: | https://elib.vku.udn.vn/handle/123456789/4293 https://doi.org/10.1007/978-3-031-74127-2_30 |
ISBN: | 978-3-031-74126-5 |
Appears in Collections: | CITA 2024 (International) |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.