Vui lòng dùng định danh này để trích dẫn hoặc liên kết đến tài liệu này:
https://elib.vku.udn.vn/handle/123456789/4293
Nhan đề: | Multi-Scale Convolutions Meet Group Attention for Dense Prediction Tasks |
Tác giả: | Vo, Xuan Thuy Nguyen, Duy Linh Priadana, Adri Choi, Jehwan Hyun Jo, Kang |
Từ khoá: | Multi-Scale Convolutions Meet Group Attention for Dense Prediction Tasks On ImageNet-1K image classification, the proposed method achieves 77.6% Top-1 accuracy at 0.7 GFLOPs, surpassing other methods under similar computational costs |
Năm xuất bản: | thá-2024 |
Nhà xuất bản: | Springer Nature |
Tóm tắt: | Self-attention can capture long-range dependencies from input sequences without inductive biases, resulting in quadratic complexity. When transferring Vision Transformers to dense prediction tasks, the models suffer huge computational costs. Recent methods have drawn sparse attention to approximate attention regions and injected convolution into self-attention layers. Motivated by this line of research, this paper introduces group attention that has linear complexity with input resolution while modeling global context features. Group attention shares information across channels, and convolution is spatial sharing. Both operations are complementary, and multi-scale convolution can capture multiple views of the input. Merging multi-scale convolution into group attention layers can help improve feature representation and modeling abilities. To verify the effectiveness of the proposed method, extensive experiments are conducted on benchmark datasets for various vision tasks. On ImageNet-1K image classification, the proposed method achieves 77.6% Top-1 accuracy at 0.7 GFLOPs, surpassing other methods under similar computational costs. When transferring pre-trained model on ImageNet-1K to dense prediction tasks, the proposed method attains consistent improvements across visual tasks. |
Mô tả: | Lecture Notes in Networks and Systems (LNNS,volume 882); The 13th Conference on Information Technology and Its Applications (CITA 2024) ; pp: 360-371. |
Định danh: | https://elib.vku.udn.vn/handle/123456789/4293 https://doi.org/10.1007/978-3-031-74127-2_30 |
ISBN: | 978-3-031-74126-5 |
Bộ sưu tập: | CITA 2024 (International) |
Khi sử dụng các tài liệu trong Thư viện số phải tuân thủ Luật bản quyền.