Vui lòng dùng định danh này để trích dẫn hoặc liên kết đến tài liệu này: https://elib.vku.udn.vn/handle/123456789/4293
Nhan đề: Multi-Scale Convolutions Meet Group Attention for Dense Prediction Tasks
Tác giả: Vo, Xuan Thuy
Nguyen, Duy Linh
Priadana, Adri
Choi, Jehwan
Hyun Jo, Kang
Từ khoá: Multi-Scale Convolutions Meet Group Attention for Dense Prediction Tasks
On ImageNet-1K image classification, the proposed method achieves 77.6% Top-1 accuracy at 0.7 GFLOPs, surpassing other methods under similar computational costs
Năm xuất bản: thá-2024
Nhà xuất bản: Springer Nature
Tóm tắt: Self-attention can capture long-range dependencies from input sequences without inductive biases, resulting in quadratic complexity. When transferring Vision Transformers to dense prediction tasks, the models suffer huge computational costs. Recent methods have drawn sparse attention to approximate attention regions and injected convolution into self-attention layers. Motivated by this line of research, this paper introduces group attention that has linear complexity with input resolution while modeling global context features. Group attention shares information across channels, and convolution is spatial sharing. Both operations are complementary, and multi-scale convolution can capture multiple views of the input. Merging multi-scale convolution into group attention layers can help improve feature representation and modeling abilities. To verify the effectiveness of the proposed method, extensive experiments are conducted on benchmark datasets for various vision tasks. On ImageNet-1K image classification, the proposed method achieves 77.6% Top-1 accuracy at 0.7 GFLOPs, surpassing other methods under similar computational costs. When transferring pre-trained model on ImageNet-1K to dense prediction tasks, the proposed method attains consistent improvements across visual tasks.
Mô tả: Lecture Notes in Networks and Systems (LNNS,volume 882); The 13th Conference on Information Technology and Its Applications (CITA 2024) ; pp: 360-371.
Định danh: https://elib.vku.udn.vn/handle/123456789/4293
https://doi.org/10.1007/978-3-031-74127-2_30
ISBN: 978-3-031-74126-5
Bộ sưu tập: CITA 2024 (International)

Các tập tin trong tài liệu này:

 Đăng nhập để xem toàn văn



Khi sử dụng các tài liệu trong Thư viện số phải tuân thủ Luật bản quyền.