Squeeze-and-Excitation Networks, Jie Hu, Li Shen, Samuel Albanie, Gang Sun, Enhua Wu, 2018Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)DOI: 10.48550/arXiv.1709.01507 - Introduces the Squeeze-and-Excitation (SE) block, a channel attention mechanism discussed in detail in the section.
Non-local Neural Networks, Xiaolong Wang, Ross Girshick, Abhinav Gupta, and Kaiming He, 2018Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (IEEE)DOI: 10.1109/CVPR.2018.00749 - Presents Non-local Neural Networks, a method for capturing long-range spatial dependencies using self-attention, which is a core topic of the section.
Attention Is All You Need, Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin, 2017Advances in Neural Information Processing Systems (NeurIPS), Vol. 30DOI: 10.48550/arXiv.1706.03762 - The foundational paper introducing the Transformer architecture and the self-attention mechanism, which inspired many subsequent attention-based approaches in vision, including non-local networks.