Network In Network, Min Lin, Qiang Chen, Shuicheng Yan, 2014ICLRDOI: 10.48550/arXiv.1312.4400 - Introduces the Network-in-Network architecture, using 1x1 convolutions to build multi-layer perceptron micro-networks and proposing Global Average Pooling.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - A comprehensive textbook providing foundational knowledge on deep learning, including detailed sections on Convolutional Neural Networks and advanced architectures like Inception.