Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - Covers theoretical foundations of deep learning, including detailed explanations of activation functions and autoencoder architectures.
Extracting and Composing Robust Features with Denoising Autoencoders, Pascal Vincent, Hugo Larochelle, Yoshua Bengio, Pierre-Antoine Manzagol, 2008Proceedings of the 25th International Conference on Machine Learning (ACM)DOI: 10.1145/1390156.1390294 - A seminal paper introducing denoising autoencoders, demonstrating the decoder's role in reconstructing data and influencing output layer choices.
Neural Networks Part 1: Setting up the Architecture, Justin Johnson, Andrej Karpathy, Fei-Fei Li, and course staff, 2023 (Stanford University) - Provides an accessible overview of neural network architectures and the properties of common activation functions (Sigmoid, Tanh, ReLU) from a leading university course.