Convolutional Autoencoders for Spatial Data

References

Gradient-based learning applied to document recognition, Yann LeCun, Léon Bottou, Yoshua Bengio, Patrick Haffner, 1998 Proceedings of the IEEE, Vol. 86 (IEEE) DOI: 10.1109/5.726791 - Introduces the foundational concepts of Convolutional Neural Networks (CNNs), including local receptive fields, shared weights, and pooling, which are essential for Convolutional Autoencoders.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - A comprehensive textbook providing in-depth theoretical foundations for both Convolutional Neural Networks and Autoencoders, covering architectures, training, and various forms of each.
Stacked Convolutional Auto-Encoders for Hierarchical Feature Extraction, Jonathan Masci, Ueli Meier, Dan Cireşan, Jürgen Schmidhuber, 2011 International Conference on Artificial Neural Networks (ICANN), Vol. 6791 (Springer, Berlin, Heidelberg) DOI: 10.1007/978-3-642-21735-7_7 - A seminal paper that explicitly introduces the architecture and application of Convolutional Autoencoders for learning hierarchical features from image data.
CS231n: Convolutional Neural Networks for Visual Recognition, Fei-Fei Li, Ehsan Adeli, Justin Johnson, Zane Durante, 2025 (Stanford University) - Provides detailed explanations and visualizations of Convolutional Neural Networks, pooling, strided convolutions, and transposed convolutions, which are fundamental components of CAEs.

© 2025 ApX Machine Learning