Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A foundational text for deep learning, providing theoretical background on recurrent neural networks and the challenges of sequence data with varying lengths.
tf.keras.layers.Embedding, TensorFlow Team, 2024 - Official documentation for the Embedding layer, including the mask_zero parameter, which automatically generates a mask that treats input index 0 as padding.
tf.keras.layers.Masking, TensorFlow Team, 2024 (Google) - Official documentation for the Masking layer, which explicitly creates a mask by skipping timesteps whose features all equal a specified mask_value.
Masking and padding with Keras, Scott Zhu, Francois Chollet, 2023 (TensorFlow) - A comprehensive Keras guide explaining how padding and masking work, how masks propagate through layers, and how they are applied in loss calculations.