The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Trevor Hastie, Robert Tibshirani, and Jerome Friedman, 2009 (Springer) - Authoritative textbook with detailed chapters on regularization methods (including Lasso) and tree-based models, providing statistical context for embedded feature selection.
Feature importance with ensembles, The scikit-learn developers, 2024 (scikit-learn project) - Official scikit-learn documentation illustrating how ensemble methods like Random Forests compute and use feature importances for selection, a common embedded technique.
A survey on feature selection methods, Jian Cai, Jiuyong Luo, Shuxin Wang, and Siheng Meng, 2018Journal of Computer Science and Technology, Vol. 33 (Springer Science and Business Media LLC)DOI: 10.1007/s11390-018-1807-7 - A recent review paper offering a broad overview of feature selection techniques, including a discussion of embedded methods and their role in machine learning.