Random Forests, Leo Breiman, 2001Machine Learning, Vol. 45DOI: 10.1023/A:1010933404324 - Introduces the Random Forest algorithm and discusses its inherent feature importance measures based on impurity reduction.
The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Trevor Hastie, Robert Tibshirani, and Jerome Friedman, 2009 (Springer) - A comprehensive textbook covering decision trees, ensemble methods like Random Forests and Gradient Boosting, and the theoretical basis of impurity-based feature importance.
Feature importances with ensembles of trees, Scikit-learn developers, 2023 - Official documentation demonstrating how to obtain and interpret feature importances from tree-based ensemble models in Scikit-learn.
Permutation feature importance, Scikit-learn developers, 2023 - Official Scikit-learn documentation explaining the concept and usage of permutation feature importance as a model-agnostic method.