Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A three-step variable selection procedure based on random forests, initially developed for high-dimensional data where variables exceed observations. The method is versatile for regression and supervised classification problems, as described by Genuer, Poggi, and Tuleau-Malot in a 2015 R Journal article. The package aims to eliminate irrelevant variables, select all response-related variables for interpretation, and refine the selection by removing redundancy for prediction.
This appears to be a description of a software package/method rather than a traditional dataset; the actual data format and structure are unspecified.