Feature selection / Dimensionality reduction for tall array
I am working with a tall array of more than 2 million observations and about 3000 numerical predictor variables. My response variable is binary (no / yes). Which algorithms can I use to select (or rank) the best features for developing a predictive model, and how do I apply them to a tall array?