
Selects the top k features that are significantly associated with the class to predict
filterfeaturesK.Rd
Runs a statistical test on each feature of the data and selects the subset of k features that are the most significant. Besides filtering, this function can also be used in a broader statistical context.
Arguments
- data:
the dataset X
- trait:
the response variable y (a class or a numerical value)
- k:
the number of features (default:10)
- type:
the statistical test to run (default:wilcoxon)
- restrict:
run the statistics on a subset of the dataset (default: a vector of all TRUE)
- multiple.adjust:
the multiple testing adjustment method (default:BH)
- paired:
whether paired statistics should be run (default:FALSE)
- sort:
return variables sorted by p-value significance (default:TRUE)
- verbose:
print out information indicating progress (default:FALSE)
- verbose.step:
show progress in 1 percent steps
- return.data:
if FALSE (default), returns the statistics of X; otherwise returns the restricted data subset
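
The procedure described above (one test per feature, multiple-testing adjustment, sorting by p-value, keeping the top k) can be sketched in base R. This is an illustrative sketch only, not the function's actual implementation: `top_k_sketch` is a hypothetical helper, it covers only the two-class Wilcoxon case, and it assumes features are in rows and samples in columns, which this page does not state.

```r
# Hypothetical helper for illustration; NOT the implementation of filterfeaturesK.
# Assumption: `data` has features in rows and samples in columns.
top_k_sketch <- function(data, trait, k = 10, multiple.adjust = "BH") {
  trait <- as.factor(trait)
  stopifnot(nlevels(trait) == 2, ncol(data) == length(trait))
  g1 <- trait == levels(trait)[1]
  g2 <- trait == levels(trait)[2]

  # One two-sample Wilcoxon rank-sum test per feature (row)
  p <- apply(data, 1, function(x) wilcox.test(x[g1], x[g2])$p.value)

  # Adjust for multiple testing (Benjamini-Hochberg by default)
  q <- p.adjust(p, method = multiple.adjust)

  # Sort by raw p-value and keep the k most significant features
  res <- data.frame(p = p, q = q, row.names = rownames(data))
  res <- res[order(res$p), , drop = FALSE]
  head(res, k)
}

# Simulated data: 20 features, 30 samples, two classes of 15 samples each
set.seed(1)
X <- matrix(rnorm(20 * 30), nrow = 20,
            dimnames = list(paste0("f", 1:20), NULL))
y <- rep(c("a", "b"), each = 15)
X[1, y == "b"] <- X[1, y == "b"] + 5  # make feature f1 differ between classes

top <- top_k_sketch(X, y, k = 5)
print(top)
```

The sketch returns a table of raw and adjusted p-values, which loosely mirrors the default behaviour (return.data FALSE) described above; restricting to a sample subset, paired tests and progress reporting are left out for brevity.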