Selects a the top k features that are significantly associated with the class to predict

Runs statistics on the data and selects a subset of k features that are the most significant. Besides filtering this function can be used in a more larger statistical context.

Usage

filterfeaturesK(
  data,
  trait,
  k = 10,
  type = "wilcoxon",
  restrict = rep(TRUE, ncol(data)),
  multiple.adjust = "BH",
  paired = FALSE,
  sort = TRUE,
  verbose = FALSE,
  verbose.step = NULL,
  return.data = FALSE
)

Arguments

data:: the dataset X
trait:: is the equivalent of y (class, or numerical)
k:: the number of features (default:10)
type:: the statistics to be run (default:wilcoxon)
restrict:: Run the statistics in a subset of the dataset (default: a vector of all TRUE)
multiple.adjust:: the multiple testing adjustment method (default:BH)
paired:: wether paired statistics should be run (default:FALSE)
sort:: return variables sorted by p-value significance (default:TRUE)
verbose:: print out information indicating progress (default:FALSE)
verbose.step:: Showing a 1 percent progress.
return.data:: if (default:FALSE) this returns the statistics of X, otherwise the restricted data subset