Skip to contents

Runs statistics on the data and selects a subset of k features that are the most significant. Besides filtering this function can be used in a more larger statistical context.

Usage

filterfeaturesK(
  data,
  trait,
  k = 10,
  type = "wilcoxon",
  restrict = rep(TRUE, ncol(data)),
  multiple.adjust = "BH",
  paired = FALSE,
  sort = TRUE,
  verbose = FALSE,
  verbose.step = NULL,
  return.data = FALSE
)

Arguments

data:

the dataset X

trait:

is the equivalent of y (class, or numerical)

k:

the number of features (default:10)

type:

the statistics to be run (default:wilcoxon)

restrict:

Run the statistics in a subset of the dataset (default: a vector of all TRUE)

multiple.adjust:

the multiple testing adjustment method (default:BH)

paired:

wether paired statistics should be run (default:FALSE)

sort:

return variables sorted by p-value significance (default:TRUE)

verbose:

print out information indicating progress (default:FALSE)

verbose.step:

Showing a 1 percent progress.

return.data:

if (default:FALSE) this returns the statistics of X, otherwise the restricted data subset