Title: | Calculate Feature Stability |
---|---|
Description: | Has two functions to help with calculating feature selection stability. 'Lump' is a function that groups subset vectors into a dataframe, and adds NA to shorter vectors so they all have the same length. 'ASM' is a function that takes a dataframe of subset vectors and the original vector of features as inputs, and calculates the Stability of the feature selection. The calculation for 'asm' uses the Adjusted Stability Measure proposed in: 'Lustgarten', 'Gopalakrishnan', & 'Visweswaran' (2009)<https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2815476/>. |
Authors: | Nicolas Ewen |
Maintainer: | Nicolas Ewen <[email protected]> |
License: | GPL-3 |
Version: | 0.1.2 |
Built: | 2024-12-19 06:43:19 UTC |
Source: | CRAN |
Calculates feature stability using asm.
asm(x,y)
asm(x,y)
x |
A dataframe where each column is a subset. |
y |
A vector of the original set. |
This formula to calculate feature selection stability comes from:
Lustgarten, J. L., Gopalakrishnan, V., & Visweswaran, S. (2009). Measuring Stability of Feature Selection in Biomedical Datasets. AMIA Annual Symposium Proceedings, 2009, 406–410.
x<-c(1,2,3,4) y<-c(3,4,5) z<-c(2,3,4,5,6) T<-lump(x,y,z) A<-c(1,2,3,4,5,6,7) asm(T,A)
x<-c(1,2,3,4) y<-c(3,4,5) z<-c(2,3,4,5,6) T<-lump(x,y,z) A<-c(1,2,3,4,5,6,7) asm(T,A)
Lumps vectors into a data frame. Adds NAs to make all the lengths the same.
lump(...)
lump(...)
... |
The subsets, given as vectors. |
x<-c(1,2,3,4) y<-c(3,4,5) z<-c(2,3,4,5,6) lump(x,y,z)
x<-c(1,2,3,4) y<-c(3,4,5) z<-c(2,3,4,5,6) lump(x,y,z)