Functionality for Fast Univariate Tests of Tabular Input
Closed this issue · 2 comments
Similar to rowFtests
and colFtests
in genefilter, could survival have functionality which does fast computation in C or C++ over either the rows or columns of tabular data, to allow univariate feature selection of variables associated with survival? colCoxTests
?
Could you be a little clearer about what exactly you want? (Most of the serious computation in the survival package is already in C, by the way).
I make a guess about what you want, which is to test for the significance of gene 'X' on survival, say after adjusting for age? There is a fast approximation for this: the sum of x* marginale-residuals is the numerator of the score test for the addition of 'x' to the regression, so you can very easily rank the variables. But variance is a bit harder.
If I am correct, I'm not sure this belongs in the survival package; it might more logically be placed into genefilter.
Thanks for the suggestion. Yes, fast computation on thousands of variables, one-at-a-time. I agree that it might be better suited to genefilter or a similar package and I will discuss it with the package developer there.