arq5x/gemini

gene list from `gemini burden` smaller than gene list from `gemini burden --calpha`

mjsduncan opened this issue · 1 comments

when i ran gemini burden --nonsynonymous and gemini burden --nonsynonymous --calpha on a large db, the number of genes/lines of output were the same, but without the --nonsynynous parameter the gene list for the per-sample counts was 1/10 the size of the gene list for the c alpha test. is this a bug or am i missing something?

this is with gemini 0.30.1 installed via bc-bio.

thanks for your work!

rtfm, using the --nonsynonymous flag invokes the same variant filter for burden and for burden --calpha (codon_change != 'None'), and not using the --nonsynonymous flag uses different filters (is_coding = 1 and (impact_severity = 'HIGH' or polyphen_pred = 'probably_damaging') vs impact_severity != 'LOW').