Incorrect gene.annotation processing for sciPlex3
Closed this issue · 1 comments
FarzanT commented
Using header=None
reads the first line (which is a header) as a row in the gene annotation dataframe. This subsequently affects the dimensions of the whole dataset.
Currently, sciPlex3's var
looks like this:
srivatsan.var
Out[4]:
ensembl_id ncounts ncells
gene_symbol
nan id gene_short_name 26582.0 23228
nan:1 ENSG00000000003 35.0 33
nan:2 ENSG00000000005 163109.0 116153
So all the genes are shifted somehow. This can drastically affect downstream tasks since it's no longer clear what genes are expressed.
tessadgreen commented
Fixed, error corrected in script, updated dataset will be included in v1.4 on Zenodo