gao-lab/GLUE

Question about preprocessing ATAC-Seq

Opened this issue · 4 comments

Hello,

I'm interested in using scglue to integrate my scRNAseq with scATACseq data, comprising four paired samples. These datasets are not multiome; they're only paired samples.

After processing with Cellranger, I used episcanpy's epi.ct.bld_mtx_fly to construct a count matrix from the TSV/TBI files. Subsequently, I filtered the data using epi.pp.filter_cells and epi.pp.filter_features.

My current concern pertains to the input matrix for the scglue.data.lsi function.

Could you please advise on whether I should use the Raw Matrix, Binarized Matrix, or normalizedMatrix for scglue.data.lsi?

Thank you. Your assistance is appreciated.

Hi @luglilab. Thanks for your interest in GLUE!

I would recommend using the raw count matrix, as was also recently confirmed in this paper.

Thanks a lot!

Hello @Jeff1995 ,

could you suggest a good preprocessing pipeline for 10X data compatible with your tool? I'm trying to figure it out if is better snapatac, episcanpy or signac but i'd like to have your comment.

Thank you.

Hi @luglilab, I haven't really fiddled with different preprocessing pipelines. As long as it outputs count matrices, it would work well with scglue. scglue has builtin lsi implementation and does not need further processing than the count matrices.