PaperWork scRNA-GSE111527-mouseSVZ

Reanalyze the data GSE111527 to get the Figure 2/3 from this paper.


The parper: Single-Cell Transcriptomics Characterizes Cell Types in the Subventricular Zone and Uncovers Molecular Defects Impairing Adult Neurogenesis

The data: GSE111527

Table of Contents

Brief Intro to the Paper

Summary:

Neural stem cells (NSCs) contribute to plasticity and repair of the adult brain. Niches harboring NSCs regulate stem cell self-renewal and differentiation. We used comprehensive and untargeted single-cell RNA profiling to generate a molecular cell atlas of the largest germinal region of the adult mouse brain, the subventricular zone (SVZ). We characterized >20 neural and non-neural cell types and gained insights into the dynamics of neurogenesis by predicting future cell states based on computational analysis of RNA kinetics. Furthermore, we applied our single-cell approach to document decreased numbers of NSCs, reduced proliferation activity of progenitors, and perturbations in Wnt and BMP signaling pathways in mice lacking LRP2, an endocytic receptor required for SVZ maintenance. Our data provide a valuable resource to study adult neurogenesis and a proof of principle for the power of single-cell RNA sequencing to elucidate neural cell-type-specific alterations in loss-of-function models.

Highlights:

  • Single-cell transcriptomics characterizes the SVZ adult neural stem cell niche
  • Different transcriptional dynamics along the neurogenic lineage
  • Cell-type-specific dysfunctions underlying impaired adult neurogenesis

Brief Intro to the Data

Drop-seq sequencing reads for microdissected SVZs from three to five mice. The author collected 9,804 cells with 734 genes and 1,137 unique molecular identifiers (UMIs) quantified per cell (medians) in five replicates. We will perform analysis for these five replicates: an002, an003_F, an003_L, an008 and an009.

Main Analysises and Tools Used

Usage

scripts-preprocess

Mapping reads and get digital gene expression matrix (DGE);
Geting .loom files using the aligned reads (bam files) from Drop-seq tools

1- Download data from NCBI and extract fastq data

bash wget.sh
bash fastqdump_batch.sh

2- Transfer fastq format to sam format

bash fastq2sam_batch.sh

3- Drop-seq reads align

bash dropseq_batch_align.sh

4- Generate (DGE)

bash dropseq_batch_count.sh

5- Getting loom files

velocyto run -b an002.filtered.cellbarcode.tsv -o ../velocyto/bamfiles/ -m mm10_rmsk.gtf -@ 20 ../velocyto/bamfiles/SRR6814853.bam gencode.vM25.chr_patch_hapl_scaff.annotation.gtf

PS: due to the lack of detail paramters for seleceting cells (not provied in the paper), we counldn't get the exactly same DGE as the paper, so we used the DGE from GSE111527 in the follwing analysis.

scripts-plot

Clustering and visualization of the cells

1- Generate the merged Seurat object

Rscript step2_seurat_mergeReplicates_new.R

2- Clustering and tSNE plot (Figure 2A)

Rscript step3_seurat_clustering_new.R

3- Stacked violin plots,featureplot and heatmap for marker genes (Figure 2B-2D)

Rscript step4_seurat_makerExpression.R

4- Estimating RNA velocity and visualization (Figure 3A)

Rscript step5_velocyto.R

or

python velocyto.test.py

5- Subclustering of neurogenesis cells (Figure 3C)

Rscript step6_seurat_subclustering.R

6- Gene set scoring for subclustering cells (Figure 3D)

Rscript step7_geneSet_score.R

7- calculation of pseudotime and visualization for subclustering cells (Figure 3E)

Rscript step9_scanpy.R

or

python scanpy.py

data

DGE counts; Seurat obeject; loom files

results

visualization results in pdf format

Figure 2A

Figure 2B

Figure 2C

Figure 2D

Figure 3A

Figure 3C

Figure 3D

Contact Us

If you have any questions about these scripts, please feel free to contact Yuting Liu, lyt17@pku.edu.cn