A collection of utility scripts in R for data analysis and visualization to serve as a reference for lab members.
In general, the scripts are meant to be downloaded and edited manually. There is usually an INPUTS section at the beginning of each script where you can specify the input files and parameters. The scripts are prepared to be run from RStudio, but you can also run them in the terminal with Rscript <scriptname>.
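As an illustration of what an INPUTS section can look like, here is a minimal sketch. All variable names, file names, and values are hypothetical and are not taken from any script in this collection. The command-line override is also an assumption: the scripts themselves may only support manual editing.

```r
# ---- INPUTS (hypothetical names and values; edit for your own data) ----
psi_file   <- "psi_values.tsv"  # hypothetical input table
output_dir <- "results"         # hypothetical output folder
min_var    <- 0.01              # hypothetical filtering parameter

# Optional (assumption): let arguments passed to Rscript override the defaults,
# e.g. Rscript <scriptname> my_psi.tsv my_results
args <- commandArgs(trailingOnly = TRUE)
if (length(args) >= 1) psi_file   <- args[1]
if (length(args) >= 2) output_dir <- args[2]

message("Input file: ", psi_file)
message("Output directory: ", output_dir)
```

Keeping every editable value in one block at the top means the rest of the script can stay untouched between analyses.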
- AS_boxplot.R: Boxplot of PSI values for alternative splicing events. Includes a Wilcoxon test for differential splicing between all of the groups.
- AS_kmeans.R: K-means clustering of PSI values for alternative splicing events.
- AS_PCA-heatmap-UMAP.R: PCA, heatmap, and UMAP plots for alternative splicing data. Allows filtering events by overall variance. Specific event and sample lists can be provided.
- rmats_summary.R: Creates basic plots (PCAs and volcano plots) and tables listing all events (both per-event-type tables and full tables) from multiple experiments. It expects an rmats directory containing the output of one or more rMATS experiments, each in a folder named rmats_<StudyGroup>_<ControlGroup>. Each of these folders is expected to contain a folder named rmats_output with all of the output files.
- GE_CGGA.R: PCA, heatmap, and UMAP plots for gene expression data. Used for the CGGA dataset.
- GE_plots.R: General script with boxplots with tests, PCA, heatmap, and UMAP plots. Allows subsetting specific genes and samples, as well as filtering genes by overall variance.
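The directory layout that rmats_summary.R expects can be checked before running it. The sketch below is not part of rmats_summary.R: the helper `find_rmats_experiments` is a hypothetical name, and it assumes that group names contain no underscores, so a folder name splits cleanly into study and control group.

```r
# Hypothetical helper: list rMATS experiment folders under `rmats_root`
# and split their names into study and control groups.
# Assumption: group names contain no underscores.
find_rmats_experiments <- function(rmats_root = "rmats") {
  dirs <- list.dirs(rmats_root, full.names = TRUE, recursive = FALSE)

  # Keep only folders named rmats_<StudyGroup>_<ControlGroup>
  pattern <- "^rmats_([^_]+)_([^_]+)$"
  dirs <- dirs[grepl(pattern, basename(dirs))]

  output_dirs <- file.path(dirs, "rmats_output")
  data.frame(
    study      = sub(pattern, "\\1", basename(dirs)),
    control    = sub(pattern, "\\2", basename(dirs)),
    output_dir = output_dirs,
    has_output = dir.exists(output_dirs),  # FALSE if rmats_output is missing
    stringsAsFactors = FALSE
  )
}

# Usage: returns one row per experiment folder (zero rows if none are found)
experiments <- find_rmats_experiments("rmats")
print(experiments)
```

Rows where `has_output` is `FALSE` point to experiment folders that lack the expected rmats_output folder.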