Hello! I am a Senior Data Scientist at Bank of America. I was previously an Assistant Professor at UIUC, with a focus on NLP, ML, and Causal Inference.
University of Illinois at Urbana–Champaign
Pinned Repositories
Implements a bootstrap-based heterogeneity test for standardized mean differences (d), Fisher-transformed Pearson's correlations (r), and natural-logarithm-transformed odds ratios (or) in meta-analysis studies. Depending on the presence of moderators, this Monte Carlo based test can be implemented in the random- or mixed-effects model. This package uses the rma() function from the R package 'metafor' to obtain parameter estimates and likelihoods, so the 'metafor' package must be installed. This approach draws on the studies of Anscombe (1956) <doi:10.2307/2332926>, Haldane (1940) <doi:10.2307/2332614>, Hedges (1981) <doi:10.3102/10769986006002107>, Hedges & Olkin (1985, ISBN:978-0123363800), Silagy, Lancaster, Stead, Mant, & Fowler (2004) <doi:10.1002/14651858.CD000146.pub2>, Viechtbauer (2010) <doi:10.18637/jss.v036.i03>, and Zuckerman (1994, ISBN:978-0521432009).
This package implements a new method, ClussCluster, that simultaneously performs clustering analysis and signature gene selection on high-dimensional transcriptome data sets. To do so, ClussCluster adds a Lasso-type regularization penalty term to the K-means objective function so that cell-type-specific signature genes can be identified while clustering the cells.
An archive of datasets
This is a read-only mirror of the CRAN R package repository. EFAutilities: Utility Functions for Exploratory Factor Analysis
This is an online book written for my graduate-level course on Structural Equation Modeling (SEM). Generated using R bookdown.
This package includes a number of functions for users to examine measurement invariance via equivalence testing along with adjusted RMSEA cutoff values. In particular, a projection-based method is implemented to test the equality of latent factor means across groups without assuming the equality of intercepts.
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
An R package for structural equation modeling and more
🤖 LLM Twin FREE Course: Building Your Production-Ready AI Replica | An End-to-End Framework for Production-Ready LLM Systems by Building Your LLM Twin | WIP...
gabriellajg's Repositories
Contains results for a statistical method that tackles the missing-data problem in SEM.
Boosted regression trees for multivariate, longitudinal, and hierarchically clustered data.
LaTeX style for Notre Dame Dissertations
Online R book written for a graduate-level course on quasi-experimental designs/causal inference. Generated using R bookdown.
Finite-sample inference for RD designs using local randomization and related methods.
R package to identify genes with differential distributions in single-cell RNA-seq