/Predictive-Modelling-of-Multimodal-Single-Cell-Genomic-Data-with-Machine-Learning-Algorithms

This code demonstrates the use of machine learning to model the multimodal nature of a single cell. Using machine learning to predict RNA from DNA, that is, using chromatin accessibility data to predict the RNA gene expression and to predict surface protein from RNA, that is, using RNA sequence data to predict surface protein levels in a cell

Primary LanguageJupyter Notebook

Predictive Modelling of Multimodal Single Cell Genomic Data with Machine Learning Algorithms

  • Following code is for analyzing the Multimodal nature of a Single Cell
  • Data provided by https://openproblems.bio/
  • Compress the Multiome data to sparse matrix first and unzip all the data in a folder named "Data" before running rest of the files
  • Libraries to install : numpy, pandas, matplotlib, seaborn, scikit-learn, keras, XGBoost, LightGBM, CatBoost, tables

Multiome Problem Pipeline

Picture9

Citeseq Problem Pipeline

Picture10