/Gene-Cancer-Classification_Principal-Component-Analysis

Project includes categorization of acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL) using Principal Component Analysis and sorting algorithms on datasets of gene sequencing.(In Progress)

Primary LanguageJupyter NotebookMIT LicenseMIT

Gene Cancer Classification using Principal Component Analysis:


Datasets containing the initial (training, 38 samples) and independent (test, 34 samples) datasets used in the paper : Golub et al "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring".

These datasets contain measurements corresponding to acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL) from Bone Marrow and Peripheral Blood using gene expression monitoring (via DNA microarray) . Intensity values have been re-scaled such that overall intensities for each chip are equivalent.

The motive is to categorize the samples into AMP and ALL using Principal Component Analysis.


Author:


+ Vedant Shrivastava | vedantshrivastava466@gmail.com