Stellar-Classification

This project performs stellar classification using 100,000 observations of stellar objects taken by the Sloan Digital Sky Survey (SDSS). Each observation contains 17 explanatory variables. Models including multinomial, random forest, and kNN are used to classify each stellar object as a galaxy, quasar, or star.
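To illustrate the kNN idea mentioned above, here is a minimal pure-Python sketch of majority-vote classification. It is not the code from the notebooks: the function name `knn_predict`, the two features, the class labels, and the toy points are all made up for illustration and are not SDSS values.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Euclidean distance from the query to every training point
    distances = [
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    ]
    # Keep the k closest neighbours and count their class labels
    nearest = sorted(distances, key=lambda pair: pair[0])[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two made-up features per object (NOT real SDSS values)
train_X = [
    (0.1, 0.2), (0.2, 0.1), (0.15, 0.25),  # labelled "STAR"
    (1.0, 1.1), (1.1, 0.9), (0.9, 1.0),    # labelled "GALAXY"
    (2.0, 2.1), (2.1, 1.9), (1.9, 2.0),    # labelled "QSO"
]
train_y = ["STAR"] * 3 + ["GALAXY"] * 3 + ["QSO"] * 3

print(knn_predict(train_X, train_y, (1.05, 1.0)))  # prints GALAXY
```

On real data the features would normally be scaled first, since kNN distances are sensitive to the units of each variable.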

Primary language: Jupyter Notebook

# Code files, to be inspected in index order

1_Final_Report_PhaseA.ipynb

1.5_kNN_Final_Report_PhaseA_R.ipynb

1.5_kNN_Final_Report_PhaseA_FeatureSelection_Python.ipynb

2_Final_Report_PhaseB.ipynb

3_Final_Report_PhaseC.ipynb

# Data CSV files, in order of usage

1. star_classification.csv - loc: Rdata, original dataset from Kaggle
2. PhaseA_star.csv - loc: Rdata, transformed full dataset, generated from PhaseA
3. PhaseA_star_subset.csv - loc: Rdata, transformed subset dataset, generated from PhaseA
4. kNN_PhaseA_star.csv - loc: kNN_data, transformed full dataset for kNN, generated from 1.5_R
5. kNN_PhaseA_star_subset.csv - loc: kNN_data, transformed subset dataset for kNN, generated from 1.5_R
6. kNN_PhaseA_star_rf.csv - loc: kNN_data, transformed full dataset by RF, generated from 1.5_R
7. kNN_PhaseA_star_subset_rf.csv - loc: kNN_data, transformed subset dataset by RF, generated from 1.5_R
8. kNN_PhaseA_star_perm.csv - loc: kNN_data, transformed full dataset by Perm, generated from 1.5_R
9. kNN_PhaseA_star_subset_perm.csv - loc: kNN_data, transformed subset dataset by Perm, generated from 1.5_R
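
The file locations listed above can be captured in a small lookup, sketched here in Python. This assumes that `Rdata` and `kNN_data` are directories relative to the repository root, which the list suggests but does not state; `DATA_DIRS` and `data_path` are hypothetical names, not part of the notebooks.

```python
from pathlib import Path

# Directory for each CSV, copied from the "loc:" entries in the list above
DATA_DIRS = {
    "star_classification.csv": "Rdata",
    "PhaseA_star.csv": "Rdata",
    "PhaseA_star_subset.csv": "Rdata",
    "kNN_PhaseA_star.csv": "kNN_data",
    "kNN_PhaseA_star_subset.csv": "kNN_data",
    "kNN_PhaseA_star_rf.csv": "kNN_data",
    "kNN_PhaseA_star_subset_rf.csv": "kNN_data",
    "kNN_PhaseA_star_perm.csv": "kNN_data",
    "kNN_PhaseA_star_subset_perm.csv": "kNN_data",
}


def data_path(filename, repo_root="."):
    """Return the assumed path of a data file relative to `repo_root`."""
    # Raises KeyError for a file name that is not in the list
    return Path(repo_root) / DATA_DIRS[filename] / filename


print(data_path("kNN_PhaseA_star_rf.csv").as_posix())  # prints kNN_data/kNN_PhaseA_star_rf.csv
```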