Over a period of nine years in deep space, the NASA Kepler space telescope has been out on a planet-hunting mission to discover hidden planets outside of our solar system.
To help process this data, I will create machine learning models capable of classifying candidate exoplanets from the raw dataset.
- Preprocessed the dataset prior to fitting the model.
- Performed feature selection and remove unnecessary features.
- Used
MinMaxScaler
to scale the numerical data. - Separated the data into training and testing data.
- Used
GridSearch
to tune model parameters. - Tuned and compared at least two different classifiers.
- Created a README that reports a comparison of each model's performance as well as a summary about my findings and any assumptions I made based on my model.
-
Start by cleaning the data, removing unnecessary columns, and scaling the data.
-
Not all variables are significant be sure to remove any insignificant variables.
-
Make sure your
sklearn
package is up to date. -
Try a simple model first, and then tune the model using
GridSearch
.