- This project is going to use AWS S3 and AWS Sagemaker to do machine learning model code part. All analysis are done within an AWS Sagemaker notebook.
- I will use the dataset traffic to do the project.
- Create an S3 bucket and upload the dataset to it. I would either use the AWS S3 console or the AWS CLI to do this.
- Use AWS S3 to store our data and use AWS Sagemaker to do Machine Learning.
- Build a model to predict the traffic volume with AWS S3 and AWS Sagemaker.
- Fit the model to the data and evaluate the model.
- This dataset contains 48.1k (48120) observations of the number of vehicles each hour in four different junctions:
- DateTime
- Juction
- Vehicles
- ID
- Before analyzing the data, I would use Rust to preprocess and clean the data.
- import important module
- import data file.
- data anlysis
- data cleaning
- build model and do evaluation
- create a bucket on AWS S3
- upload data
- create a notebook instance in AWS Sagemaker
- create a IAM role for sagemaker AWS
- create a notebook
- Coding