This repo includes various jupyter notbooks, notes from courses, white papers that helped me pass the AWS ML Specialty certification in 2020. It is not intended to list every resource that exists out there but should help some of my fellow associates to get started with some good documentation.
I personally passed that exam on 11/10/2020 with score of 922 / 1000 or 92.2%. The passing score is 75%. I found the exam quite hard and I am glad I invested time (roughly 3 months) to refresh and/or learn on various topics in modeling, operations, monitoring, security and data engineering/analysis.
"The AWS Certified Machine Learning - Specialty certification is intended for individuals who perform a development or data science role. It validates a candidate's ability to design, implement, deploy, and maintain machine learning (ML) solutions for given business problems."
Topics covered during the exam:
- Data Engineering (20%): S3 (and VPC Endpoint Gateway), Kinesis (Streams, FireHose, Data Analytics, Video), Glue (Data Catalog and Crawler), Athena, AWS Data Stores (Redshift, RDS/Aurora, DynamoDB, ElasticSearch, ElastiCache), AWS Data Pipelines, AWS Batch, AWS DMS, AWS Step Functions
- Exploratory Data Analysis (24%): Data Types and Distribution, Time Series, Amazon Athena, Quicksight, Ground Truth, EMR, Spark, Dat binning, transforming, encoding, scaling and shuffling, Dealing with Missing data, outliers, unbalanced data, outliers,
- Modeling (36%): CNN, RNN, Tuning neural networks, Regularization, Gradient, L1 and L2 regularization, Confusion matrix (Precision, Recall, F1, AUC), Ensemble methods (Bagging and Boosting), Amazon Sagemaker, Amazon Algorithms (Linear Learner, XGBoost, Seq2Seq, BlazingText, DeepAR, Object2Vec, ObjectDetection, Image Classification, Semantic Segmentation, RCF, LDA, KNN, K-Means, PCA, Factorization Machine), Amazon AI Services (Comprehend, Translate, Transcribe, Polly, Rekognition, Forecast, Lex, ...)
- ML Implementations and Operations (20%): SageMaker Production Variants, Neo, IoT Greengrass, Encryption at Rest and in Transit, VPC, IAM, Logging, Monitoring, Instance Types and Spot Instances, Elasstic Inference, Auto-Scaling, Availability Zones, Inference Pipelines, ...
Cost; $300 180 mn long (3 hours) and ~65 questions
- multiple choice
- multiple response
Notes:
- No partial credit for questions (if we get 2 or 3 right out of 5, no credit)
- Can mark questions an g back to them
- No points for unanswered
- Scores: between 100 and 1000
- Minimum passing score: 750
- Scaled scoring models are used
I have tried many SageMaker notebooks in my personal account to really get a good feel for the various algorithms and modeling techniques. You can see the list of Sagemaker repos in this other repo: aws-sagemaker-notebooks
The Cloud Guru - AWS Certified Machine Learning - Specialty 2020 Course includes over 17 hours of videos, 79 lessons, 8 course quizzes and 1 practice exam. There are also some great labs to get hands on.
Below are some of my notes/snapshots from the course:
- 1-data-collection
- 2-streaming-data-collection
- 3-data-preparation
- 4-data-visualization
- 5-modeling
- 6-algorithms
- 7-evaluation-optimization
- course completion
The Udemy AWS Certified Machine Learning Specialty 2020 - Hands On! Course includes over 9 hours of videos, 114 lessons and 1 practice exam. The full list of course slides is available here.
Below are some of my notes/snapshots from the course:
- 1-data-engineering (part1)
- 1-data-engineering (part2) Below are some of my notes/snapshots from the course:
- 1-data-engineering (part1)
- 1-data-engineering (part2)
- 2-exploratory-data-analysis)
- 2-exploratory-data-analysis (lab))
- 3-modeling-concepts
- 3-modeling-ml-services (part1)
- 3-modeling-ml-services (part2)
- 3-modeling-ai-services
- 3-modeling-lab
- 4-ml-implementation-operation
- 4-ml-implementation-operation (lab)
- exam tips
- warmup test - quick assessment
- course completion
The Whizlabs Course (and tests) is a great course, with a lot of examples/labs via Jupyter notebook to grasp the materials taught. I also found the Tests a lot harder than Udemy and CloudGuru and would recommend passing these tests last.
Below are some of my notes/snapshots from the course:
- 1-data-engineering (part1)
- 1-data-engineering (part2)
- 2-data-analysis (part1)
- 2-data-analysis (part2)
- 2-data-analysis (part3)
- 3-modeling (part1)
- lab - xgboost notebook
- 3-modeling (part2)
- lab - hyperparameter tuning notebook
- lab - train with tuned parameters notebook
- 4-algorithms (part 1)
- lab - regression (LinearLearner) notebook
- lab - clustering (K-Means) notebook
- 4-algorithms (part 2)
- lab - classification (XGboost) notebook
- 4-algorithms (part 3)
- lab - text analysis (BlazingText) notebook
- 5-implementation-operations
- Deep Learning on AWS - pdf extract
- Full white paper
- AWS White Paper - Machine Learning Foundations - pdf extract
- Full white paper
- AWS White Paper - Power Machine Learning at Scale - pdf extract
- Full white paper
- AWS White Paper - Streaming Data Solutions on AWS with Amazon Kinesis - pdf extract
- Full white paper )
passed on 10/18/2020: scored 75% and used 1h35 from the 3 hours to cover the 65 questions
- cloudguru practice exam - part 1
- cloudguru practice exam - part 2
- cloudguru practice exam - review
- cloudguru practice exam 2nd attempt
- aws sample exam questions
- aws exam readiness - certified ml specialty
- aws exam readiness - domain1 - data engineering -
- aws exam readiness - domain2 - exploratory data analysis
- aws exam readiness - domain3 - modeling
- aws exam readiness - domain4 - ml implementation operations.pdf
- aws exam readiness - study questions
- aws exam readiness - study results
- aws practice exam - 20 questions
Great set of courses from Andrew Ng in Coursera that I highly recommend:
Article extracts and notes (pdf):
Other:
- A Cloud Guru - ML Specialty Course
- A Cloud guru AWS Certification prep
- A Cloud Guru Labs - Github
- AWS Training and Certification - this is where you can pass the Exam Readiness course and quiz
- Amazon SageMaker Developer guide
- Whizlabs course and several solid practice exams