/sagemaker_plagiarism_detector

A machine learning deployment project on detecting plagiarism from text

Primary LanguageJupyter Notebook

Sagemaker Plagiarism Detector

A machine learning project on detecting plagiarism, deployed using Sagemaker, delivered as part of the Udacity machine learning engineer nanodegree.

Project summary

Containment and longest common sequence are used to identify similarities among pairs of text. Naive Bayes is used as classifier and returns 92% accuracy.

Usage

The project runs in aws environment

Tools used

Python, AWS Sagemaker, AWS Lambda, Amazon API Gateway and Locust.io