This repository is a collection of projects I completed for the Udacity Computer Vision Nanodegree, which I implemented with various computer vision tools from PyTorch to OpenCV.
- Project 1: Facial Keypoint Detection
- Project 2: Image Captioning
- Project 3: Landmark Detection (SLAM)
For this project, I built a facial keypoint detection system using Haar Casacdes and Convolutional Neural Network (CNN). This facial keypoint detection system that takes in any image with faces, and predicts the location of 68 distinguishing keypoints on each face!
Demo: results produced by my model
For this project, I designed and created a Convolutional Neural Network (CNN), Recurrent Neural Network (RNN) and LSTM (Long Short Term Memory) architecture to automatically generate captions from images. The model is trained on Microsoft Common Objects in COntext (MS COCO) dataset.
Demo: results produced by my model
For this project, I implemented SLAM (Simultaneous Localization and Mapping) for a 2 dimensional world by combining robot sensor measurements and movement to create a map of an environment.
Demo: results produced by my model