Pinned Repositories
bda_A2
manal-aamir
Config files for my GitHub profile.
amazon-frequent-items-kafka
This repository houses an implementation of finding frequent items utilizing A-Priori and PCY Algorithms on Apache Kafka. It leverages a 15GB .json file as a sample of the 100+GB Amazon_Reviews_Metadata Dataset. This was developed as part of an assignment for the course Fundamentals of Big Data Analytics (DS2004).
project-spotify
This repository houses an implementation of a Spotify-esque streaming service utilizing Apache Spark and Kafka. It leverages a 100GB dataset of various mp3 files from the Free Music Archive. This was developed as part of a project for the course Fundamentals of Big Data Analytics (DS2004).
wikipedia-naive-search
This repository houses a naïve search engine utilising MapReduce technology which leverages a 5GB csv file as dataset. It makes use of the Vector Space Model for Information Retrieval. This was developed as part of an assignment for the course Fundamentals of Big Data Analytics (DS2004).
manal-aamir's Repositories
manal-aamir/bda_A2
manal-aamir/manal-aamir
Config files for my GitHub profile.