"Datasette" is a play on the word "cassette." A cassette is a small plastic container holding magnetic tape for playing or recording audio. Similarly, datasette refers to a collection or repository of mini data science projects, akin to how a cassette holds data in audio format.
-
- 🎓 Supervised Learning: Analysis of Graduate Admissions Machine Learning Models: Utilizing K-Nearest Neighbors, Decision Trees, and Random Forest algorithms for classification and regression tasks, this project aims to provide insights into the importance of various admission parameters such as GRE scores, TOEFL scores, University Rating, and others.
Tools: scikit-learn, Pandas, Matplotlib
-
- 🪙 E-Commerce Sales Forecast: Utilized exploratory data analysis techniques to identify patterns, trends, and insights within the e-commerce dataset and applied time series forecasting techniques using Prophet to predict sales for the upcoming year. Applied K-Means clustering algorithm to categorize customers into distinct segments based on their purchasing behavior.
- 🍃 Air Quality Analysis in Pre- and Post-Pandemic Philippines: work in progress 💼
Tools: Pandas, Seaborn, Matplotlib, Plotly Express, scikit-learn, Prophet
-
- 💭 Agent-Based Modelling: Rumor Propagation Model: Conducted an agent-based modeling project to simulate and analyze the propagation of rumors in various scenarios, considering factors such as acceptance rates, introduction times, and starting locations. The results highlight the significant impact of these factors on the spread of rumor and truth, providing insights for effective intervention strategies.
Tools: agentpy
-
- 🎶 DynamoDB: Song Database: Developed a DynamoDB-based song database for an online streaming service, enabling efficient querying with over 5000 stream entries, artist, album, and song information. This project utilized various access patterns, including base tables, LSIs, and GSIs, to cater to users, artists, developers, and the music industry.
Tools: DynamoDB