/datasette

Primary LanguageJupyter Notebook

📼 Datasette

"Datasette" is a play on the word "cassette." A cassette is a small plastic container holding magnetic tape for playing or recording audio. Similarly, datasette refers to a collection or repository of mini data science projects, akin to how a cassette holds data in audio format.

Content

  • Machine Learning

    Tools: scikit-learn, Pandas, Matplotlib

  • Data Analysis and Visualization

    • 🪙 E-Commerce Sales Forecast: Utilized exploratory data analysis techniques to identify patterns, trends, and insights within the e-commerce dataset and applied time series forecasting techniques using Prophet to predict sales for the upcoming year. Applied K-Means clustering algorithm to categorize customers into distinct segments based on their purchasing behavior.
    • 🍃 Air Quality Analysis in Pre- and Post-Pandemic Philippines: work in progress 💼

    Tools: Pandas, Seaborn, Matplotlib, Plotly Express, scikit-learn, Prophet

  • Computer Simulations and Modeling

    • 💭 Agent-Based Modelling: Rumor Propagation Model: Conducted an agent-based modeling project to simulate and analyze the propagation of rumors in various scenarios, considering factors such as acceptance rates, introduction times, and starting locations. The results highlight the significant impact of these factors on the spread of rumor and truth, providing insights for effective intervention strategies.

    Tools: agentpy

  • Database Technologies

    • 🎶 DynamoDB: Song Database: Developed a DynamoDB-based song database for an online streaming service, enabling efficient querying with over 5000 stream entries, artist, album, and song information. This project utilized various access patterns, including base tables, LSIs, and GSIs, to cater to users, artists, developers, and the music industry.

    Tools: DynamoDB