Pinned Repositories
DataFlow-with-GCP
This project demonstrates the workflow of a Data Engineer. It utilizes the Google Cloud Platform and Google Colab as the main tools.
Exploring-and-Analyzing-Data-in-Oracle-Database
This project focuses on data analysis using SQL with Oracle Database 21c. It aims to familiarize with data management and data analysis using SQL commands and Oracle Database 21c.
Facebook-Fanpage-Keyword-Analyzer
This script allows you to analyze the posts on a Facebook fanpage and determine the percentage of posts containing a specific keyword.
Head-Require
head-require is a library that aims to simplify the creation of requirements.txt files. head-require generates requirements.txt based on the packages used in your project.
Make_knowledge_graph
This is a website for creating a simple sample knowledge graph.
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
PySpark-Big-Data-RDD-Operations
This project illustrates Apache Spark RDD operations, from creation and transformation to actions and results, enhancing users' understanding of distributed data processing.
PySpark-DataFrame-Operations
This project focuses on utilizing PySpark DataFrames to analyze and visualize data sourced from external datasets, such as CSV files. It provides a practical example of how to manipulate, transform, and gain insights from large datasets using the PySpark framework.
Real-Time-PySpark
This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, components, and applications for real-time data analysis.
Thanaraklee
Thanaraklee's Repositories
Thanaraklee/Real-Time-PySpark
This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, components, and applications for real-time data analysis.
Thanaraklee/Thanaraklee
Thanaraklee/Exploring-and-Analyzing-Data-in-Oracle-Database
This project focuses on data analysis using SQL with Oracle Database 21c. It aims to familiarize with data management and data analysis using SQL commands and Oracle Database 21c.
Thanaraklee/Head-Require
head-require is a library that aims to simplify the creation of requirements.txt files. head-require generates requirements.txt based on the packages used in your project.
Thanaraklee/DataFlow-with-GCP
This project demonstrates the workflow of a Data Engineer. It utilizes the Google Cloud Platform and Google Colab as the main tools.
Thanaraklee/Facebook-Fanpage-Keyword-Analyzer
This script allows you to analyze the posts on a Facebook fanpage and determine the percentage of posts containing a specific keyword.
Thanaraklee/Make_knowledge_graph
This is a website for creating a simple sample knowledge graph.
Thanaraklee/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Thanaraklee/PySpark-Big-Data-RDD-Operations
This project illustrates Apache Spark RDD operations, from creation and transformation to actions and results, enhancing users' understanding of distributed data processing.
Thanaraklee/PySpark-DataFrame-Operations
This project focuses on utilizing PySpark DataFrames to analyze and visualize data sourced from external datasets, such as CSV files. It provides a practical example of how to manipulate, transform, and gain insights from large datasets using the PySpark framework.
Thanaraklee/Web-Scraper-LineNotify
This project scrapes data from a website using BeautifulSoup (bs4) and automates the process with GitHub Actions schedule.