Pinned Repositories
craw-news
Project: News Article Crawling and Scheduling with NestJS and MongoDB Objective Develop a system to crawl news articles from specified sources and schedule these tasks to run periodically. The project will use NestJS for the backend framework and MongoDB
tictactoe-demo
Project: Tic-Tac-Toe Game Using Python Objective Create a simple Tic-Tac-Toe game using Python, employing the tkinter library for the graphical user interface and random to handle the computer's moves (in single-player mode).
web-scraping
The process includes steps from data collection (web scraping), data processing with PySpark, to process management with Apache Airflow. You can expand this project by adding more complex data processing tasks or deploying the process on different schedules through Airflow.
Realtime-Slotions
Project: Sentiment Analysis Using Python Objective Develop a sentiment analysis system to classify text into different sentiment categories (e.g., positive, negative, neutral) using Python. The system will leverage various libraries for natural language processing and machine learning.
e2e-data-engineering
This project serves as a comprehensive guide to building an end-to-end data engineering pipeline. It covers each stage from data ingestion to processing and finally to storage, utilizing a robust tech stack that includes Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra.
logs-analyzer
The system is designed to handle large volumes of log data efficiently and provides real-time analysis, enabling quick identification of issues and trends
tranhuy25
Config files for my GitHub profile.
tranhuy25's Repositories
tranhuy25/logs-analyzer
The system is designed to handle large volumes of log data efficiently and provides real-time analysis, enabling quick identification of issues and trends
tranhuy25/e2e-data-engineering
This project serves as a comprehensive guide to building an end-to-end data engineering pipeline. It covers each stage from data ingestion to processing and finally to storage, utilizing a robust tech stack that includes Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra.
tranhuy25/Realtime-Slotions
Project: Sentiment Analysis Using Python Objective Develop a sentiment analysis system to classify text into different sentiment categories (e.g., positive, negative, neutral) using Python. The system will leverage various libraries for natural language processing and machine learning.
tranhuy25/web-scraping
The process includes steps from data collection (web scraping), data processing with PySpark, to process management with Apache Airflow. You can expand this project by adding more complex data processing tasks or deploying the process on different schedules through Airflow.
tranhuy25/tranhuy25
Config files for my GitHub profile.
tranhuy25/tictactoe-demo
Project: Tic-Tac-Toe Game Using Python Objective Create a simple Tic-Tac-Toe game using Python, employing the tkinter library for the graphical user interface and random to handle the computer's moves (in single-player mode).
tranhuy25/craw-news
Project: News Article Crawling and Scheduling with NestJS and MongoDB Objective Develop a system to crawl news articles from specified sources and schedule these tasks to run periodically. The project will use NestJS for the backend framework and MongoDB