Pinned Repositories
Automation-tool
AWS-Hive-implementation
Integrating PySpark with Apache Hive to perform ETL(Extract-Transform-Load) and ELT(Extract-Load-Transform) operations
Build-A-Streaming-Data-Pipeline-by-using-Azure-Stream-Analytics
Building a pipeline in Azure platform by using Azure Stream Analytics, Azure Event Hub, and Azure SQL database to perform data analysis on the transportation dataset, then creating an interactive dashboard in Power BI.
CI-CD-search-engine
CI/CD pipeline for search model (Jenkins)
Data-pipeline-in-Azure-and-use-Synapse-Analysis
Analyze the 2021 Olympics dataset, which includes the information of participating Teams, Athletes, Coaches, and Entries by gender
Design-a-Data-Warehouse-for-analysing-E-Commerce-shopping-pattern
Creating an AWS EC2 instance, use docker to apply tools SQL, Sqoop, Spark, and Hive to ingest, transform and analyze data
IRP-Trust-Prediction-Deploy-by-FlaskAPI
Applying multi-classification algorithm to figure out high accuracy model and creating API to deploy it
Movies-Data-Processing-with-Spark-SQL-using-Scala-on-AWS-EC2
Analyzing the Movies and Ratings Dataset by using Spark SQL and Scala programming language
NLP--room-service-chatbot
Use NLTK libraries to build a text classification chatbot
Retail-price-optimization
Implementing OLE model to estimate unknown parameter in linear regression and evaluate the max profit through price elasticity
spanner4715's Repositories
spanner4715/Design-a-Data-Warehouse-for-analysing-E-Commerce-shopping-pattern
Creating an AWS EC2 instance, use docker to apply tools SQL, Sqoop, Spark, and Hive to ingest, transform and analyze data
spanner4715/Movies-Data-Processing-with-Spark-SQL-using-Scala-on-AWS-EC2
Analyzing the Movies and Ratings Dataset by using Spark SQL and Scala programming language
spanner4715/Retail-price-optimization
Implementing OLE model to estimate unknown parameter in linear regression and evaluate the max profit through price elasticity
spanner4715/Automation-tool
spanner4715/AWS-Hive-implementation
Integrating PySpark with Apache Hive to perform ETL(Extract-Transform-Load) and ELT(Extract-Load-Transform) operations
spanner4715/Build-A-Streaming-Data-Pipeline-by-using-Azure-Stream-Analytics
Building a pipeline in Azure platform by using Azure Stream Analytics, Azure Event Hub, and Azure SQL database to perform data analysis on the transportation dataset, then creating an interactive dashboard in Power BI.
spanner4715/CI-CD-search-engine
CI/CD pipeline for search model (Jenkins)
spanner4715/Data-pipeline-in-Azure-and-use-Synapse-Analysis
Analyze the 2021 Olympics dataset, which includes the information of participating Teams, Athletes, Coaches, and Entries by gender
spanner4715/IRP-Trust-Prediction-Deploy-by-FlaskAPI
Applying multi-classification algorithm to figure out high accuracy model and creating API to deploy it
spanner4715/NLP--room-service-chatbot
Use NLTK libraries to build a text classification chatbot
spanner4715/Create-Scalable-CD-CI-pipeline-to-deploy-ML-model--by-using-Azure-Devops
Create a CI/CD pipeline of the Azure DevOps, enables Machine learning app runs faster
spanner4715/Deploy-Mongo-Express-and-MongoDB
spanner4715/E-commerce-online-microservices-deployment-Helm-
spanner4715/Flask-API-example-with-ML-model-GCP
spanner4715/github-actions-python-test
spanner4715/github-actions-test
spanner4715/Group-project-CNN_Doornumber_classification
Build CNN model to make classification in image
spanner4715/java-maven-app
spanner4715/my-project
spanner4715/pvc-autoscaler
PVC Autoscaler is an open-source project aimed at providing autoscaling functionality to Persistent Volume Claims (PVCs) in Kubernetes environments.
spanner4715/Real-time-log-analysis-with-Spark-streaming-and-Kafka
Real-time log analysis with the visualization web app
spanner4715/Recommender-system-built-by-using-Collaborative-filtering-algorithm
Use a ratings dataset to build a product recommendation system using collaborative filtering
spanner4715/Store_sales_prediction-Kaggle-
Use a set of regression model to evaluate the correlation parameters with "sales", R-squared value & Mean Absolute Error are used for assessing model performance
spanner4715/Terraform_aws
spanner4715/terraform_aws_ecs
spanner4715/Using-AWS-S3-and-MySQL-to-build-a-data-pipeline-and-perform-ETL
Build a data pipeline and perform ETL operations by using AWS S3 and MySQL
spanner4715/Website-Monitoring-using-AWS-Lambda-and-Aurora
Real-time monitoring of websites using AWS services