Pinned Repositories
algorithms-1
Minimal examples of data structures and algorithms in Python
awesome-python
A curated list of awesome Python frameworks, libraries, software and resources
AWS_SageMaker
Personal guide and examples to learn and use AWS SageMaker to deploy your ML model at scale.
ETL_with_airflow
Self-edited Airflow tutorial based on the "ETL Best Practices with Airflow" repository.
Git-Influencer
Insight Data Engineering project: a platform built on HDFS, Spark, and Airflow to help you find social influencers in the GitHub network.
kafka-connect-github-source
Get a stream of issues and pull requests for your chosen GitHub repository
Multithreading_python
Tutorials and collections on multithreading and async in Python
Presto_Hands_on_tutorials
Collections and sample code for learning PrestoDB.
Realtime-Stock-Monitoring
Real-time stock data monitoring platform: a practice project using Kafka, Cassandra, and Spark.
Scala-Spark
Spark Streaming and Machine Learning with Scala.
chuqiaoshen's Repositories
chuqiaoshen/Social-network-analysis
Social media data, from mining to modeling
chuqiaoshen/Bitly
Playing with Bitly
chuqiaoshen/Computer-Vision
chuqiaoshen/Data-Cleaning-FeatureEngineering-Visualization
Data visualization experiments using Python
chuqiaoshen/data-engineering-ecosystem
Repo for migrating the old wiki, especially developer content and code examples
chuqiaoshen/Healthcare-datasets-analysis
Analysis of healthcare-related datasets
chuqiaoshen/JupyterStyle
A custom, minimalist stylesheet for Jupyter notebook
chuqiaoshen/kube-airflow
A Docker image and Kubernetes config files to run Airflow on Kubernetes
chuqiaoshen/Morvan-Python-Tutorials
Notes about Morvan Python Tutorials
chuqiaoshen/MXNet-Gluon
chuqiaoshen/NLP-projects
chuqiaoshen/PyData-New-York-City-2017
chuqiaoshen/Python-and-Spark
chuqiaoshen/python-github3
[In Progress] Python wrapper for the new GitHub API.
chuqiaoshen/PythonToScala
A short guide for transitioning from Python to Scala
chuqiaoshen/Scrapy-Projects
chuqiaoshen/Text-Mining-in-Python
chuqiaoshen/Timeseries-analysis
Time series learning and a Kaggle time series competition