ponggung's Stars
reata/sqllineage
SQL Lineage Analysis Tool powered by Python
spark-examples/pyspark-examples
Pyspark RDD, DataFrame and Dataset Examples in Python language
graviraja/MLOps-Basics
dexplo/bar_chart_race
Create animated bar chart races in Python with matplotlib
l12203685/pymafe
Python for MAE/MFE Analysis
MrMimic/data-scientist-roadmap
Toturials coming with the "data science roadmap" picture.
datastacktv/data-engineer-roadmap
Roadmap to becoming a data engineer in 2021
FinMind/FinMind
Open Data, more than 50 financial data. 提供超過 50 個金融資料(台股為主),每天更新 https://finmind.github.io/
rasbt/python-machine-learning-book-3rd-edition
The "Python Machine Learning (3rd edition)" book code repository
yehjames/python_machine_learning_tutorial
python machine learning tutorial
ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
finlab-python/finlab_crypto
Documentation
erlcssont29i/Expanded-knowledge-for-data-analysis
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
GoogleCloudPlatform/training-data-analyst
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
apache/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
GeneralMills/pytrends
Pseudo API for Google Trends
curlconverter/curlconverter
Transpile curl commands into Python, JavaScript and 27 other languages
pypa/pipenv
Python Development Workflow for Humans.
PacktPublishing/Mastering-Python-Design-Patterns-Second-Edition
Mastering-Python-Design-Patterns-Second-Edition, published by Packt
databricks/koalas
Koalas: pandas API on Apache Spark
TaiwanSparkUserGroup/spark-programming-guide-zh-tw
Spark 編程指南繁體中文版
crawles/spark-nba-analytics
Analyzing NBA data using Spark 2.1
TritonHo/slides
it is a repository to store all slides used by Triton Ho's public presentation and course.
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Netflix/metaflow
Open Source Platform for developing, scaling and deploying serious ML, AI, and data science systems
ckiplab/ckiptagger
CKIP Neural Chinese Word Segmentation, POS Tagging, and NER
graphistry/pygraphistry
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
chrislgarry/Apollo-11
Original Apollo 11 Guidance Computer (AGC) source code for the command and lunar modules.
metabase/metabase
The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum: