EricGao888
PMC-member@Apache DolphinScheduler, SDE@Alibaba Cloud, Ex-SDE@Amazon, Alumni@Purdue, Alumni@SJTU
@alibabaShanghai, China
Pinned Repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
airflow-local-dev-5-min
Set up airflow local dev env in 5 minutes
dbt-spark
dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
JavaWeb
Web Development based on MVC using Servlet, JSP, JSTL and JDBC
MachineLearning
Basic machine learning algorithm implemented from scratch
Network
Network Projects and System Programming
open-source-guides
Contribute to or build an open-source community
EricGao888's Repositories
EricGao888/dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
EricGao888/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
EricGao888/airflow-local-dev-5-min
Set up airflow local dev env in 5 minutes
EricGao888/dbt-spark
dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks
EricGao888/airflow_alibaba_provider
airflow_alibaba_provider
EricGao888/aquaman
Take a guess what I'm going to do with this repo : )
EricGao888/cassandra
Mirror of Apache Cassandra
EricGao888/chatgpt-java
ChatGPT SDK and CLI for Java
EricGao888/dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
EricGao888/DocsGPT
GPT-powered chat for documentation search & assistance.
EricGao888/dolphinscheduler-website
Apache DolphinScheduler website
EricGao888/EricGao888
EricGao888/hive
Apache Hive
EricGao888/hue
Open source SQL Query Assistant service for Databases/Warehouses
EricGao888/jaffle-shop-classic
A self-contained dbt project for testing purposes
EricGao888/jdk
JDK main-line development https://openjdk.org/projects/jdk
EricGao888/kafka
Mirror of Apache Kafka
EricGao888/keda
KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes
EricGao888/kubernetes-client
Java client for Kubernetes & OpenShift
EricGao888/kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
EricGao888/LearningSparkV2
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
EricGao888/linkis
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
EricGao888/mage-ai
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
EricGao888/micrometer
An application metrics facade for the most popular monitoring tools. Think SLF4J, but for metrics.
EricGao888/OpenLineage
An Open Standard for lineage metadata collection
EricGao888/quartz
Code for Quartz Scheduler
EricGao888/skywalking
APM, Application Performance Monitoring System
EricGao888/spark
Apache Spark - A unified analytics engine for large-scale data processing
EricGao888/spark-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
EricGao888/zookeeper
Apache ZooKeeper