Pinned Repositories
100-shell-script-examples
Collection of shell scripts found on the internet
2014
A record of things to do in 2014
airflow-dag-creation-manager-plugin
A plugin for Airflow that creates and manages your DAGs through a web UI.
dataworker-sql-parser
An antlr4-based parser that supports Spark SQL, TiDB SQL, Flink SQL, and Spark/Flink jar run commands
SeanZou's Repositories
SeanZou/dataworker-sql-parser
An antlr4-based parser that supports Spark SQL, TiDB SQL, Flink SQL, and Spark/Flink jar run commands
SeanZou/airflow-dag-creation-manager-plugin
A plugin for Airflow that creates and manages your DAGs through a web UI.
SeanZou/algorithm-pattern
Algorithm templates: the most systematic way to practice problems and the fastest path through them. You deserve it~
SeanZou/algs4
Algorithms, 4th edition textbook code and libraries
SeanZou/applied-ml
📚 Papers of companies sharing their work on applied data science & machine learning.
SeanZou/bash-snippets
Useful bash snippets
SeanZou/bigdata
SeanZou/Data-Pipelines-with-Apache-Airflow
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3
SeanZou/Data-Visualization-Dashboard
Source code for 20 data visualization dashboards
SeanZou/DataSphereStudio
DSS covers scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, task scheduling and data exporting.
SeanZou/datax-web
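A visual web interface for DataX: select data sources to generate data synchronization tasks in one click. Supports batch creation of RDBMS data synchronization tasks, integrates an open-source scheduling system, and supports distributed execution, incremental data synchronization, real-time viewing of run logs, monitoring of executor resources, killing running processes, and encryption of data source information.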
SeanZou/flink-rookie
Code repository for the Flink 菜鸟 (Flink Rookie) WeChat official account
SeanZou/Hands-On-Scala-Programming
Hands-On Scala Programming [Video], published by Packt
SeanZou/handsonscala
Discussion and code examples for the book Hands-on Scala Programming
SeanZou/Hive-JDBC-Proxy
Hive-JDBC-Proxy is a high-performance proxy service for HiveServer2 and Spark ThriftServer. It provides load balancing and rule-based forwarding of Hive JDBC Client requests to HiveServer2 and Spark ThriftServer.
SeanZou/Hive-JDBC-Storage-Handler
Hive Storage Handler for JDBC
SeanZou/incubator-dolphinscheduler
Dolphin Scheduler is a distributed and easy-to-expand visual DAG workflow scheduling system, dedicated to solving the complex dependencies in data processing, making the scheduling system out of the box for data processing.
SeanZou/IntelliJ-IDEA-Tutorial
IntelliJ IDEA tutorial series in Simplified Chinese
SeanZou/kudu-rpm
RPM packages for Apache Kudu on CentOS 7
SeanZou/Linkis
Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...) and exposes various interfaces (REST, JDBC, Java...), with multi-tenancy, high performance, and resource control.
SeanZou/p3c
Alibaba Java Coding Guidelines: PMD implementation and IDE plugin
SeanZou/programming-and-algorithm
The "Programming and Algorithms" specialization offered by Peking University on Coursera
SeanZou/PyHive
Python interface to Hive and Presto. 🐝
SeanZou/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
SeanZou/Reading-Books
save books
SeanZou/Scriptis
Scriptis is for interactive data analysis with script development (SQL, PySpark, HiveQL), task submission (Spark, Hive), UDF, function, resource management and intelligent diagnosis.
SeanZou/shell-examples
Little Bash shell scripting examples
SeanZou/spark-3.0-examples
Examples of Spark 3.0
SeanZou/spark-ranger
ACL Management for Apache Spark SQL with Apache Ranger. This library has been contributed to https://github.com/apache/submarine as a sub-module, and that module can still be used individually. The project here will no longer be updated. If you have any questions please go to https://github.com/apache/submarine/tree/master/docs/submarine-security/spark-security/README.md to learn how to use and give feedback to the apache submarine community by following https://submarine.apache.org/community/contributors.html
SeanZou/tinyid
ID Generator: a distributed ID generation system that is simple to use, high-performance, and highly available