Pinned Repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
antd-admin
An admin dashboard application demo built upon Ant Design and Dva.js
aws_notebook
aws notebook
dask
Parallel computing with task scheduling
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs out of the box.
dva
🌱 React and redux based, lightweight and elm-style framework. (Inspired by elm and choo)
dva-example-user-dashboard
👲 👬 👨‍👩‍👧 👨‍👩‍👦‍👦
elyra
Elyra extends JupyterLab Notebooks with an AI-centric approach.
enterprise_gateway
A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.
hadoop-yarn-api-python-client
Python client for Hadoop® YARN API
abzymeinsjtu's Repositories
abzymeinsjtu/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
abzymeinsjtu/aws_notebook
aws notebook
abzymeinsjtu/dask
Parallel computing with task scheduling
abzymeinsjtu/dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs out of the box.
abzymeinsjtu/dva-example-user-dashboard
👲 👬 👨‍👩‍👧 👨‍👩‍👦‍👦
abzymeinsjtu/elyra
Elyra extends JupyterLab Notebooks with an AI-centric approach.
abzymeinsjtu/enterprise_gateway
A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.
abzymeinsjtu/hadoop-yarn-api-python-client
Python client for Hadoop® YARN API
abzymeinsjtu/hudi
Upserts, Deletes And Incremental Processing on Big Data.
abzymeinsjtu/hue
Open source SQL Query Assistant service for Databases/Warehouses
abzymeinsjtu/incubator-livy
Mirror of Apache Livy (Incubating)
abzymeinsjtu/incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
abzymeinsjtu/ipython-sql
%%sql magic for IPython, hopefully evolving into full SQL client
abzymeinsjtu/jupyter_client
Jupyter protocol client APIs
abzymeinsjtu/jupyter_http_over_ws
abzymeinsjtu/jupyter_server
The backend—i.e. core services, APIs, and REST endpoints—to Jupyter web applications.
abzymeinsjtu/jupyterhub
Multi-user server for Jupyter notebooks
abzymeinsjtu/mage-ai
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
abzymeinsjtu/python
Official Python client library for Kubernetes
abzymeinsjtu/remote_provisioners
abzymeinsjtu/scala
Scala 2 compiler and standard library. For bugs, see scala/bug
abzymeinsjtu/skein
A tool and library for easily deploying applications on Apache YARN
abzymeinsjtu/spark
Apache Spark - A unified analytics engine for large-scale data processing
abzymeinsjtu/spring-security-samples
abzymeinsjtu/sqlalchemy-clickhouse
abzymeinsjtu/sudospawner
Spawn JupyterHub single-user servers with sudo
abzymeinsjtu/superset
Apache Superset is a Data Visualization and Data Exploration Platform
abzymeinsjtu/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
abzymeinsjtu/watchdog
Python library and shell utilities to monitor filesystem events.
abzymeinsjtu/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow