shellyhh

shellyhh's Stars

simbafl/DataWarehouse
从数据仓库到用户画像，从数据建设到数据应用
550160
apache/apisix
The Cloud-Native API Gateway
Language:Lua14.6k2.5k
debezium/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
Language:Java10.9k2.6k
apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
Language:Java5.5k2.4k
GradleUp/shadow
Gradle plugin to create fat/uber JARs, apply file transforms, and relocate packages for applications and libraries. Gradle version of Maven's Shade plugin.
Language:Kotlin3.8k403
linkease/ddnsto-openwrt
ddnsto for openwrt
Language:Makefile2916
alievk/avatarify-python
Avatars for Zoom, Skype and other video-conferencing apps.
Language:Python16.3k4.1k
plasma-umass/scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Language:Python12.3k402
mingrammer/diagrams
:art: Diagram as Code for prototyping cloud system architectures
Language:Python40k2.6k
jupyter/jupyter
Jupyter metapackage for installation, docs and chat
Language:Python15k4.1k
shijinkui/spark_study
spark源码学习
30594
MoRan1607/BigDataGuide
大数据学习，从零开始学习大数据，包含大数据学习各阶段学习视频、面试资料
2.8k885
stupidloud/nanopi-openwrt
Openwrt for Nanopi R1S R2S R4S R5S 香橙派 R1 Plus 固件编译纯净版与大杂烩
Language:Shell5.5k2.7k
endymecy/spark-config-and-tuning
spark性能调优总结 spark config and tuning
12172
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Language:Python17.9k1.7k
apache/seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Language:Java8.2k1.9k
MarquezProject/marquez-airflow
Airflow support for Marquez
Language:Python3213
MarquezProject/marquez
Collect, aggregate, and visualize a data ecosystem's metadata
Language:Java1.8k325
microsoft/Bringing-Old-Photos-Back-to-Life
Bringing Old Photo Back to Life (CVPR 2020 oral)
Language:Python15.2k2k
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python53.1k8.8k
soimort/you-get
:arrow_double_down: Dumb downloader that scrapes the web
Language:Python54.2k9.7k
geekxh/hello-algorithm
🌍 针对小白的算法训练 | 包括四部分：①.大厂面经 ②.力扣图解 ③.千本开源电子书 ④.百张技术思维导图（项目花了上百小时，希望可以点 star 支持，🌹感谢~）推荐免费ChatGPT使用网站
Language:Java35.4k6.5k
Qihoo360/XSQL
Unified SQL Analytics Engine Based on SparkSQL
Language:Scala21062
zhisheng17/flink-learning
flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例，还有 Flink 落地应用的大型项目案例（PVUV、日志存储、百亿数据实时去重、监控告警）分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Language:Java14.6k3.9k
hashicorp/vagrant
Vagrant is a tool for building and distributing development environments.
Language:Ruby26.4k4.4k
byzer-org/byzer-lang
Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.
Language:Scala1.8k548
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python34.6k5.9k
allwefantasy/spark-binlog
A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).
Language:Scala15454
spark-jobserver/spark-jobserver
REST job server for Apache Spark
Language:Scala2.8k994
apache/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
Language:Java12.9k3.3k