lsyldliu

Apache Flink Committer, focused on Big Data Computing & Storage

Alibaba, Shanghai

Pinned Repositories

airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language:Python0 1 00
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
Language:C++0 2 00
AthenaX
SQL-based streaming analytics platform at scale
Language:Java0 3 00
bahir-flink
Mirror of Apache Bahir Flink
Language:Java0 2 00
beam
Apache Beam is a unified programming model for Batch and Streaming
Language:Java0 2 00
calcite
Mirror of Apache Calcite
Language:Java0 2 00
ceshi
0 3 00
fucking-algorithm
Work through LeetCode problems step by step and lay bare the patterns behind all kinds of algorithms. English version supported! Crack LeetCode, not only how, but also why.
2 2 01
realtime-technology
Real-time data, real-time compute engine, real-time storage engine
9 4 05
spark
Mirror of Apache Spark
Language:Scala1 3 00

lsyldliu's Repositories

lsyldliu/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language:Python0 1 00
lsyldliu/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
Language:C++0 2 00
lsyldliu/differential-dataflow
An implementation of differential dataflow using timely dataflow on Rust.
Language:Rust2 0
lsyldliu/dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Language:Java1 0
lsyldliu/duckdb
DuckDB is an in-process SQL OLAP Database Management System
Language:C++1 0
lsyldliu/flink
Mirror of Apache Flink
Language:Java3 03
lsyldliu/flink-cdc-connectors
Change Data Capture (CDC) Connectors for Apache Flink
Language:Java2 0
lsyldliu/flink-docker
Docker packaging for Apache Flink
Language:Shell2 0
lsyldliu/flink-kubernetes-operator
Apache Flink Kubernetes Operator
Language:Java1 0
lsyldliu/flink-remote-shuffle
Remote Shuffle Service for Flink
Language:Java2 0
lsyldliu/fluss
Fluss is a streaming storage built for real-time analytics.
lsyldliu/gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
Language:Scala2 0
lsyldliu/hudi
Upserts, Deletes And Incremental Processing on Big Data.
Language:Java2 0
lsyldliu/incubator-paimon
Apache Paimon (incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
Language:Java1 0
lsyldliu/jdk
JDK main-line development
Language:Java2 0
lsyldliu/leveldb
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
Language:C++1 0
lsyldliu/lsyldliu
2 0
lsyldliu/mlsql
The Programming Language Designed For Big Data and AI
Language:JavaScript2 0
lsyldliu/nexmark
Benchmarks for queries over continuous data streams.
Language:Java1 0
lsyldliu/papers-we-love
Papers from the computer science community to read and discuss.
Language:Shell2 0
lsyldliu/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Language:Python2 0
lsyldliu/RxJava
RxJava – Reactive Extensions for the JVM – a library for composing asynchronous and event-based programs using observable sequences for the Java VM.
Language:Java2 0
lsyldliu/schema-registry
Confluent Schema Registry for Kafka
Language:Java2 0
lsyldliu/spring-framework
Spring Framework
Language:Java2 0
lsyldliu/streaming-sql
Kubernetes deployments and examples for various streaming SQL implementations
Language:Python2 0
lsyldliu/streamx
Make Flink|Spark easier!!! The original intention of StreamX is to make the development of Flink easier. StreamX focuses on the management of development phases and tasks. Our ultimate goal is to build a one-stop big data solution integrating stream processing, batch processing, data warehouse and data lake.
Language:Java2 0
lsyldliu/tiflash
The analytical engine for TiDB
Language:C++2 0
lsyldliu/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL
Language:Java2 0
lsyldliu/useful-scripts
🐌 Useful scripts for making developers' everyday life easier and happier, involving Java, shell, etc.
Language:Shell2 0
lsyldliu/velox
A new C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.
Language:C++2 0