RocMarshal
@Apache StreamPark Committer, @apache Flink & Spark & @alibaba Fluss Contributor
@Shopee @apache Ex @jd-opensourceBeijing
Pinned Repositories
fluss
Fluss is a streaming storage built for real-time analytics.
flink
Apache Flink
incubator-streampark
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
spark
Apache Spark - A unified analytics engine for large-scale data processing
apache-calcite-tutorial
https://blog.csdn.net/QXC1281/article/details/89070285
kafka
Mirror of Apache Kafka
paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
reasearch-bigdata
看书看源码看第三方学习视频
skill-map
程序员技能图谱
SZT-bigdata
深圳地铁大数据客流分析系统🚇🚄🌟
RocMarshal's Repositories
RocMarshal/Alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
RocMarshal/flink
Apache Flink
RocMarshal/hadoop
Apache Hadoop
RocMarshal/kafka
Mirror of Apache Kafka
RocMarshal/RocMarshal
RocMarshal/spark
Apache Spark - A unified analytics engine for large-scale data processing
RocMarshal/incubator-streampark
StreamPark, Make stream processing easier! easy-to-use streaming application development framework and operation platform
RocMarshal/paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
RocMarshal/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
RocMarshal/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
RocMarshal/flink-benchmarks
Benchmarks for Apache Flink
RocMarshal/flink-cdc
Flink CDC is a streaming data integration tool
RocMarshal/flink-connector-jdbc
Apache flink
RocMarshal/flink-connector-rabbitmq
Apache flink
RocMarshal/flink-kubernetes-operator
Apache Flink Kubernetes Operator
RocMarshal/fluss
Fluss is a streaming storage built for real-time analytics.
RocMarshal/HiBench
HiBench is a big data benchmark suite.
RocMarshal/incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
RocMarshal/incubator-streampark-website
Apache streampark Website
RocMarshal/jemalloc
RocMarshal/jmh
https://openjdk.org/projects/code-tools/jmh
RocMarshal/LLM-Dojo
欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓
RocMarshal/Natural_Language_Processing_with_Transformers
Natural Language Processing with Transformers 中译本,最权威Transformers教程
RocMarshal/oneDAL
oneAPI Data Analytics Library (oneDAL)
RocMarshal/pandoc
Universal markup converter
RocMarshal/presto
The official home of the Presto distributed SQL query engine for big data
RocMarshal/proxy
Proxy: Next Generation Polymorphism in C++
RocMarshal/sedona
A cluster computing framework for processing large-scale geospatial data
RocMarshal/SimpleKernel
Simple kernel for learning operating systems. 用于学习操作系统的简单内核
RocMarshal/xv6-riscv
Xv6 for RISC-V