RocMarshal
@Apache StreamPark Committer, Flink & Spark Contributor
@Shopee @apache Ex @jd-opensource
Beijing
Pinned Repositories
fluss
Fluss is a streaming storage built for real-time analytics.
flink
Apache Flink
spark
Apache Spark - A unified analytics engine for large-scale data processing
streampark
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
apache-calcite-tutorial
https://blog.csdn.net/QXC1281/article/details/89070285
kafka
Mirror of Apache Kafka
paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
reasearch-bigdata
Reading books, reading source code, and watching third-party learning videos
skill-map
Programmer skill map
SZT-bigdata
Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟
RocMarshal's Repositories
RocMarshal/flink
Apache Flink
RocMarshal/hadoop
Apache Hadoop
RocMarshal/kafka
Mirror of Apache Kafka
RocMarshal/spark
Apache Spark - A unified analytics engine for large-scale data processing
RocMarshal/paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
RocMarshal/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
RocMarshal/async-profiler
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
RocMarshal/awesome-chatgpt-prompts-zh
A Chinese-language ChatGPT prompting guide. Usage guides for a variety of scenarios. Learn how to make it do what you ask.
RocMarshal/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
RocMarshal/clash-for-linux
clash-for-linux
RocMarshal/CloudSimPy
CloudSimPy: Datacenter job scheduling simulation framework
RocMarshal/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
RocMarshal/flink-benchmarks
Benchmarks for Apache Flink
RocMarshal/flink-connector-jdbc
Apache Flink
RocMarshal/flink-kubernetes-operator
Apache Flink Kubernetes Operator
RocMarshal/fluss
Fluss is a streaming storage built for real-time analytics.
RocMarshal/HiBench
HiBench is a big data benchmark suite.
RocMarshal/LLM-Dojo
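Welcome to LLM-Dojo, an open-source place for learning large language models. It uses concise, easy-to-read code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), RLHF frameworks (DPO/CPO/KTO/PPO), and other features. 👩🎓👨🎓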
RocMarshal/Natural_Language_Processing_with_Transformers
Chinese translation of Natural Language Processing with Transformers, the most authoritative Transformers tutorial
RocMarshal/oneDAL
oneAPI Data Analytics Library (oneDAL)
RocMarshal/opennlp
Apache OpenNLP
RocMarshal/presto
The official home of the Presto distributed SQL query engine for big data
RocMarshal/proxy
Proxy: Next Generation Polymorphism in C++
RocMarshal/sedona
A cluster computing framework for processing large-scale geospatial data
RocMarshal/SimpleKernel
Simple kernel for learning operating systems.
RocMarshal/Startup-CTO-Handbook
The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams
RocMarshal/streaming-benchmarks
Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...
RocMarshal/streampark
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
RocMarshal/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
RocMarshal/xv6-riscv
Xv6 for RISC-V