Pinned Repositories
AgentGPT
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
AkkaNotes_Messaging
Project accompanying Akka Notes - Part 1 (Fire and forget Messaging)
AlfredWorkflow.com
A public Collection of Alfred Workflows.
analytics-zoo
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
apache-arrow-parquet
Apache Arrow and Parquet data format integration for Scala
apache-calcite-tutorial
https://blog.csdn.net/QXC1281/article/details/89070285
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
scala-exercises
The easy way to learn Scala.
fishcus's Repositories
fishcus/delight
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
fishcus/dpkb
大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
fishcus/incubator-yunikorn-scheduler-interface
Apache Yunikorn Common scheduler interface - Incubating
fishcus/tpcds-kit
TPC-DS benchmark kit with some modifications/fixes
fishcus/apache-calcite-tutorial
https://blog.csdn.net/QXC1281/article/details/89070285
fishcus/awesome-spark
A curated list of awesome Apache Spark packages and resources.
fishcus/jetcd
etcd java client
fishcus/pmem-shuffle
Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote persistent memory (for read) to provide extremely high performance and low latency shuffle solutions for Spark*.
fishcus/kubernetes-client
Java client for Kubernetes & OpenShift
fishcus/kubebuilder
Kubebuilder - SDK for building Kubernetes APIs using CRDs
fishcus/mastering-spark-sql-book
The Internals of Spark SQL
fishcus/k8s-spark-scheduler-lib
Kubernetes CRDs and API definitions used by the k8s-spark-scheduler and other related services
fishcus/spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
fishcus/k8s-spark-scheduler
A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes
fishcus/kube-batch
A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC
fishcus/k8s-for-docker-desktop
为Docker Desktop for Mac/Windows开启Kubernetes和Istio。
fishcus/Firestorm
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark applications to store shuffle data on remote servers
fishcus/spark-kubernetes-book
The Internals of Spark on Kubernetes
fishcus/OpenMLDB
OpenMLDB is an open-source database that is designed and optimized to enable data integrity and efficiency for machine learning driven applications. In addition to 10x faster ML application landing experience, OpenMLDB provides unified computing and storage engines to reduce the complexity and cost of development and operation.
fishcus/arrow-datafusion
Apache Arrow DataFusion and Ballista query engines
fishcus/gperftools
Main gperftools repository
fishcus/parquet-mr
Apache Parquet
fishcus/incubator-doris
Apache Doris (Incubating)
fishcus/pulsar
Apache Pulsar - distributed pub-sub messaging system
fishcus/bookkeeper
Apache Bookkeeper
fishcus/FlameGraph
Stack trace visualizer
fishcus/HybridSE
An OpenSource Hybird SQL Engine based on LLVM for hatp, olap, oltp, mpp, sparksql and flinksql
fishcus/gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
fishcus/perf-tools
Performance analysis tools based on Linux perf_events (aka perf) and ftrace
fishcus/sql-ds-cache
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.