Pinned Repositories
AgentGPT
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
AkkaNotes_Messaging
Project accompanying Akka Notes - Part 1 (Fire and forget Messaging)
AlfredWorkflow.com
A public Collection of Alfred Workflows.
analytics-zoo
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
apache-arrow-parquet
Apache Arrow and Parquet data format integration for Scala
apache-calcite-tutorial
https://blog.csdn.net/QXC1281/article/details/89070285
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
scala-exercises
The easy way to learn Scala.
fishcus's Repositories
fishcus/remote-shuffle
Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-disks.
fishcus/RemoteShuffleService-1
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
fishcus/pmem-common
Common library for accessing PMEM native library functions including memkind, vmemcache and so on.
fishcus/poseidon
A Firmament-based Kubernetes scheduler
fishcus/chaos-mesh
A Chaos Engineering Platform for Kubernetes.
fishcus/spark-3-volcano
Integration of Apache Spark with Volcano Resource Manager
fishcus/chaosd
A Chaos Engineering toolkit.
fishcus/spark-rqg-archived
spark random query generator
fishcus/sparklens
Qubole Sparklens tool for performance tuning Apache Spark
fishcus/sparkMeasure
This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.
fishcus/chaosmonkey
Chaos Monkey is a resiliency tool that helps applications tolerate random instance failures.
fishcus/spark-dashboard
Tooling to deploy an Apache Spark performance dashboard. Run this on standalone Docker containers or install the helm chart on Kubernetes.
fishcus/jfr-flame-graph
Get Method Sampling from Java Flight Recorder Dump and convert to FlameGraph compatible format.
fishcus/SparkPlugins
Code and examples of how to deploy Apache Spark Plugins with Spark 3.x. This allows extending the Spark metrics systems with user-provided monitoring probes for OS, I/O, and custom libraries/applications.
fishcus/jitwatch
Log analyser / visualiser for Java HotSpot JIT compiler. Inspect inlining decisions, hot methods, bytecode, and assembly. View results in the JavaFX user interface.
fishcus/k8s-source-code-analysis
《k8s-1.13版本源码分析》
fishcus/SparkFE
The native Spark execution engine for AI and Feature Engineering.
fishcus/duckdb
DuckDB is an in-process SQL OLAP Database Management System
fishcus/graal
GraalVM: Run Programs Faster Anywhere :rocket:
fishcus/modin
Modin: Speed up your Pandas workflows by changing a single line of code
fishcus/kylin
Apache Kylin
fishcus/javacpp-presets
The missing Java distribution of native C++ libraries
fishcus/javacpp
The missing bridge between Java and native C++
fishcus/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
fishcus/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
fishcus/God-Of-BigData
大数据面试题,大数据成神之路开启...Flink/Spark/Hadoop/Hbase/Hive...
fishcus/arrow-data-source
Spark DataSouce plugin for reading files from various formats like Parquet into Arrow compatible columnar vectors.
fishcus/jni-rs
Rust bindings to the Java Native Interface — JNI
fishcus/xpocket
fishcus/PL-Compiler-Resource
程序语言与编译技术相关资料(持续更新中)