Pinned Repositories
AhoCorasickDoubleArrayTrie
An implemention of Aho Corasick algorithm based on Double Array Trie.
Alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
CRF
CRF is a Java implementation of Conditional Random Fields, an algorithm for learning from labeled sequences of examples. It also includes an implementation of Maximum Entropy learning.
fast_trie
A super fast, efficiently stored Trie for Ruby. Uses libdatrie.
nilsimsa
A distance based hash (one where similar input gives similar output, the opposite of a cryptographic hash), suitable for text applications.
spark
Mirror of Apache Spark
zen
Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logistic regression, latent dirichilet allocation, DNN, and gradient boosting decision tree.
witgo's Repositories
witgo/spark
Mirror of Apache Spark
witgo/Alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
witgo/alluxio
Alluxio, formerly Tachyon, Unify Data at Memory Speed
witgo/async-profiler
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
witgo/bclm
macOS command-line utility to limit max battery charge
witgo/byteps
A high performance and general PS framework for distributed training
witgo/cassandra
Mirror of Apache Cassandra
witgo/charts
Curated applications for Kubernetes
witgo/docker-java
Java Docker API Client
witgo/Firestorm
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark applications to store shuffle data on remote servers
witgo/flink-remote-shuffle
Remote Shuffle Service for Flink
witgo/flinkStreamSQL
基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
witgo/flinkx
基于flink的分布式数据同步工具
witgo/grpc-java
The Java gRPC implementation. HTTP/2 based RPC
witgo/iceberg
Apache Iceberg
witgo/incubator-livy
Mirror of Apache livy (Incubating)
witgo/incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
witgo/jib
🏗 Build container images for your Java applications.
witgo/k8s-for-docker-desktop
为Docker Desktop for Mac/Windows开启Kubernetes和Istio - Enable Kubernetes/Istio on Docker Desktop in China
witgo/kubernetes
Production-Grade Container Scheduling and Management
witgo/kubernetes-client
Java client for Kubernetes & OpenShift 3
witgo/kyuubi
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache Spark
witgo/OAP
Optimized Analytics Package for Spark* Platform
witgo/openjdk-docker
Scripts for creating Docker images of OpenJDK binaries.
witgo/RemoteShuffleService
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
witgo/ShadowsocksX-NG
Next Generation of ShadowsocksX
witgo/sofa-jraft
A production-grade java implementation of RAFT consensus algorithm.
witgo/splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
witgo/tensorflow-on-arm
TensorFlow for Arm
witgo/volcano
A Kubernetes Native Batch System