CodingCat

OpenAISeattle

Pinned Repositories

docker-scripts
docker-scripts for daily dev
Language:Shell1 2 10
HappyHadooping
an automatic tool to deploy Hadoop on EC2
Language:Shell6 3 00
HederaInFloodlight
Implementation of Hedera based on Floodlight
Language:Java3 2 04
KittenWhisker
debugging performance issues for Spark applications
Language:C8 3 41
LoadWeaver
a flexible and lightweight workload generator for Hadoop 1.x
Language:Java2 3 01
LongTermFairScheduler
LongTermFairScheduler
Language:Java1 2 00
mininet_stuffs
a fat tree topology developed within mininet env
Language:Python2 2 01
Self-Learning-Notebooks
RLLearning
Language:HTML1 1 00
xgboost4j-spark-scalability
a benchmark to test scalability of xgboost4j-spark and relevant projects
Language:Scala22 7 29
XGBoostExperiments
repo containing XGBoost-based ML project for various purposes
Language:Scala7 2 00

CodingCat's Repositories

CodingCat/xgboost4j-spark-scalability
a benchmark to test scalability of xgboost4j-spark and relevant projects
Language:Scala22 7 29
CodingCat/Self-Learning-Notebooks
RLLearning
Language:HTML1 1 00
CodingCat/spark
Mirror of Apache Spark
Language:Scala1 2 01
CodingCat/analytics-zoo
Distributed Tensorflow, Keras and BigDL on Apache Spark
Language:Jupyter Notebook1 0
CodingCat/arrow-datafusion
Apache Arrow DataFusion and Ballista query engines
Language:Rust1 0
CodingCat/BigDL
BigDL: Distributed Deep Learning Library for Apache Spark
Language:Scala1 0
CodingCat/celeborn-website
Apache Celeborn Site
Language:Shell0 0
CodingCat/cockroachdb-todo-apps
CockroachDB To-Do Apps
Language:Python1 0
CodingCat/cockroachdb_playground
some programs to play around cockroachdb
Language:Python2 0
CodingCat/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Language:Scala1 0
CodingCat/dmlc-core
A common bricks library for building scalable and portable distributed machine learning.
Language:C++1 0
CodingCat/ec2-selector-cli
the cli tool to select ec2 instances based on filters
Language:Rust1 0
CodingCat/frameless
Expressive types for Spark.
Language:Scala0 0
CodingCat/gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
Language:Scala1 0
CodingCat/github-markdown-toc
Easy TOC creation for GitHub README.md
Language:Shell1 0
CodingCat/gluten
Language:Scala0 01
CodingCat/how-query-engines-work
This is the companion repository for the book How Query Engines Work.
Language:Kotlin1 0
CodingCat/iceberg
Apache Iceberg
Language:Java1 0
CodingCat/incubator-celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Language:Java0 0
CodingCat/incubator-sedona
A cluster computing framework for processing large-scale geospatial data
Language:Java1 0
CodingCat/incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
Language:Java0 0
CodingCat/morpheus
Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.
Language:Scala1 0
CodingCat/noisepage
Self-Driving Database Management System from Carnegie Mellon University
Language:C++1 0
CodingCat/rabit
Reliable Allreduce and Broadcast Interface for distributed machine learning
Language:C++1 0
CodingCat/spark-lineage
Spark SQL listener to record lineage information
Language:Scala1 0
CodingCat/spark-sql-macros
Spark SQL Macros provides a mechanism similar to Spark User-Defined function registration; with the key enhancement being that custom code gets compiled to equivalent Catalyst Expressions at macro define time.
Language:Scala1 0
CodingCat/string_encoder
Language:Rust1 0
CodingCat/terraform-aws-eks-node-group
Terraform module to provision a fully managed AWS EKS Node Group
Language:HCL1 0
CodingCat/velox-intel
A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
Language:C++0 0
CodingCat/xgboost
Large-scale and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, on single node, hadoop yarn and more.
Language:C++2 0