Pinned Repositories
analytics-zoo
Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
analytics-zoo.github.io
apache.github.io
Top-level GitHub Pages for the Apache Software Foundation
arrow-data-source
Spark DataSource plugin for reading files in various formats, such as Parquet, into Arrow-compatible columnar vectors.
cloudtik
Cloud scaling platform for distributed analytics and AI on Spark
ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
spark
Apache Spark - A unified analytics engine for large-scale data processing
oap-project.github.io
The OAP project web site
oap-tools
Tools for building and packaging OAP, and for integrating it with public cloud platforms such as AWS EMR, Google Dataproc and K8S.
sql-ds-cache
Spark* plug-in for accelerating Spark* SQL performance by using caching and indexing at the SQL data source layer.
HongW2019's Repositories
HongW2019/oap-cache-test
HongW2019/mkdocs-versioning
A tool that allows for versioning sites built with mkdocs
HongW2019/spark-rapids
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
HongW2019/oap-perf-suite
HongW2019/spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
HongW2019/apache.github.io
Top-level GitHub Pages for the Apache Software Foundation