Pinned Repositories
analytics-zoo
Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
analytics-zoo.github.io
apache.github.io
Top-level GitHub Pages for the Apache Software Foundation
arrow-data-source
Spark DataSource plugin for reading files from various formats like Parquet into Arrow-compatible columnar vectors.
cloudtik
Cloud scaling platform for distributed analytics and AI on Spark
ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
spark
Apache Spark - A unified analytics engine for large-scale data processing
oap-project.github.io
The OAP project web site
oap-tools
Tools for building and packaging OAP, and for OAP public cloud integrations such as AWS EMR, Google Dataproc and K8S.
sql-ds-cache
Spark* plug-in for accelerating Spark* SQL performance by using caching and indexing at the SQL data source layer.
HongW2019's Repositories
HongW2019/analytics-zoo
Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
HongW2019/analytics-zoo.github.io
HongW2019/arrow-data-source
Spark DataSource plugin for reading files from various formats like Parquet into Arrow-compatible columnar vectors.
HongW2019/cloudtik
Cloud scaling platform for distributed analytics and AI on Spark
HongW2019/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
HongW2019/spark
Apache Spark - A unified analytics engine for large-scale data processing
HongW2019/best-of
🏆 Discover best-of lists with awesome open-source projects on all kinds of topics.
HongW2019/best-of-jupyter
🏆 A ranked list of awesome Jupyter Notebook, Hub and Lab projects (extensions, kernels, tools). Updated weekly.
HongW2019/bootstrap
The most popular HTML, CSS, and JavaScript framework for developing responsive, mobile-first projects on the web.
HongW2019/gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
HongW2019/HiBench
HiBench is a big data benchmark suite.
HongW2019/models
Model Zoo for Intel® Architecture: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors
HongW2019/OAP
Optimized Analytics Package for Spark Platform
HongW2019/OAP-1.0.0
HongW2019/OAP-all
HongW2019/OAP-bot
Scripts added to OAP for better automation
HongW2019/OAP-Cache
HongW2019/oap-mllib
Optimized Spark package to accelerate machine learning algorithms in Apache Spark MLlib.
HongW2019/oap-project.github.io
The OAP project web site
HongW2019/OAP-spark2.4.3
OAP 0.6 with support for Spark 2.4.3
HongW2019/OAP-test
HongW2019/oap-tools
HongW2019/pagestest
HongW2019/pmem-common
Common library for accessing PMEM native library functions including memkind, vmemcache and so on.
HongW2019/pmem-shuffle
Spark* shuffle plugin that supports shuffling through remote persistent memory over fabrics, leveraging the RDMA network and remote persistent memory (for read) to provide extremely high-performance, low-latency shuffle solutions for Spark*.
HongW2019/pmem-spill
Spark plug-in package for accelerating Spark runtime spill functions using PMem, such as the RDD cache PMem extension.
HongW2019/remote-shuffle
Spark* shuffle plugin that supports shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local disks.
HongW2019/scripts
HongW2019/sql-ds-cache
Spark* plug-in for accelerating Spark* SQL performance by using caching and indexing at the SQL data source layer.
HongW2019/tutorial-template
A template for Read the Docs