Pinned Repositories
byungsoo-oh.github.io
byungsoo-oh.github.io.old
Build a Jekyll blog in minutes, without touching the command line.
computernetworks-fa24
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
FastFlow
FastFlow is a system that automatically detects CPU bottlenecks in deep learning training pipelines and resolves the bottlenecks with data pipeline offloading to remote resources .
ml-systems-papers
Curated collection of papers in machine learning systems
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
spark
Apache Spark - A unified analytics engine for large-scale data processing
tensorflow
An Open Source Machine Learning Framework for Everyone
byungsoo-oh's Repositories
byungsoo-oh/ml-systems-papers
Curated collection of papers in machine learning systems
byungsoo-oh/byungsoo-oh.github.io
byungsoo-oh/byungsoo-oh.github.io.old
Build a Jekyll blog in minutes, without touching the command line.
byungsoo-oh/computernetworks-fa24
byungsoo-oh/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
byungsoo-oh/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
byungsoo-oh/druid
Apache Druid: a high performance real-time analytics database.
byungsoo-oh/FastFlow
FastFlow is a system that automatically detects CPU bottlenecks in deep learning training pipelines and resolves the bottlenecks with data pipeline offloading to remote resources .
byungsoo-oh/nccl
Optimized primitives for collective multi-GPU communication
byungsoo-oh/petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
byungsoo-oh/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
byungsoo-oh/spark
Apache Spark - A unified analytics engine for large-scale data processing
byungsoo-oh/tensorflow
An Open Source Machine Learning Framework for Everyone
byungsoo-oh/tensorflow-alpa
byungsoo-oh/zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.