Pinned Repositories
FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
attention_superoptimizer
An Attention Superoptimizer
mirage
A multi-level tensor algebra superoptimizer
wmdi's Repositories
wmdi/mirage
A multi-level tensor algebra superoptimizer
wmdi/academicpages
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
wmdi/cmu-catalyst.github.io
wmdi/FlexFlow
A distributed deep learning framework that supports flexible parallelization strategies.
wmdi/llama.cpp
Port of Facebook's LLaMA model in C/C++
wmdi/OS2021_Fall
wmdi/wmdi.github.io
Jekyll theme for building a personal site, blog, project documentation, or portfolio.