marsupialtail's Stars
vnatesh/CAKE_on_CPU
CAKE Library for constant-bandwidth matrix multiplication on CPUs
bytedance/byteps
A high performance and generic framework for distributed DNN training
apache/arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
dmlc/treelite
Universal model exchange and serialization format for decision tree forests
CODAIT/graph_def_editor
GraphDef Editor: A port of the TensorFlow contrib.graph_editor package that operates over serialized graphs
microsoft/lightgbm-benchmark
Benchmark tools for LightGBM
WojciechMula/sse-popcount
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
rogersce/cnpy
library to read/write .npy and .npz files in C/C++
microsoft/hummingbird
Hummingbird compiles trained ML models into tensor computation for faster inference.
realpython/materials
Bonus materials, exercises, and example projects for our Python tutorials
StanfordSNR/gg
The Stanford Builder
marsupialtail/sparsednn
Fast sparse deep learning on CPUs
nelhage/llama
janakiramm/serverless_inference
Hosting PyTorch models in AWS Lambda backed by Amazon EFS
simdjson/simdjson
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
neuralmagic/deepsparse
Sparsity-aware deep learning inference runtime for CPUs
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
dillonhuff/clockwork
A polyhedral compiler for hardware accelerators