david-macleod's Stars
tatref/linux-mem
Linux memory tools
McHughes288/evals_template
Template for any evals project using LLM APIs
Lawrencium77/Obsidian-Notes
Public copy of notes I've made on various CS and ML topics.
speechmatics/speechmatics-js-sdk
JavaScript and TypeScript SDK for Speechmatics
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
speechmatics/speechmatics-python
Python library and CLI for Speechmatics
ggerganov/ggml
Tensor library for machine learning
anadim/the-little-retrieval-test
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
jundaf2/INT8-Flash-Attention-FMHA-Quantization
termux/termux-app
Termux - a terminal emulator application for Android OS extendible by variety of packages.
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
sol0invictus/cuda-experiments
Some experiments playing around with cuBLAS, cuDNN and CUDA
Guangxuan-Xiao/torch-int
This repository contains integer operators on GPUs for PyTorch.
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
erees1/dotfiles
zsh, vim and other configs for mac and linux
PINTO0309/simple-onnx-processing-tools
A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change opset, change to the specified input order, addition of OP, RGB to BGR conversion, change batch size, batch rename of OP, and JSON conversion for ONNX models.
apple/ml-ane-transformers
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
charmbracelet/gum
A tool for glamorous shell scripts 🎀
chalk-diagrams/chalk
A declarative drawing API in Python
gaogaotiantian/viztracer
VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your Python code execution.
BaguaSys/bagua
Bagua speeds up PyTorch
triton-lang/triton
Development repository for the Triton language and compiler
google-research/google-research
Google Research
laekov/fastmoe
A fast MoE impl for PyTorch
magjac/d3-graphviz
Graphviz DOT rendering and animated transitions using D3
jettify/pytorch-optimizer
torch-optimizer -- collection of optimizers for PyTorch
rmccorm4/tensorrt-utils
⚡ Useful scripts when using TensorRT
ahkarami/Deep-Learning-in-Production
In this repository, I will share some useful notes and references about deploying deep learning-based models in production.