cody-moveworks's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
wagoodman/dive
A tool for exploring each layer in a Docker image
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
guidance-ai/guidance
A guidance language for controlling large language models.
onnx/onnx
Open standard for machine learning interoperability
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
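LoRA, one of the adapter methods PEFT implements, freezes the pretrained weight matrix and trains only a low-rank update, so the forward pass becomes y = Wx + (α/r)·B(Ax). A toy pure-Python sketch of that forward pass (shapes and names are illustrative, not the PEFT API):

```python
def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in M]

def lora_forward(x, W, A, B, alpha=1.0):
    """LoRA forward pass: y = W x + (alpha / r) * B (A x).

    W is frozen (d_out x d_in); only A (r x d_in) and B (d_out x r)
    are trained, i.e. r * (d_in + d_out) parameters instead of d_in * d_out.
    """
    r = len(A)  # rank of the update = number of rows of A
    base = matvec(W, x)            # frozen pretrained path
    low = matvec(B, matvec(A, x))  # low-rank trained path
    return [b + (alpha / r) * l for b, l in zip(base, low)]
```

With rank r = 1 on a 2x2 layer, the update costs 4 trained numbers instead of 4 frozen ones; the savings grow quadratically with layer width.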
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inference solution.
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
TimDettmers/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
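The core idea behind k-bit quantization is mapping floating-point weights onto a small integer grid with a per-tensor scale. A minimal sketch of symmetric absmax int8 quantization, the basic scheme that bitsandbytes' k-bit routines build on (a pure-Python illustration, not the library's API):

```python
def quantize_absmax_int8(xs):
    """Symmetric absmax quantization: scale floats into the int8 range [-127, 127]."""
    scale = max(abs(x) for x in xs) / 127.0
    if scale == 0.0:  # all-zero tensor: nothing to scale
        return [0 for _ in xs], 0.0
    return [round(x / scale) for x in xs], scale

def dequantize(qs, scale):
    """Recover approximate floats from int8 codes and the stored scale."""
    return [q * scale for q in qs]
```

Each value is stored as one int8 plus a shared float scale; the round-trip error is bounded by half a quantization step (scale / 2).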
jcjohnson/pytorch-examples
Simple examples to introduce PyTorch
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
kelvins/awesome-mlops
:sunglasses: A curated list of awesome MLOps tools
adapter-hub/adapters
A Unified Library for Parameter-Efficient and Modular Transfer Learning
huggingface/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
VertaAI/modeldb
Open Source ML Model Versioning, Metadata, and Experiment Management
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
brainsik/virtualenv-burrito
One command to have a working virtualenv + virtualenvwrapper environment.
feifeibear/LLMSpeculativeSampling
Fast inference from large language models via speculative decoding
idiap/importance-sampling
Code for experiments regarding importance sampling for training neural networks
xuyxu/Soft-Decision-Tree
PyTorch implementation of "Distilling a Neural Network Into a Soft Decision Tree" by Nicholas Frosst and Geoffrey Hinton, 2017.
shreyansh26/Speculative-Sampling
Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by DeepMind
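Both of these repos implement the same accept/reject rule: a cheap draft model proposes a token x from its distribution q, the target model accepts it with probability min(1, p(x)/q(x)), and on rejection resamples from the normalized residual max(0, p − q), which keeps the output distributed exactly according to the target p. A minimal single-token sketch with toy distributions (not either repo's API):

```python
import random

def speculative_step(p, q, rng):
    """One accept/reject step of speculative sampling.

    p: target-model distribution over tokens (dict token -> prob)
    q: draft-model distribution over the same tokens
    Returns a token distributed exactly according to p.
    """
    tokens = list(q)
    # Draft model proposes a token from q.
    x = rng.choices(tokens, weights=[q[t] for t in tokens])[0]
    # Target model accepts with probability min(1, p(x) / q(x)).
    if rng.random() < min(1.0, p[x] / q[x]):
        return x
    # On rejection, resample from the residual max(0, p - q), renormalized.
    residual = {t: max(0.0, p[t] - q[t]) for t in tokens}
    z = sum(residual.values())
    return rng.choices(tokens, weights=[residual[t] / z for t in tokens])[0]
```

The speedup comes from the draft model proposing several tokens per target-model forward pass; this sketch shows only why a single accepted or resampled token is still an exact sample from p.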