Inf1delis's Stars
josephmisiti/awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
state-spaces/mamba
Mamba SSM architecture
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
facebookresearch/metaseq
Repo for external large-scale work
google/flax
Flax is a neural network library for JAX that is designed for flexibility.
TimDettmers/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
ahkarami/Deep-Learning-in-Production
In this repository, I will share some useful notes and references about deploying deep learning-based models in production.
faridrashidi/kaggle-solutions
🏅 Collection of Kaggle Solutions and Ideas 🏅
wookayin/gpustat
📊 A simple command-line utility for querying and monitoring GPU status
mosaicml/llm-foundry
LLM training code for Databricks foundation models
bytedance/byteps
A high performance and generic framework for distributed DNN training
facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
NVIDIA/nccl
Optimized primitives for collective multi-GPU communication
turboderp/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
ai-forever/ru-gpts
Russian GPT3 models.
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
yandexdataschool/Practical_DL
DL course co-developed by YSDA, HSE and Skoltech
laekov/fastmoe
A fast MoE impl for PyTorch
microsoft/mup
maximal update parametrization (µP)
AI-Yash/st-chat
Streamlit Component, for a Chatbot UI
mistralai/megablocks-public
alasdairforsythe/tokenmonster
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript
lucidrains/ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
ai-forever/gigachain
⚡ Фреймворк для создания комплексных приложений с LLM ⚡
lucidrains/st-moe-pytorch
Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch