Pinned Repositories
Distributed-training
Distributed_training
examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
HF-TP-inference
PyTorch Native Tensor Parallel for HuggingFace models inference
large-model-inference
LLM_fine_tuning
ML-benchmarks
mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
serve
Serve, optimize and scale PyTorch models in production
HamidShojanazeri's Repositories
HamidShojanazeri/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
HamidShojanazeri/Distributed-training
HamidShojanazeri/EAGLE
EAGLE: Lossless Acceleration of LLM Decoding by Feature Extrapolation
HamidShojanazeri/H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
HamidShojanazeri/HF-TP-inference
PyTorch Native Tensor Parallel for HuggingFace models inference
HamidShojanazeri/LLM-deployment
HamidShojanazeri/llm.c
LLM training in simple, raw C/CUDA
HamidShojanazeri/LLM_fine_tuning
HamidShojanazeri/alpaca-lora
Instruct-tune LLaMA on consumer hardware
HamidShojanazeri/CS344_2021
Udacity CS344 Introduction to Parallell Programming (https://classroom.udacity.com/courses/cs344), with assignments/materials updated to build in 2021
HamidShojanazeri/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
HamidShojanazeri/fsdp_llm
FSDP optimizations for LLM training
HamidShojanazeri/LLaMA-Adapter
Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
HamidShojanazeri/llama-recipes-updates
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
HamidShojanazeri/llama-resources
HamidShojanazeri/LLM_finetuning_playground
HamidShojanazeri/LookaheadDecoding
HamidShojanazeri/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
HamidShojanazeri/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
HamidShojanazeri/notes
HamidShojanazeri/project_hera
decorate your room with your voice (generative AI)
HamidShojanazeri/resource-stream
CUDA related news and material links
HamidShojanazeri/stablediffusion2
High-Resolution Image Synthesis with Latent Diffusion Models
HamidShojanazeri/streaming-llm
Efficient Streaming Language Models with Attention Sinks
HamidShojanazeri/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
HamidShojanazeri/ToolBench
An open platform for training, serving, and evaluating large language model for tool learning.
HamidShojanazeri/transformer_framework
framework for plug and play of various transformers (vision and nlp) with FSDP
HamidShojanazeri/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
HamidShojanazeri/triton
Development repository for the Triton language and compiler
HamidShojanazeri/yarn
YaRN: Efficient Context Window Extension of Large Language Models