Pinned Repositories
100-days-of-gpu
mamba-notes
Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)
pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
pytorch-lora
LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch
pytorch-paligemma
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw
pytorch-stable-diffusion
Stable Diffusion implemented from scratch in PyTorch
pytorch-transformer
Attention is all you need implementation
rlhf-ppo
Notes and commented code for RLHF (PPO)
transformer-from-scratch-notes
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
triton-flash-attention
hkproj's Repositories
hkproj/pytorch-transformer
Attention is all you need implementation
hkproj/pytorch-stable-diffusion
Stable Diffusion implemented from scratch in PyTorch
hkproj/pytorch-paligemma
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw
hkproj/100-days-of-gpu
hkproj/pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
hkproj/triton-flash-attention
hkproj/mamba-notes
Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)
hkproj/pytorch-lora
LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch
hkproj/quantization-notes
Notes on quantization in neural networks
hkproj/rlhf-ppo
Notes and commented code for RLHF (PPO)
hkproj/pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model
hkproj/pytorch-llama-notes
Notes about LLaMA 2 model
hkproj/pytorch-ddpm
Implementation of the paper "Denoising Diffusion Probabilistic Models" in PyTorch
hkproj/mistral-src-commented
Reference implementation of Mistral AI 7B v0.1 model.
hkproj/lazy-ml
ML algorithms implementations that are good for learning the underlying principles
hkproj/retrieval-augmented-generation-notes
Slides for "Retrieval Augmented Generation" video
hkproj/mistral-llm-notes
Notes on the Mistral AI model
hkproj/dpo-notes
Notes on Direct Preference Optimization
hkproj/kan-notes
hkproj/bert-from-scratch
BERT explained from scratch
hkproj/python-longnet
Tools and experiments with the LongNet model
hkproj/hanbaobao
Google Chrome extension that helps you learn Chinese, by providing a dictionary and also shows known words.
hkproj/segment-anything-slides
Slides for "Segment Anything" video
hkproj/ml-interpretability-notes
hkproj/hkproj.github.io
hkproj/adversarial_example_generator
A Python library to generate adversarial examples for classification models
hkproj/ai-project
hkproj/mamba
hkproj/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
hkproj/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs