hkproj

每天努力

Pinned Repositories

100-days-of-gpu
366 2 034
mamba-notes
Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)
170 2 112
pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
Language:Python350 8 1465
pytorch-lora
LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch
Language:Jupyter Notebook116 2 337
pytorch-paligemma
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw
Language:Python434 5 769
pytorch-stable-diffusion
Stable Diffusion implemented from scratch in PyTorch
Language:Jupyter Notebook961 11 18189
pytorch-transformer
Attention is all you need implementation
Language:Jupyter Notebook1k 14 28358
rlhf-ppo
Notes and commented code for RLHF (PPO)
Language:Python84 2 125
transformer-from-scratch-notes
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
308 3 170
triton-flash-attention
Language:Python199 1 022

hkproj's Repositories

hkproj/pytorch-transformer
Attention is all you need implementation
Language:Jupyter Notebook1k 14 28358
hkproj/pytorch-stable-diffusion
Stable Diffusion implemented from scratch in PyTorch
Language:Jupyter Notebook961 11 18189
hkproj/pytorch-paligemma
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw
Language:Python434 5 769
hkproj/100-days-of-gpu
366 2 034
hkproj/pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
Language:Python350 8 1465
hkproj/triton-flash-attention
Language:Python199 1 022
hkproj/mamba-notes
Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)
170 2 112
hkproj/pytorch-lora
LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch
Language:Jupyter Notebook116 2 337
hkproj/quantization-notes
Notes on quantization in neural networks
Language:Jupyter Notebook99 2 116
hkproj/rlhf-ppo
Notes and commented code for RLHF (PPO)
Language:Python84 2 125
hkproj/pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model
Language:Python83 2 236
hkproj/pytorch-llama-notes
Notes about LLaMA 2 model
Language:Python68 6 16
hkproj/pytorch-ddpm
Implementation of the paper "Denoising Diffusion Probabilistic Models" in PyTorch
Language:Jupyter Notebook64 3 012
hkproj/mistral-src-commented
Reference implementation of Mistral AI 7B v0.1 model.
Language:Python28 1 04
hkproj/lazy-ml
ML algorithms implementations that are good for learning the underlying principles
Language:Jupyter Notebook23 2 03
hkproj/retrieval-augmented-generation-notes
Slides for "Retrieval Augmented Generation" video
Language:Jupyter Notebook21 3 02
hkproj/mistral-llm-notes
Notes on the Mistral AI model
Language:Jupyter Notebook20 2 06
hkproj/dpo-notes
Notes on Direct Preference Optimization
19 1 0
hkproj/kan-notes
18 1 01
hkproj/bert-from-scratch
BERT explained from scratch
13 2 16
hkproj/python-longnet
Tools and experiments with the LongNet model
Language:Jupyter Notebook9 2 02
hkproj/hanbaobao
Google Chrome extension that helps you learn Chinese, by providing a dictionary and also shows known words.
Language:TypeScript8 3 01
hkproj/segment-anything-slides
Slides for "Segment Anything" video
8 3 02
hkproj/ml-interpretability-notes
Language:Jupyter Notebook7 1 02
hkproj/hkproj.github.io
Language:TypeScript4 2 00
hkproj/adversarial_example_generator
A Python library to generate adversarial examples for classification models
Language:Python3 2 0
hkproj/ai-project
Language:Python3 2 0
hkproj/mamba
Language:Python2 1 0
hkproj/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language:Python1 0
hkproj/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0