hmxiong

Dalian University of Technology

Pinned Repositories

CRATE
Code for CRATE (Coding RAte reduction TransformEr).
Language:Python0 0 00
CUDA-Learn-Note
🎉CUDA 笔记 / 大模型手撕CUDA / C++笔记，更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.
Language:Cuda0 0 00
GaLore
Language:Python0 0 00
github-slideshow
A robot powered training repository :robot:
Language:Ruby0 1 10
hallow
i just wanna learn deep learning
0 1 00
llama2.c
Inference Llama 2 in one file of pure C
Language:C0 0 00
OpenMMLabCamp
Language:Python2 1 01
paper-reading
深度学习经典、新论文逐段精读
1 0 00
ScanNet_Vis
Language:Python0 1 00
Transformer-Series
Language:Python7 1 10

hmxiong's Repositories

hmxiong/Transformer-Series
Language:Python7 1 10
hmxiong/OpenMMLabCamp
Language:Python2 1 01
hmxiong/paper-reading
深度学习经典、新论文逐段精读
1 0 00
hmxiong/PathWeave
Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024
Language:Python1
hmxiong/CRATE
Code for CRATE (Coding RAte reduction TransformEr).
Language:Python0 0 00
hmxiong/CUDA-Learn-Note
🎉CUDA 笔记 / 大模型手撕CUDA / C++笔记，更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.
Language:Cuda0 0 00
hmxiong/GaLore
Language:Python0 0 00
hmxiong/github-slideshow
A robot powered training repository :robot:
Language:Ruby0 1 10
hmxiong/hallow
i just wanna learn deep learning
0 1 00
hmxiong/llama2.c
Inference Llama 2 in one file of pure C
Language:C0 0 00
hmxiong/ScanNet_Vis
Language:Python0 1 00
hmxiong/Tarurs
competition files
0 1 00
hmxiong/pytorch-distributed-training
Simple tutorials on Pytorch DDP training
Language:Python0 0
hmxiong/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language:Python0 0
hmxiong/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language:Python0 0

hmxiong

Pinned Repositories

CRATE

CUDA-Learn-Note

GaLore

github-slideshow

hallow

llama2.c

OpenMMLabCamp

paper-reading

ScanNet_Vis

Transformer-Series

hmxiong's Repositories

hmxiong/Transformer-Series

hmxiong/OpenMMLabCamp

hmxiong/paper-reading

hmxiong/PathWeave

hmxiong/CRATE

hmxiong/CUDA-Learn-Note

hmxiong/GaLore

hmxiong/github-slideshow

hmxiong/hallow

hmxiong/llama2.c

hmxiong/ScanNet_Vis

hmxiong/Tarurs

hmxiong/pytorch-distributed-training

hmxiong/RWKV-LM

hmxiong/VILA