hongsunjang

🥨👨‍💻👩‍💻☕

@AIS_SNU, SNU ECE Seoul, Republic of Korea

hongsunjang's Stars

karpathy/llama2.c
Inference Llama 2 in one file of pure C
Language:C17.9k 191 2242.2k
pybind/pybind11
Seamless operability between C++11 and Python
Language:C++16k 249 2.1k2.1k
NVIDIA/open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
Language:C15.4k 179 3691.3k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook14k 99 181.1k
google-research/arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Language:Python5.5k 34 56336
ufrisk/pcileech
Direct Memory Access (DMA) Attack Software
Language:C5.2k 152 286761
intel/pcm
Intel® Performance Counter Monitor (Intel® PCM)
Language:C++2.9k 91 306480
Xilinx/PYNQ
Python Productivity for ZYNQ
Language:Jupyter Notebook2k 134 455821
hao-ai-lab/LookaheadDecoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Language:Python1.2k 11 5871
ufrisk/pcileech-fpga
FPGA modules used together with the PCILeech Direct Memory Access (DMA) Attack Software
Language:Verilog1k 47 173226
NVIDIA/gdrcopy
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
Language:C++923 55 193147
Xilinx/Vitis_Libraries
Vitis Libraries
Language:C++922 61 189359
KastnerRG/pp4fpgas
Parallel Programming for FPGAs -- An open-source high-level synthesis book
Language:TeX808 56 18150
hpcaitech/FastFold
Optimizing AlphaFold Training and Inference on GPU Clusters
Language:Python581 17 7985
SHI-Labs/NATTEN
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
Language:Cuda392 11 12231
SqueezeAILab/KVQuant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Language:Python325 12 1729
NVIDIA/gds-nvidia-fs
NVIDIA GPUDirect Storage Driver
Language:C215 14 3333
rapidstream-org/rapidstream-tapa
RapidStream TAPA compiles task-parallel HLS program into high-frequency FPGA accelerators.
Language:C++163 9 17734
itsnamgyu/block-transformer
Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)
Language:Python148 5 57
ZaidQureshi/bam
Language:Cuda143 11 4037
ogiroux/freestanding
Language:C++68 8 314
casys-kaist/NeuPIMs
NeuPIMs Simulator
Language:Jupyter Notebook65 1 519
SNU-ARC/Ginex
Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching
Language:Python36 2 78
template-hls/template-hls-float
Language:C++27 2 12
jaewonalive/PeerAiD
Language:Python21 1 11
UCLA-VAST/Serpens
Serpens is an HBM FPGA accelerator for SpMV
Language:Tcl17 2 03
svn2github/pagecache-management
This is a clone of an SVN repository at http://pagecache-mangagement.googlecode.com/svn/trunk. It had been cloned by http://svn2github.com/ , but the service was since closed. Please read a closing note on my blog post: http://piotr.gabryjeluk.pl/blog:closing-svn2github . If you want to continue synchronizing this repo, look at https://github.com/gabrys/svn2github
Language:C11 3 15
gem5-hpca-2024/gem5
Language:C++10 0 05
ogiroux/libcxx
Mirror of official libcxx git repository located at http://llvm.org/git/libcxx. Updated every five minutes.
Language:C++1 1 03
tjruwase/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
Language:Python1 1 00

hongsunjang

hongsunjang's Stars

karpathy/llama2.c

pybind/pybind11

NVIDIA/open-gpu-kernel-modules

naklecha/llama3-from-scratch

google-research/arxiv-latex-cleaner

ufrisk/pcileech

intel/pcm

Xilinx/PYNQ

hao-ai-lab/LookaheadDecoding

ufrisk/pcileech-fpga

NVIDIA/gdrcopy

Xilinx/Vitis_Libraries

KastnerRG/pp4fpgas

hpcaitech/FastFold

SHI-Labs/NATTEN

SqueezeAILab/KVQuant

NVIDIA/gds-nvidia-fs

rapidstream-org/rapidstream-tapa

itsnamgyu/block-transformer

ZaidQureshi/bam

ogiroux/freestanding

casys-kaist/NeuPIMs

SNU-ARC/Ginex

template-hls/template-hls-float

jaewonalive/PeerAiD

UCLA-VAST/Serpens

svn2github/pagecache-management

gem5-hpca-2024/gem5

ogiroux/libcxx

tjruwase/transformers