junesookang's Stars
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
state-spaces/mamba
Mamba SSM architecture
NVIDIA/cuda-samples
Samples for CUDA developers demonstrating features in the CUDA Toolkit
naganandy/graph-based-deep-learning-literature
Links to conference publications in graph-based deep learning
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
togethercomputer/MoA
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
snap-stanford/ogb
Benchmark datasets, data loaders, and evaluators for graph machine learning
horseee/Awesome-Efficient-LLM
A curated list of efficient large language model resources
NVIDIA/gdrcopy
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
AmberLJC/LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
forhaoliu/ringattention
Transformers with Arbitrarily Large Context
microsoft/ptgnn
A PyTorch Graph Neural Network Library
feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
microsoft/mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
NVIDIA/gds-nvidia-fs
NVIDIA GPUDirect Storage Driver
InfiniTensor/InfiniTensor
ZaidQureshi/bam
microsoft/DeepGNN
DeepGNN is a framework for training machine learning models on large scale graph data.
microsoft/ark
A GPU-driven system framework for scalable AI applications
hgyhungry/ge-spmm
snu-comparch/InfiniGen
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
IllinoisGraphBenchmark/IGB-Datasets
Largest real-world open-source graph dataset. Work done under the IBM-Illinois Discovery Accelerator Institute and Amazon Research Awards, in collaboration with NVIDIA Research.
Sys-KU/DeepPlan
Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)
K-Wu/pytorch-direct_dgl
PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture (accepted by PVLDB)
jeongminpark417/GIDS
unist-ssl/IIDP
unist-ssl/JABAS
"JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)
seijimaekawa/empirical-study-of-GNNs
HPMLL/HP-SpMM
Fast SpMM implementation on GPUs for GNN (IPDPS'23)