sjjeong94

I'm not efficient, but I like efficiency.

Seoul, South Korea

sjjeong94's Stars

bayesian-optimization/BayesianOptimization
A Python implementation of global optimization with gaussian processes.
Language:Python7.8k1.5k
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.4k952
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python8.5k606
nicksypark/rope-triton
Language:Python10
AGI-Edgerunners/LLM-Agents-Papers
A repo lists papers related to LLM based agent
Language:Python1k74
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Language:Python5.6k509
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python13.8k1.3k
haoliuhl/ringattention
Transformers with Arbitrarily Large Context
Language:Python62748
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language:Python1.9k313
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Language:Python16.9k1.2k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python28.5k4.2k
jcpeterson/openwebtext
Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed.
Language:Python71179
Lyken17/pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
Language:Python4.9k528
karpathy/ng-video-lecture
Language:Python3.5k911
meta-llama/llama
Inference code for Llama models
Language:Python56k9.5k
rayleizhu/BiFormer
[CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"
Language:Python48739
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language:Python9.1k845
veritross/studiosr
PyTorch library to accelerate super-resolution research
Language:Python111
Zdafeng/SwinFIR
Language:Python734
microsoft/Llama-2-Onnx
Language:Python1k92
XPixelGroup/HAT
CVPR2023 - Activating More Pixels in Image Super-Resolution Transformer Arxiv - HAT: Hybrid Attention Transformer for Image Restoration
Language:Python1.2k149
zhengchen1999/NTIRE2023_ImageSR_x4
Solution of the NTIRE 2023 Challenge on Image Super-Resolution (x4)
Language:Python163
cszn/KAIR
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR
Language:Python2.9k628
JingyunLiang/VRT
VRT: A Video Restoration Transformer (official repository)
Language:Python1.4k130
Tangshitao/MVDiffusion
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion, NeurIPS 2023 (spotlight)
Language:Python48226
LeapLabTHU/DAT
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention
Language:Python77571
karpathy/llama2.c
Inference Llama 2 in one file of pure C
Language:C17.3k2.1k
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C++35k3.6k
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++66.3k9.5k
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Language:Python37k3.2k

sjjeong94

sjjeong94's Stars

bayesian-optimization/BayesianOptimization

NVIDIA/TensorRT-LLM

facebookresearch/xformers

nicksypark/rope-triton

AGI-Edgerunners/LLM-Agents-Papers

pytorch-labs/gpt-fast

Dao-AILab/flash-attention

haoliuhl/ringattention

NVIDIA/TransformerEngine

unslothai/unsloth

vllm-project/vllm

jcpeterson/openwebtext

Lyken17/pytorch-OpCounter

karpathy/ng-video-lecture

meta-llama/llama

rayleizhu/BiFormer

karpathy/minbpe

veritross/studiosr

Zdafeng/SwinFIR

microsoft/Llama-2-Onnx

XPixelGroup/HAT

zhengchen1999/NTIRE2023_ImageSR_x4

cszn/KAIR

JingyunLiang/VRT

Tangshitao/MVDiffusion

LeapLabTHU/DAT

karpathy/llama2.c

ggerganov/whisper.cpp

ggerganov/llama.cpp

LAION-AI/Open-Assistant