hongsunjang's Stars
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
meta-llama/llama-models
Utilities intended for use with Llama models.
pytorch/torchtune
PyTorch native post-training library
DefTruth/Awesome-LLM-Inference
A curated list of awesome LLM/VLM inference papers with code, such as FlashAttention, PagedAttention, parallelism, etc.
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in HEIM (https://arxiv.org/abs/2311.04287) and vision-language models in VHELM (https://arxiv.org/abs/2410.07112).
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
Must-read papers and blogs on LLM-based long context modeling.
dhm2013724/yolov2_xilinx_fpga
A demo for accelerating YOLOv2 on Xilinx FPGAs (PYNQ/ZedBoard).
NVIDIA/RULER
This repo contains the source code for RULER: What's the Real Context Size of Your Long-Context Language Models?
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
feifeibear/long-context-attention
USP: Unified (a.k.a. hybrid, 2D) sequence-parallel attention for long-context transformer model training and inference
Cornell-RelaxML/QuIP
Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"
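For orientation, a plain round-to-nearest 2-bit quantizer for a weight matrix is sketched below; it only illustrates what a 4-level grid looks like and does not reproduce QuIP's actual contribution (incoherence processing plus adaptive rounding with guarantees).

    # Hedged sketch: symmetric per-row 2-bit quantization, NOT QuIP's method.
    import torch

    def quantize_2bit(w):
        # 4 levels per row: {-1.5, -0.5, 0.5, 1.5} * step
        step = w.abs().amax(dim=1, keepdim=True) / 1.5
        q = torch.clamp(torch.round(w / step - 0.5), -2, 1)   # integers in {-2, -1, 0, 1}
        return (q + 0.5) * step                                # dequantized approximation

    w = torch.randn(4, 8)
    print((w - quantize_2bit(w)).abs().mean())                 # mean quantization error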
Maratyszcza/FP16
Conversion to/from half-precision floating point formats
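A minimal round-trip sketch of the conversion this library implements, shown with NumPy's IEEE half-precision type rather than the library's C API:

    # FP32 <-> FP16 round trip; illustrates the format, not Maratyszcza/FP16's API.
    import numpy as np

    x32 = np.array([3.14159265], dtype=np.float32)
    x16 = x32.astype(np.float16)           # narrow to IEEE half precision
    bits = x16.view(np.uint16)             # raw 16-bit encoding (sign | exponent | mantissa)
    back = x16.astype(np.float32)          # widen back to single precision

    print(hex(bits[0]), x32[0], back[0])   # note the rounding error from the 10-bit mantissa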
spcl/gemm_hls
Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.
UCLA-VAST/AutoSA
AutoSA: Polyhedral-Based Systolic Array Compiler
AminRezaei0x443/memory-efficient-attention
Memory-efficient attention (O(sqrt(n)) memory) for JAX and PyTorch
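A simplified query-chunked attention sketch in plain PyTorch is shown below: it only materializes a (chunk x n) slice of the score matrix at a time. The library additionally chunks keys/values with a logsumexp rescaling trick to reach O(sqrt(n)) memory; the function name here is illustrative, not its API.

    import torch

    def chunked_attention(q, k, v, chunk=256):
        # q, k, v: (seq_len, dim)
        scale = q.shape[-1] ** -0.5
        out = []
        for i in range(0, q.shape[0], chunk):
            scores = q[i:i + chunk] @ k.T * scale              # (chunk, seq_len) at a time
            out.append(torch.softmax(scores, dim=-1) @ v)
        return torch.cat(out, dim=0)

    q = k = v = torch.randn(1024, 64)
    full = torch.softmax(q @ k.T * 64 ** -0.5, dim=-1) @ v     # reference, full score matrix
    assert torch.allclose(chunked_attention(q, k, v), full, atol=1e-5)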
GFNOrg/gfn-lm-tuning
Xilinx/Vitis_Embedded_Platform_Source
microsoft/LongRoPE
LongRoPE is a method that extends the context window of pre-trained LLMs to 2048k tokens.
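A minimal sketch of the general idea behind RoPE-based context extension: rescale rotary position frequencies so longer sequences map into the trained range. This shows plain uniform position interpolation; LongRoPE itself searches non-uniform per-dimension rescale factors, which this sketch does not reproduce.

    import torch

    def rope_angles(seq_len, dim, base=10000.0, scale=1.0):
        # scale > 1 compresses positions so a longer sequence reuses the trained angle range
        inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2).float() / dim))
        pos = torch.arange(seq_len).float() / scale
        return torch.outer(pos, inv_freq)                 # (seq_len, dim/2) rotation angles

    short = rope_angles(4096, 128)                        # original context window
    long = rope_angles(4096 * 16, 128, scale=16.0)        # 16x longer, same angle range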
HLSTransform/submission
linghaosong/Sextans
An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).
cathalmccabe/PYNQ_tutorials
MicrochipTech/fpga-hls-examples
Open-Source HLS Examples for Microchip FPGAs
Sibylau/HLS_designs
Systolic array implementations for Cholesky, LU, and QR decomposition
mesham/pynq_api
C API drivers for the PYNQ FPGA board
twaclaw/matmult
A floating-point matrix multiplication implemented in hardware
PSCLab-ASU/Systolic-CNN
Xilinx/libdfx
tzuj6/Object-detection-accelerator-in-Xilinx-PYNQ-z2
berkekisin/Pytorch_spmm_COO
PyTorch extension library for sparse-dense matrix multiplication in COO format
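A COO sparse-dense matmul example using PyTorch's built-in sparse support, to illustrate the operation such an extension accelerates; this uses torch's own API, not the extension's.

    import torch

    indices = torch.tensor([[0, 1, 2],      # row indices of the non-zeros
                            [2, 0, 1]])     # column indices of the non-zeros
    values = torch.tensor([1.0, 2.0, 3.0])
    A = torch.sparse_coo_tensor(indices, values, size=(3, 3))   # sparse (3, 3)
    B = torch.randn(3, 4)                                       # dense  (3, 4)

    C = torch.sparse.mm(A, B)                                   # dense  (3, 4)
    assert torch.allclose(C, A.to_dense() @ B)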
Edgecortix-Inc/fp_reduce_vitis
Full implementation of an optimized floating-point reduction targeting any board supported in the Vitis flow (Alveo, Zynq MPSoC, etc.)