hustzxd
PhD, Institute of Computing Technology (ICT), University of Chinese Academy of Sciences (UCAS).
AMD · Beijing
hustzxd's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
lizongying/my-tv
My TV: live-TV streaming software; ready to use right after installation
LlamaFamily/Llama-Chinese
Llama Chinese community. The Llama 3 online demo and fine-tuned models are now available, with a continuously updated collection of the latest Llama 3 learning resources; all code has been updated for Llama 3. Building the best Chinese Llama LLM, fully open source and commercially usable.
triton-lang/triton
Development repository for the Triton language and compiler
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
microsoft/DeepSpeedExamples
Example models using DeepSpeed
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
mosaicml/composer
Supercharge Your Model Training
ahmetbersoz/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
ashishpatel26/LLM-Finetuning
LLM fine-tuning with PEFT
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
RahulSChand/gpu_poor
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
pytorch/PiPPy
Pipeline Parallelism for PyTorch
locuslab/wanda
A simple and effective LLM pruning approach.
xuhangc/ChatGPT-Academic-Prompt
Use ChatGPT for academic writing
FMInference/DejaVu
llm-efficiency-challenge/neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1 GPU + 1 Day
metebalci/pdftitle
A utility to extract the title from a PDF file
IST-DASLab/SparseFinetuning
Repository for sparse fine-tuning of LLMs via a modified version of the MosaicML llm-foundry
CASIA-IVA-Lab/FLAP
[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models
Raincleared-Song/sparse_gpu_operator
GPU operators for sparse tensor operations
DS3Lab/Decentralized_FM_alpha
rhhc/EfficientPaperList
Papers on pruning, quantization, and efficient inference/training.
hustzxd/PaperListTemplate
A template that makes it easy to manage paper lists.