polarispw's Stars
AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
state-spaces/mamba
Mamba SSM architecture
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
HanGuo97/lq-lora
nomic-ai/nomic
Interact with, analyze, and structure massive text, image, embedding, audio, and video datasets
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
HuangOwen/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
princeton-nlp/MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
OpenMatch/UniVL-DR
[ICLR 2023] Code repo for the paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval".
horseee/LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
yuchenlin/rebiber
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
brucefan1983/CUDA-Programming
Sample codes for my CUDA programming book
ggerganov/ggml
Tensor library for machine learning
BlinkDL/RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be trained directly like a GPT transformer (parallelizable). Now at RWKV-7 "Goose", it combines the best of RNNs and transformers: great performance, linear time, constant space (no KV cache), fast training, infinite ctx_len, and free sentence embeddings.
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
ggerganov/llama.cpp
LLM inference in C/C++
wangzhaode/mnn-llm
LLM deployment project based on MNN.
QC-LY/Prompt-Tuning-For-Sentiment-Classification
Code for the Internship at NEU-NLP
alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
Tencent/TNN
TNN: a uniform deep learning inference framework for mobile, desktop, and server, developed by Tencent Youtu Lab and Guangying Lab. TNN is distinguished by its cross-platform capability, high performance, model compression, and code pruning. Based on ncnn and Rapidnet, TNN further strengthens support and performance optimization for mobile devices, while drawing on the extensibility and high performance of existing open-source efforts. TNN has been deployed in multiple Tencent apps, such as Mobile QQ, Weishi, and Pitu. Contributions are welcome; collaborate with us to make TNN a better framework.
storage-db/ToolDiy
A guide to tools and ready-to-use configurations, aimed at helping everyone choose and get started with the right tools.
jaywcjlove/reference
Quick-reference cheat sheets for developers
fluctlight001/SampleCPU
jgraph/drawio-desktop
Official electron build of draw.io
NiuTrans/MTBook
Machine Translation: Foundations and Models (《机器翻译:基础与模型》), by Tong Xiao and Jingbo Zhu