polarispw's Stars
TheAlgorithms/Python
All Algorithms implemented in Python
chatanywhere/GPT_API_free
Free ChatGPT API key; a free ChatGPT API with GPT-4 support, relayed so it is usable inside China without a proxy. Works with clients/plugins such as ChatBox, greatly reducing API cost. Unlimited chat from within China.
state-spaces/mamba
Mamba SSM architecture
chen08209/FlClash
A multi-platform proxy client based on ClashMeta; simple and easy to use, open source, and ad-free.
liguodongiot/llm-action
This project shares the technical principles behind large language models along with hands-on experience (LLM engineering and real-world application deployment).
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
tickstep/aliyunpan
A command-line client for Aliyun Drive, with support for JavaScript plugins and sync/backup.
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
facebookresearch/MobileLLM
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024).
AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
SafeAILab/EAGLE
Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)
microsoft/MInference
[NeurIPS'24 Spotlight] Speeds up long-context LLM inference by computing attention with approximate, dynamic sparsity, reducing pre-filling latency by up to 10x on an A100 while maintaining accuracy.
locuslab/wanda
A simple and effective LLM pruning approach.
SqueezeAILab/SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
mazhengcn/suggested-notation-for-machine-learning
A suggested mathematical notation protocol for machine learning.
microsoft/TransformerCompression
Code for compression methods for transformers, accompanying the authors' publications.
catid/dora
Implementation of DoRA
Infini-AI-Lab/TriForce
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
HanGuo97/lq-lora
AIoT-MLSys-Lab/SVD-LLM
Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"
hahnyuan/ASVD4LLM
Activation-aware Singular Value Decomposition for Compressing Large Language Models
QC-LY/UniBind
The source code for "UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All"
Dousia/MetricPrompt
Code for KDD 2023 long paper: MetricPrompt: Prompting Model as a Relevance Metric for Few-Shot Text Classification
MichaelYang-lyx/LLM-Code-Benchmark
A benchmark and corresponding evaluation system for LLMs.
Spidy20/AWS-Assistant-RAG-ChatBot
A tutorial on building a GPT-4 AWS helper chatbot using LangChain, Lambda, and API Gateway, with PostgreSQL PGVector hosted on an EC2 instance as the vector database.