chenbong's Stars
ggerganov/llama.cpp
LLM inference in C/C++
xai-org/grok-1
Grok open release
psf/black
The uncompromising Python code formatter
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
flameshot-org/flameshot
Powerful yet simple to use screenshot software 🖥️ 📸
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
richards199999/Thinking-Claude
Let your Claude think
d2phap/ImageGlass
🏞 A lightweight, versatile image viewer
xiaoyaDev/xiaoya-alist
Companion tools and resources for Xiaoya Alist
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Calcium-Ion/new-api
An AI model API management and distribution system: converts many large models to OpenAI-compatible API calls, supports Midjourney Proxy, Suno, and Rerank, is compatible with the 易支付 payment protocol, and can be used by individuals or enterprises for internal management and channel distribution; built on top of One API. 🍥 The next-generation LLM gateway and AI asset management system, with multi-language support.
jurplel/qView
Practical and minimal image viewer
HuangJunJie2017/BEVDet
Code base of the BEVDet series.
ZhangGe6/onnx-modifier
A tool to modify ONNX models visually, based on Netron and Flask.
NexaAI/Awesome-LLMs-on-device
Awesome LLMs on Device: A Comprehensive Survey
OpenGVLab/OmniQuant
[ICLR 2024 Spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
onnx/optimizer
Actively maintained ONNX Optimizer
mit-han-lab/qserve
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
ModelTC/llmc
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
HFAiLab/hai-platform
A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute
facebookresearch/LLM-QAT
Code repo for the paper "LLM-QAT: Data-Free Quantization Aware Training for Large Language Models"
jy-yuan/KIVI
[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
OpenGVLab/EfficientQAT
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
nbasyl/LLM-FP4
The official implementation of the EMNLP 2023 paper LLM-FP4
tsingmicro-toolchain/OnnxSlim
A toolkit to help optimize large ONNX models
gmalivenko/onnx-opcounter
Count the number of parameters / MACs / FLOPs for ONNX models.
BestAnHongjun/LMDeploy-Jetson
Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function independently without continuous internet access.
TCLResearchEurope/torch-dag
Pruning vision models in PyTorch
AbelLin1214/FastMarriageBooker
Automatically books appointment slots at civil affairs bureau marriage registration offices; limited to the Guangdong Provincial Civil Affairs Bureau
richardliu11/marry_robber
Initial commit