SylvanLiu's Stars
ggerganov/llama.cpp
LLM inference in C/C++
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
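A minimal sketch of the high-level Python LLM API that this description refers to, assuming a recent TensorRT-LLM release; the model name and sampling settings below are illustrative, not taken from the repo.

```python
# Sketch: generate text with TensorRT-LLM's high-level LLM API (assumed interface).
from tensorrt_llm import LLM, SamplingParams

# Building the LLM object compiles/loads a TensorRT engine for the given model.
llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")  # placeholder model id
params = SamplingParams(max_tokens=64, temperature=0.8)

for output in llm.generate(["Explain KV caching in one sentence."], params):
    print(output.outputs[0].text)
```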
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
onnx/models
A collection of pre-trained, state-of-the-art models in the ONNX format
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
yangjianxin1/Firefly
Firefly: a training toolkit for large language models, supporting Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
AutoGPTQ/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
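A minimal sketch of the quantization flow behind that description, assuming AutoGPTQ's AutoGPTQForCausalLM/BaseQuantizeConfig interface; the model id, output path, and the single calibration example are placeholders.

```python
# Sketch: quantize a causal LM to 4 bits with AutoGPTQ (assumed API surface).
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

model_id = "facebook/opt-125m"  # placeholder base model
tokenizer = AutoTokenizer.from_pretrained(model_id)

quantize_config = BaseQuantizeConfig(bits=4, group_size=128, desc_act=False)
model = AutoGPTQForCausalLM.from_pretrained(model_id, quantize_config)

# Calibration data: a short list of tokenized texts (normally a larger sample).
examples = [tokenizer("AutoGPTQ quantizes weights to 4 bits.", return_tensors="pt")]
model.quantize(examples)
model.save_quantized("opt-125m-4bit-gptq")
```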
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
LLaVA-VL/LLaVA-NeXT
straight-tamago/misakaX
iOS/iPadOS 16.0 - 18.0 / 18.1 beta 4. An ultimate customization tool utilizing the bug that makes TrollRestore possible.
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, p-tuning) for easy use, building a fine-tuning platform that helps researchers get started with large models. We welcome open-source enthusiasts to open any meaningful PR on this repo and integrate as many LLM-related technologies as possible.
Kyle-Ye/XcodeLLMEligible
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
HarderThenHarder/transformers_tasks
⭐️ NLP algorithms built on the transformers library, supporting text classification, text generation, information extraction, text matching, RLHF, SFT, etc.
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
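A minimal sketch of AutoAWQ's 4-bit quantization flow, assuming the AutoAWQForCausalLM interface; the model path, output directory, and quant_config values are illustrative.

```python
# Sketch: AWQ 4-bit quantization with AutoAWQ (assumed API surface).
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "facebook/opt-125m"  # placeholder base model
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)

model.quantize(tokenizer, quant_config=quant_config)  # runs AWQ calibration internally
model.save_quantized("opt-125m-awq")
tokenizer.save_pretrained("opt-125m-awq")
```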
GAIR-NLP/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
digital-standard/ThreeDPoseUnityBarracuda
Unity sample of 3D pose estimation using Barracuda
Toyhom/Chinese-medical-dialogue-data
Chinese medical dialogue dataset
NiuTrans/Classical-Modern
A very comprehensive parallel corpus of Classical Chinese (ancient Chinese) and Modern Chinese
kakaobrain/coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
SimpleBerry/LLaMA-O1
Large Reasoning Models
google-research/long-range-arena
Long Range Arena for Benchmarking Efficient Transformers
shuxueslpi/chatGLM-6B-QLoRA
Efficient 4-bit QLoRA fine-tuning of chatGLM-6B/chatGLM2-6B using the peft library, including merging the LoRA model into the base model and 4-bit quantization.
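A minimal QLoRA-style setup sketch using peft + bitsandbytes, not the repo's own training script; the base model name, LoRA hyperparameters, and target modules are assumptions for illustration.

```python
# Sketch: load a base model in 4-bit NF4 and attach LoRA adapters (QLoRA-style).
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "THUDM/chatglm2-6b",  # placeholder base model
    quantization_config=bnb_config,
    trust_remote_code=True,
)
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=64,
    lora_alpha=16,
    target_modules=["query_key_value"],  # assumed module name for ChatGLM-style blocks
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the LoRA adapters are trainable
```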
YuxiXie/MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.