ustcfd's Stars
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embeddings.
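The key claim above (an RNN that can also be trained in parallel like a GPT) can be illustrated with a much-simplified linear recurrence. This is a toy sketch, not RWKV's actual WKV kernel: it just shows that the same outputs can be computed token-by-token with a fixed-size state (RNN mode, cheap inference) or all at once with prefix sums (parallel mode, fast training).

```python
import numpy as np

def recurrent_mode(k, v):
    """Process tokens one at a time, carrying a fixed-size running state."""
    T, d = v.shape
    num = np.zeros(d)   # running weighted sum of values
    den = 0.0           # running sum of weights
    out = np.empty_like(v)
    for t in range(T):
        w = np.exp(k[t])        # toy scalar weight per token
        num = num + w * v[t]
        den = den + w
        out[t] = num / den
    return out

def parallel_mode(k, v):
    """Same outputs via cumulative sums over the whole sequence (a parallel scan)."""
    w = np.exp(k)[:, None]              # (T, 1) token weights
    num = np.cumsum(w * v, axis=0)      # prefix sums of weighted values
    den = np.cumsum(w, axis=0)          # prefix sums of weights
    return num / den

rng = np.random.default_rng(0)
k = rng.normal(size=5)          # toy per-token "keys" (scalars here)
v = rng.normal(size=(5, 3))     # toy per-token values
assert np.allclose(recurrent_mode(k, v), parallel_mode(k, v))
```

The two modes agree exactly; the real model adds time decay, gating, and channel mixing on top of this basic recurrence/scan duality.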
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
deepseek-ai/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
microsoft/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Yuliang-Liu/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
ytongbai/LVM
KimMeen/Time-LLM
[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
ActiveVisionLab/Awesome-LLM-3D
Awesome-LLM-3D: a curated list of resources on multi-modal large language models in the 3D world
stanfordnlp/pyreft
ReFT: Representation Finetuning for Language Models
eric-ai-lab/MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
haoliuhl/ringattention
Transformers with Arbitrarily Large Context
google-deepmind/recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
sail-sg/lorahub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
csuhan/OneLLM
[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language
Leeroo-AI/mergoo
A library for easily merging multiple LLM experts and efficiently training the merged LLM.
thunlp/LLaVA-UHD
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
GraphPKU/PiSSA
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models (NeurIPS 2024 Spotlight)
FusionBrainLab/OmniFusion
OmniFusion — a multimodal model to communicate using text and images
yfeng95/PoseGPT
LingyvKong/OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
YuchenLiu98/COMM
PyTorch code for the paper "From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models"
zamling/PSALM
[ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"
HuggingAGI/HuggingArxiv
taishan1994/Llama3.1-Finetuning
Full-parameter fine-tuning, LoRA fine-tuning, and QLoRA fine-tuning of Llama 3.
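The LoRA variant mentioned above trains only a low-rank update on top of a frozen weight. A minimal NumPy sketch of that core idea (illustrative only, not this repo's code; QLoRA additionally quantizes the frozen base weights, which is beyond this sketch): instead of updating the full weight W of shape (d_out, d_in), train a small pair B (d_out, r) and A (r, d_in) and use W_eff = W + (alpha / r) * B @ A.

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out, r, alpha = 8, 8, 2, 16

W = rng.normal(size=(d_out, d_in))       # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01    # trainable, small random init
B = np.zeros((d_out, r))                 # trainable, zero init

def forward(x, W, A, B):
    # Effective weight is the frozen W plus the scaled low-rank update.
    return x @ (W + (alpha / r) * B @ A).T

x = rng.normal(size=(1, d_in))
# Because B starts at zero, the adapted model is initially identical to the base model.
assert np.allclose(forward(x, W, A, B), x @ W.T)
```

The trainable parameter count is r * (d_in + d_out) = 32 here, versus 64 for the full matrix; at realistic dimensions the savings are far larger.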
uukuguy/multi_loras
Load multiple LoRA modules simultaneously and automatically switch to the appropriate combination of LoRA modules to generate the best answer for each user query.
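The query-dependent switching described above can be sketched in plain Python. Everything here is hypothetical (adapter names and the keyword heuristic are invented for illustration, not multi_loras's actual routing logic): keep several adapters registered and pick one per query.

```python
# Hypothetical adapter registry: maps a task tag to a loaded LoRA adapter name.
ADAPTERS = {
    "code": "lora-code-expert",
    "math": "lora-math-expert",
    "chat": "lora-general-chat",
}

def route(query: str) -> str:
    """Pick a LoRA adapter for a query via a toy keyword heuristic."""
    q = query.lower()
    if any(kw in q for kw in ("def ", "python", "bug", "compile")):
        return ADAPTERS["code"]
    if any(kw in q for kw in ("integral", "prove", "equation")):
        return ADAPTERS["math"]
    return ADAPTERS["chat"]    # default: general-purpose adapter

assert route("Fix this Python bug") == "lora-code-expert"
assert route("Prove the equation holds") == "lora-math-expert"
```

A real router would typically score queries with an embedding model rather than keywords, and could also blend several adapters' weights instead of picking exactly one.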
Ivan-Tang-3D/Any2Point
[ECCV2024] Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding
BAAI-DCAI/DataOptim
A collection of visual instruction tuning datasets.
Suikasxt/PMG
The repository for the paper "Personalized Multimodal Response Generation with Large Language Models"
AdityaNG/DriveLLaVA
Training LLaVA on the CommaVQ dataset to produce tokenized control signals for driving