ustcfd's Stars
OpenMOSS/AnyGPT
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
CasualGANPapers/Make-A-Scene
PyTorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
luogen1996/LaVIN
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
NVlabs/RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
DefTruth/Awesome-LLM-Inference
📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
joez17/ChatBridge
ChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without relying on all combinations of paired data.
saidwivedi/TokenHMR
[CVPR 2024] TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation
mingyuan-zhang/LMM
Large Motion Model for Unified Multi-Modal Motion Generation
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
CrazyBoyM/llama3-Chinese-chat
Chinese repository for Llama3 and Llama3.1 (companion book in progress... interesting fine-tuned and modified weights from community members and vendors, plus video and document tutorials on training, inference, evaluation, and deployment)
HyperGAI/HPT
HPT - Open Multimodal LLMs from HyperGAI
treeaaa/fine_tune_llava1.6_copy
TIGER-AI-Lab/Mantis
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with a ChatGPT-style Training Pipeline. Trains medical LLMs, implementing continued pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
magic-research/PLLaVA
Official repository for the paper PLLaVA
TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
linjh1118/Llama3-Chinese-ORPO
A Chinese version of Llama3, obtained from Llama3 via further CPT, SFT, and ORPO
CrazyBoyM/phi3-Chinese
Phi3 Chinese repository
AILab-CVC/SEED-X
Multimodal Models in the Real World
EricGuo5513/momask-codes
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
mbzuai-oryx/LLaVA-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)