feymanpriv's Stars
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models + local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
LargeWorldModel/LWM
baichuan-inc/Baichuan-7B
A large-scale 7B pretrained language model developed by Baichuan Inc.
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMA2, Qwen, GLM, Claude, etc.) over 100+ datasets.
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
LinkSoul-AI/Chinese-Llama-2-7b
The first downloadable and runnable Chinese LLaMA2 model in the open-source community!
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
NUS-HPC-AI-Lab/OpenDiT
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
shikras/shikra
PKU-YuanGroup/Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
SkunkworksAI/BakLLaVA
LLaVA-VL/LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
allenai/unified-io-2
csuhan/OneLLM
[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language
iejMac/video2dataset
Easily create large video datasets from video URLs
LinkSoul-AI/Chinese-LLaVA
An open-source, commercially usable multimodal model supporting bilingual (Chinese-English) visual-text dialogue.
facebookresearch/mae_st
Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"
JialianW/GRiT
GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)
llava-rlhf/LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
SALT-NLP/LLaVAR
Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"
RenShuhuai-Andy/TimeChat
[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
YuchenLiu98/COMM
PyTorch code for the paper "From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models"
flaribbit/imgfind
A tool for searching local images by text description, powered by Rust + candle + CLIP
facebookresearch/vsc2022
Code for the Video Similarity Challenge.