1349949's Stars
xtekky/gpt4free
The official gpt4free repository | a collection of various powerful language models
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
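For reference, a minimal sketch of prompted inference with SAM, assuming the `segment_anything` package is installed and the ViT-H checkpoint (`sam_vit_h_4b8939.pth`) has been downloaded via the repo's links; the random image and the point prompt below are stand-in values, not part of the repo:

```python
# Minimal SAM prompted-inference sketch (assumes segment-anything is
# installed and sam_vit_h_4b8939.pth was downloaded from the repo).
import numpy as np
from segment_anything import SamPredictor, sam_model_registry

sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

# set_image expects an HWC uint8 RGB array; a random image stands in
# for a real photo here.
image = np.random.randint(0, 255, (480, 640, 3), dtype=np.uint8)
predictor.set_image(image)

# Prompt with one foreground point (label 1); with multimask_output=True
# SAM returns three candidate masks plus a quality score for each.
masks, scores, _ = predictor.predict(
    point_coords=np.array([[320, 240]]),
    point_labels=np.array([1]),
    multimask_output=True,
)
print(masks.shape, scores)  # (3, 480, 640) and three scores
```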
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA), built toward GPT-4V-level capabilities and beyond.
LiLittleCat/awesome-free-chatgpt
🆓 List of free ChatGPT mirror sites, continuously updated.
Stability-AI/StableLM
StableLM: Stability AI Language Models
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment, and Generate Anything
haoel/haoel.github.io
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources for LLMs (LLM tree, examples, papers)
deep-floyd/IF
NVIDIA/FasterTransformer
Transformer-related optimizations, including BERT and GPT
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
OpenDriveLab/UniAD
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multimodal models for robotics/RL, including papers, code, and related websites
OpenGVLab/Ask-Anything
[CVPR 2024 Highlight] [VideoChatGPT] ChatGPT with video understanding! Also supports many other LMs, such as miniGPT4, StableLM, and MOSS.
xinyu1205/recognize-anything
Strong, open-source foundation models for image recognition.
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
baaivision/Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
lamini-ai/lamini
The Official Python Client for Lamini's API
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
chenking2020/FindTheChatGPTer
ChatGPT's explosive popularity marks a key step toward AGI. This project curates open-source alternatives to ChatGPT, including text-only and multimodal large models, as a convenient reference.
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
lyuchenyang/Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
open-mmlab/Multimodal-GPT
Multimodal-GPT
microsoft/MM-REACT
Official repo for MM-REACT
exiawsh/StreamPETR
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
facebookresearch/SWAG
Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.
google-research/slot-attention-video
mit-han-lab/flatformer
[CVPR'23] FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer