feymanpriv

BUPTBeijing

feymanpriv's Stars

PlexPt/awesome-chatgpt-prompts-zh
ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。
51.3k 356 9313.5k
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook45.2k 301 6555.3k
chenfei-wu/TaskMatrix
Language:Python34.5k 305 3513.3k
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.1k 219 4502.9k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python19.1k 297 1.3k2.4k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python17.7k 157 1.4k1.9k
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
16.9k 282 2022.5k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook14k 113 3701.3k
togethercomputer/OpenChatKit
Language:Python9k 121 981k
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language:Jupyter Notebook4.4k 34 188587
futantan/OpenGpt
Create your own ChatGPT App in seconds.
Language:TypeScript4k 34 49391
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language:Python3.5k 47 170268
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language:Python3.5k 100 159240
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language:Python2.1k 31 150148
visual-openllm/visual-openllm
something like visual-chatgpt, 文心一言的开源版
Language:Python1.2k 25 43162
lucidrains/flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
Language:Python1.2k 21 1360
unit-mesh/unit-minions
《AI 研发提效：自己动手训练 LoRA》，包含 Llama （Alpaca LoRA）模型、ChatGLM （ChatGLM Tuning）相关 Lora 的训练。训练内容：用户故事生成、测试代码生成、代码辅助生成、文本转 SQL、文本生成代码……
Language:Jupyter Notebook1k 20 12112
tianrun-chen/SAM-Adapter-PyTorch
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
Language:Python838 10 7475
showlab/Image2Paragraph
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
Language:Python767 11 2851
Confusezius/Deep-Metric-Learning-Baselines
PyTorch Implementation for Deep Metric Learning Pipelines
Language:Python573 17 2493
showlab/VLog
Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
Language:Python503 6 1021
OpenGVLab/UniFormerV2
[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
Language:Python274 7 7515
whwu95/Cap4Video
【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Language:Python216 9 2816
deepglint/unicom
[ICLR 2023] Unicom: Universal and Compact Representation Learning for Image Retrieval
Language:Python204 8 2115
RupertLuo/Valley
The official repository of "Video assistant towards large language model makes everything easy"
Language:Python182 4 3413
xyzforever/BEVT
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
Language:Python153 7 1018
whwu95/BIKE
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
Language:Python151 12 2017
lucidrains/MaMMUT-pytorch
Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch
Language:Python94 4 24
daniel-code/TubeViT
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
Language:Python76 10 138
satojkovic/DeepLogo2
A brand logo detection system by DETR
Language:Python51 3 811

feymanpriv

feymanpriv's Stars

PlexPt/awesome-chatgpt-prompts-zh

facebookresearch/segment-anything

chenfei-wu/TaskMatrix

Vision-CAIR/MiniGPT-4

microsoft/unilm

haotian-liu/LLaVA

amusi/CVPR2024-Papers-with-Code

IDEA-Research/Grounded-Segment-Anything

togethercomputer/OpenChatKit

salesforce/BLIP

futantan/OpenGpt

mlfoundations/open_flamingo

Luodian/Otter

baaivision/EVA

visual-openllm/visual-openllm

lucidrains/flamingo-pytorch

unit-mesh/unit-minions

tianrun-chen/SAM-Adapter-PyTorch

showlab/Image2Paragraph

Confusezius/Deep-Metric-Learning-Baselines

showlab/VLog

OpenGVLab/UniFormerV2

whwu95/Cap4Video

deepglint/unicom

RupertLuo/Valley

xyzforever/BEVT

whwu95/BIKE

lucidrains/MaMMUT-pytorch

daniel-code/TubeViT

satojkovic/DeepLogo2