Xeaver's Stars
mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
junshutang/Make-It-3D
[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Jiahao000/MFM
[ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
team-openpm/openpm
danielgross/LlamaAcademy
A school for camelids
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
ttengwang/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
mvlchallenge/mvl_toolkit
Official toolkit for Multi-View Layout Estimation Challenge in OmniCV workshop at CVPR'23.
showlab/Image2Paragraph
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
zixian2021/AI-interview-cards
最完整的AI算法面试题目仓库,1000道,25个类目
cvlab-columbia/viper
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
dddrrreee/cs140e-20win
cs140e course materials.
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
dmarx/bench-warmers
DigThatData's Public Brainstorming space
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
togethercomputer/OpenChatKit
eriklindernoren/ML-From-Scratch
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
OpenGVLab/STM-Evaluation
kakaobrain/coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
noameshed/novelty-detection
Analyzing basic network responses to novel classes
hirl-team/HIRL
HIRL: A General Framework for Hierarchical Image Representation Learning (http://arxiv.org/abs/2205.13159)
zhangyongshun/BagofTricks-LT
A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results
xulianuwa/MCTformer
Code for CVPR 2022 paper "Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation"