Xeaver

Xeaver's Stars

mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
Language:Python62249
junshutang/Make-It-3D
[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Language:Python1.7k115
Jiahao000/MFM
[ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training
Language:Python621
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language:Python3.5k241
team-openpm/openpm
Language:TypeScript1392
danielgross/LlamaAcademy
A school for camelids
Language:Python1.2k80
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language:Jupyter Notebook89.9k14.2k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python18.4k2k
ttengwang/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
Language:Python1.6k102
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook8.5k719
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.2k2.9k
mvlchallenge/mvl_toolkit
Official toolkit for Multi-View Layout Estimation Challenge in OmniCV workshop at CVPR'23.
Language:Python153
showlab/Image2Paragraph
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
Language:Python77251
zixian2021/AI-interview-cards
最完整的AI算法面试题目仓库，1000道，25个类目
99686
cvlab-columbia/viper
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
Language:Jupyter Notebook1.6k117
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
Language:Python1k92
dddrrreee/cs140e-20win
cs140e course materials.
Language:C1k120
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language:Python3.6k273
dmarx/bench-warmers
DigThatData's Public Brainstorming space
Language:Python575
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language:Python2.1k154
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language:Jupyter Notebook18.4k2.2k
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python35.2k5.4k
togethercomputer/OpenChatKit
Language:Python9k1k
eriklindernoren/ML-From-Scratch
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
Language:Python23.6k4.6k
OpenGVLab/STM-Evaluation
Language:Python696
kakaobrain/coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
Language:Python1.1k35
noameshed/novelty-detection
Analyzing basic network responses to novel classes
Language:Python3713
hirl-team/HIRL
HIRL: A General Framework for Hierarchical Image Representation Learning (http://arxiv.org/abs/2205.13159)
Language:Python404
zhangyongshun/BagofTricks-LT
A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results
Language:Python57276
xulianuwa/MCTformer
Code for CVPR 2022 paper "Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation"
Language:Python14615