Arking1995

Purdue University

Arking1995's Stars

voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
Language:Python8.1k 55 1.5k543
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python6.1k 45 80537
open-mmlab/mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
Language:Python5.6k 55 1.5k1.2k
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance
Language:Python5.6k 50 559438
VAST-AI-Research/TripoSR
Language:Python4.4k 48 99505
nerfies/nerfies.github.io
Language:JavaScript2.4k 37 5843
google-research/kubric
A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.
Language:Jupyter Notebook2.3k 41 187226
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
2k 114 3626
Maks-s/sd-akashic
A compendium of information regarding Stable Diffusion (SD)
1.6k 37 788
lxtGH/OMG-Seg
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
Language:Python1.2k 23 4747
BAAI-DCAI/Bunny
A family of lightweight multimodal models.
Language:Python889 19 11766
allenai/objaverse-xl
🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!
Language:Python726 10 4943
facebookresearch/omni3d
Code release for "Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild"
Language:Python710 22 5265
hzxie/CityDreamer
The official implementation of "CityDreamer: Compositional Generative Model of Unbounded 3D Cities". (Xie et al., CVPR 2024)
Language:Python593 26 2539
google-research-datasets/conceptual-captions
Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image captioning systems.
Language:Shell513 18 1926
CSAILVision/ADE20K
ADE20K Dataset
Language:Jupyter Notebook316 24 4455
OSU-NLP-Group/MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Language:Python299 5 1814
bcmi/libcom
Image composition toolbox: everything you want to know about image composition or object insertion
Language:Python279 12 3518
mertyg/vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
Language:Python238 8 3715
Karine-Huang/T2I-CompBench
[NeurIPS 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
Language:Python189 2 216
CVMI-Lab/SyntheticData
Is synthetic data from generative models ready for image recognition?
Language:Python174 13 96
facebookresearch/unibench
Python Library to evaluate VLM models' robustness across diverse benchmarks
Language:Jupyter Notebook163 6 48
HaozheZhao/UltraEdit
Language:Python153 3 188
happy-fish-01/National_interest_waiver_waittime
USCIS Employment-based-2 national interest waiver wait time
76 24 47
ChenyanWu/MEBOW
Code for "MEBOW: Monocular Estimation of Body Orientation In the Wild", CVPR 2020
Language:Python58 4 1111
wufeim/DST3D
Official implementation of "Generating images with 3D annotations using diffusion models".
Language:Python58 12 110
arielnlee/LLaVA-1.6-ft
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python31 2 03
Lizw14/Super-CLEVR
Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"
Language:Python20 3 11
allenai/object-edit
Language:Python18 3 40
wufeim/imagenet3d
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
Language:Python13 2 01
