KululuMi

KululuMi's Stars

showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
3.1k190
shap/shap
A game theoretic approach to explain the output of any machine learning model.
Language:Jupyter Notebook22.5k3.2k
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python1.2k46
google-research-datasets/richhf-18k
RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with the file name of the associated labeled images (no urls or images are included in this dataset).
962
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.2k1k
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language:Python1.3k124
mohuangrui/ucasthesis
LaTeX Thesis Template for the University of Chinese Academy of Sciences
Language:TeX3.4k925
Technion-Kishony-lab/data-to-paper
data-to-paper: Backward-traceable AI-driven scientific research
Language:Python43940
threestudio-project/threestudio
A unified framework for 3D content generation.
Language:Python6.1k469
jun0wanan/awesome-large-multimodal-agents
29218
lllyasviel/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
Language:Python3.8k326
Yutong-Zhou-cv/Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
2.1k186
openai/weak-to-strong
Language:Python2.5k300
InternLM/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Language:Python2.4k150
Harry24k/adversarial-attacks-pytorch
PyTorch implementation of adversarial attacks [torchattacks]
Language:Python1.8k345
duchengbin8/Stable_Diffusion_is_Unstable
Official implement of paper: Stable Diffusion is Unstable
Language:Python17
web-arena-x/webarena
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
Language:Python678105
j-min/DSG
Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)
Language:Jupyter Notebook745
dongyh20/Octopus
🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.
Language:Python24918
aiwaves-cn/agents
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
Language:Python5.2k408
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python19.3k2.1k
tgxs002/HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Language:Jupyter Notebook36412
j-min/DallEval
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)
Language:Jupyter Notebook1376
tgxs002/align_sd
Better Aligning Text-to-Image Models with Human Preference. ICCV 2023
Language:Python2618
PVIT-official/PVIT
Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models
Language:Python362
Wangt-CN/EqBen
[ICCV'23 Oral] The introduction and toolkit for EqBen Benchmark
Language:Python1251
microsoft/robustlearn
Robust machine learning for responsible AI
Language:Python44955
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
11.7k758
hyp1231/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
1.4k104
showlab/Image2Paragraph
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
Language:Python78353