Pinned Repositories
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
sd-webui-controlnet
WebUI extension for ControlNet
HyperHuman
[ICLR 2024] Github Repo for "HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion"
CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
controlnet_diffusers_lightning
ControlNetPlus
ControlNet++: All-in-one ControlNet for image generations and editing!
Skin_analysis_megvii
llava-phi
xinsir6's Repositories
xinsir6/ControlNetPlus
ControlNet++: All-in-one ControlNet for image generations and editing!
xinsir6/controlnet_diffusers_lightning
xinsir6/Skin_analysis_megvii