lushanfu's Stars
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
lllyasviel/ControlNet
Let us control diffusion models!
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
qianqianwang68/omnimotion
CompVis/stable-diffusion
A latent text-to-image diffusion model
IDEA-Research/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
MrGiovanni/AbdomenAtlas
[NeurIPS 2023] AbdomenAtlas 1.0 (5,195 CT volumes + 9 annotated classes)
OpenMotionLab/MotionGPT
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
xuzhiqin1990/understanding_dl
A lecture note for understanding deep learning
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
nerdyrodent/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Sierkinhane/ICCV2023-Diffusion-Papers
ICCV2023-Diffusion-Papers
microsoft/GLIP
Grounded Language-Image Pre-training
zhaoxin94/awesome-domain-adaptation
A collection of AWESOME things about domian adaptation
adobe-research/EntitySeg-Dataset
Adobe-EntitySeg dataset
qqlu/Entity
EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
XiangchenYin/PE-YOLO
Immortalise/SearchAnything
A semantic local search engine powered by AI models.
eliahuhorwitz/Academic-project-page-template
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
OpenInterpreter/open-interpreter
A natural language interface for computers
wasserth/TotalSegmentator
Tool for robust segmentation of >100 important anatomical structures in CT and MR images
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
yuta-hi/pycuda_drr
Digitally recconstructed radiograph
researchmm/MM-Diffusion
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
LC1332/Luotuo-Chinese-LLM
骆驼(Luotuo): Open Sourced Chinese Language Models. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技
qiuyu96/CoDeF
[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
mbzuai-oryx/XrayGPT
[BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.
FreedomIntelligence/Medical_NLP
Medical NLP Competition, dataset, large models, paper
aleju/imgaug
Image augmentation for machine learning experiments.