Jevin754's Stars
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
ai-vip/stable-diffusion-tutorial
全网最全Stable Diffusion全套教程,从入门到进阶,耗时三个月制作
JetBrains/projector-server
Server-side library for running Swing applications remotely
OpenBMB/VisCPM
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
JosephKJ/OWOD
(CVPR 2021 Oral) Open World Object Detection
microsoft/SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
peterljq/OpenMMD
OpenMMD is an OpenPose-based application that can convert real-person videos to the motion files (.vmd) which directly implement the 3D model (e.g. Miku, Anmicius) animated movies.
microsoft/MeshTransformer
Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"
subhadarship/kmeans_pytorch
kmeans using PyTorch
visonpon/human-motion-capture
collect papers about human motion capture
google/aistplusplus_api
API to support AIST++ Dataset: https://google.github.io/aistplusplus_dataset
fh2019ustc/DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
edvakf/MMD.js
MikuMikuDance on WebGL
ronething/xiudong-selenium
Implement showstart order service based on python with selenium and flask(基于 selenium 和 flask 实现的秀动辅助)
TencentARC/ArcNerf
Nerf and extensions in all
guillefix/transflower-lightning
multimodal transformer
zhigangjiang/WebGLMMD
Beautiful Web MMD Player
ustc-slr/DilatedSLR
PyTorch reimplementation of DilatedSLR (IJCAI'18) for continuous sign language recognition.
yuzhenbo/pose2carton
Educational API for 3D Vision using pose to control carton.
godzillalla/Dance-Synthesis-Project
WaterTian/mmdAvatar
MMD Avatar
marunrun/dm-ticket
大麦网自动购票, 支持docker一键部署。https://t.me/+2EELgNTYiMYxMTFl