Jevin754

Ph.D , Researcher @ ARC Lab, Tencent

TencentShenzhen, Guangdong

Jevin754's Stars

Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python39.2k 446 3135.1k
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.4k 218 4652.9k
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Language:Python13.9k 126 3142.1k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
12.7k 274 121812
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language:Python8.4k 36 2971k
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language:Python4.6k 36 336473
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
Language:Jupyter Notebook3.7k 97 28395
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Language:Python2.7k 45 145186
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
Language:Python1.6k 13 141199
ai-vip/stable-diffusion-tutorial
全网最全Stable Diffusion全套教程，从入门到进阶，耗时三个月制作
1.4k 11 2126
JetBrains/projector-server
Server-side library for running Swing applications remotely
Language:Kotlin1.2k 46 0123
OpenBMB/VisCPM
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
Language:Python1.1k 15 4194
JosephKJ/OWOD
(CVPR 2021 Oral) Open World Object Detection
Language:Python1k 23 136155
microsoft/SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
Language:Python929 23 4288
peterljq/OpenMMD
OpenMMD is an OpenPose-based application that can convert real-person videos to the motion files (.vmd) which directly implement the 3D model (e.g. Miku, Anmicius) animated movies.
Language:C++895 44 47184
microsoft/MeshTransformer
Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"
Language:Python610 17 8695
subhadarship/kmeans_pytorch
kmeans using PyTorch
Language:Jupyter Notebook488 7 3780
visonpon/human-motion-capture
collect papers about human motion capture
425 29 534
google/aistplusplus_api
API to support AIST++ Dataset: https://google.github.io/aistplusplus_dataset
Language:Python357 12 3369
fh2019ustc/DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
Language:Python354 17 3149
edvakf/MMD.js
MikuMikuDance on WebGL
Language:CoffeeScript229 31 639
ronething/xiudong-selenium
Implement showstart order service based on python with selenium and flask(基于 selenium 和 flask 实现的秀动辅助)
Language:Python196 3 6731
TencentARC/ArcNerf
Nerf and extensions in all
Language:Jupyter Notebook106 5 37
guillefix/transflower-lightning
multimodal transformer
Language:Python74 6 79
zhigangjiang/WebGLMMD
Beautiful Web MMD Player
Language:JavaScript63 4 015
ustc-slr/DilatedSLR
PyTorch reimplementation of DilatedSLR (IJCAI'18) for continuous sign language recognition.
Language:Python43 4 412
yuzhenbo/pose2carton
Educational API for 3D Vision using pose to control carton.
Language:Python43 3 135
godzillalla/Dance-Synthesis-Project
Language:Python18 2 33
WaterTian/mmdAvatar
MMD Avatar
Language:JavaScript17 3 06
marunrun/dm-ticket
大麦网自动购票, 支持docker一键部署。https://t.me/+2EELgNTYiMYxMTFl
Language:Rust11 0 06

Jevin754

Jevin754's Stars

Stability-AI/stablediffusion

Vision-CAIR/MiniGPT-4

microsoft/Swin-Transformer

BradyFU/Awesome-Multimodal-Large-Language-Models

lucidrains/denoising-diffusion-pytorch

OFA-Sys/Chinese-CLIP

huggingface/diffusion-models-class

ToTheBeginning/PuLID

salesforce/ALBEF

ai-vip/stable-diffusion-tutorial

JetBrains/projector-server

OpenBMB/VisCPM

JosephKJ/OWOD

microsoft/SimMIM

peterljq/OpenMMD

microsoft/MeshTransformer

subhadarship/kmeans_pytorch

visonpon/human-motion-capture

google/aistplusplus_api

fh2019ustc/DocTr

edvakf/MMD.js

ronething/xiudong-selenium

TencentARC/ArcNerf

guillefix/transflower-lightning

zhigangjiang/WebGLMMD

ustc-slr/DilatedSLR

yuzhenbo/pose2carton

godzillalla/Dance-Synthesis-Project

WaterTian/mmdAvatar

marunrun/dm-ticket