yakunpku

SenseTime, Beijing, China

yakunpku's Stars

jimmycv07/DiffIR2VR-Zero
Language:Python11512
nianticlabs/mickey
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
Language:Python47328
Bing-su/adetailer
Auto detecting, masking and inpainting with detection model.
Language: Python, 4.2k stars, 327 forks
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
Language: Python, 7.5k stars, 709 forks
Nightmare-n/UniPAD
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024)
Language:Python1737
SalesforceAIResearch/DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
Language:Python26723
large-ocr-model/large-ocr-model.github.io
Language:Python1565
lyuwenyu/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Language: Python, 2.6k stars, 303 forks
sanweiliti/RoHM
The official PyTorch code for RoHM: Robust Human Motion Reconstruction via Diffusion.
Language:Python32817
ggerganov/llama.cpp
LLM inference in C/C++
Language: C++, 67.9k stars, 9.7k forks
KwaiVGI/I2V-Adapter
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
Language:Python18510
hellock/icrawler
A multi-thread crawler framework with many builtin image crawlers provided.
Language:Python855174
Imageomics/bioclip
This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].
Language:Python16414
facebookresearch/PlatoNeRF
PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar
Language:Python789
hiDaDeng/cntext
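A text analysis package supporting a range of text analysis methods, including word counting, readability, document similarity, and sentiment analysis. Chinese text sentiment analysis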
Language:Python27528
ICTMCG/Make-Your-Anchor
[CVPR 2024] Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.
Language:Python31820
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language: Python, 4.7k stars, 598 forks
PRIS-CV/DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
Language: Jupyter Notebook, 2k stars, 229 forks
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Language: Python, 2.5k stars, 263 forks
NVlabs/instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
Language: Cuda, 16k stars, 1.9k forks
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python, 9.5k stars, 1.3k forks
whai362/PVT
Official implementation of PVT series
Language: Python, 1.7k stars, 245 forks
FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Language: Python, 1.1k stars, 85 forks
PeizeSun/SparseR-CNN
[CVPR2021, PAMI2023] End-to-End Object Detection with Learnable Proposal
Language: Python, 1.3k stars, 187 forks
ShoufaChen/DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
Language: Python, 2.1k stars, 162 forks
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language: Python, 1.3k stars, 56 forks
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language: Jupyter Notebook, 5.3k stars, 336 forks
damian0815/compel
A prompting enhancement library for transformers-type text embedding systems
Language:Jupyter Notebook52547
seatgeek/thefuzz
Fuzzy String Matching in Python
Language: Python, 2.9k stars, 138 forks
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language: Python, 12.6k stars, 886 forks
