liuguoyou

xiaomi

liuguoyou's Stars

QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language: Python 12k 96 1k975
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Language: Python 10.2k 124 197751
HumanAIGC/OutfitAnyone
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
5.2k 209 52399
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Language: Python 3.7k 34 334355
tryolabs/norfair
Lightweight Python library for adding real-time multi-object tracking to any detector.
Language: Python 2.3k 35 159233
QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Language: Python 1.2k 25 5482
open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos.
Language: Python 764 20 3361
Ucas-HaoranWei/Vary-toy
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
Language: Python 550 13 3041
DavidZhangdw/Visual-Tracking-Development
Visual Object Tracking
Language: Python 409 16 251
apple/ml-mobileclip
This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024
Language: Python 398 14 020
oneTaken/Awesome-Denoise
One-paper-one-short-contribution-summary of all latest image/burst/video Denoising papers with code & citation published in top conference and journal.
395 21 055
xinghaochen/TinySAM
Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"
Language: Python 366 12 2422
mindspore-lab/mindone
one for all, Optimal generator with No Exception
Language: Python 325 11 4159
xushilin1/RAP-SAM
Language: Python 197 10 69
zzh-tech/InterpAny-Clearer
Clearer anytime frame interpolation & Manipulated interpolation of anything
Language: Python 167 7 129
XavierCHEN34/LivePhoto
Official implementations for paper: LivePhoto: Real Image Animation with Text-guided Motion Control
158 35 53
aim-uofa/AutoStory
137 22 34
luosiallen/Diff-Foley
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
Language: Python 122 8 2512
mulab-mir/song-describer-dataset
The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.
Language: Jupyter Notebook 119 4 15
liuxubo717/SimPFs
Code for "Simple Pooling Front-ends for Efficient Audio Classification", ICASSP 2023
Language: Python 53 3 01
haoyi-duan/DG-SCT
NeurIPS'2023 official implementation code
Language: Python 52 4 64
xyongLu/SBCFormer
[PyTorch Impl.] SBCFormer: Lightweight Network Capable of Full-size ImageNet Classification at 1 FPS on Single Board Computers (WACV 2024, Official Code)
Language: Python 36 2 13
tomchen-ctj/OST
[CVPR'24] OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition
Language: Python 29 5 20
tany0699/FMViT
28 5 30
Jason-Qiu/MMSum_model
[CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Language: Python 26 1 34
J911/MISO-VFI
Official implementation of "A Multi-In-Single-Out Network for Video Frame Interpolation without Optical Flow"
231
Sosdatasets/SoS_Dataset
13 5 1
SCZwangxiao/RTQ-MM2023
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
Language: Python 11 4 51
shantistewart/Emo-CLIM
Emo-CLIM: Emotion-Aligned Contrastive Learning Between Images and Music [ICASSP 2024]
Language: Python 70
saxenarohit/select_summ
2 3 10
