Pinned Repositories
A-Renpy-Game-The-Only-Easy-Day-Was-Yesterday
This is a game build by Renpy.
audiocaps-download
This package aims at simplifying the download of the AudioCaps dataset.
AudioLDM2
Text-to-Audio/Music Generation
audioset-download
This package aims at simplifying the download of the AudioSet dataset.
ER-NeRF
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
HSIC-regularized-Kernel-Ridge-Regression
stable-audio-metrics
Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.
Stock-Price-Prediction-Based-on-MF-DFA-Facebook-Prophet
《基于多重分形谱的股价指数特征提取及预测》一文中的代码
Vision-Transformer-based-Short-range-behavior-recognition-using-Radar-Signals
wav2lip_vq
wav2lip in a Vector Quantized (VQ) space
BingliangLi's Repositories
BingliangLi/stable-audio-metrics
Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.
BingliangLi/audiocaps-download
This package aims at simplifying the download of the AudioCaps dataset.
BingliangLi/AudioLDM2
Text-to-Audio/Music Generation
BingliangLi/controlled-motion-latent-diffusion
BingliangLi/CUHKSZ-Radiance
BingliangLi/chain-of-table
Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
BingliangLi/CoCap
[ICCV 2023] Accurate and Fast Compressed Video Captioning
BingliangLi/DeepEdit_old
Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471
BingliangLi/detr
End-to-End Object Detection with Transformers
BingliangLi/Diff-Foley
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
BingliangLi/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
BingliangLi/EDGE
Official PyTorch Implementation of EDGE (CVPR 2023)
BingliangLi/Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
BingliangLi/guided-motion-diffusion
BingliangLi/hoi-prediction-gaze-transformer
BingliangLi/HumanML3D
HumanML3D: A large and diverse 3d human motion-language dataset.
BingliangLi/image-background-remove-tool
✂️ Automated high-quality background removal framework for an image using neural networks. ✂️
BingliangLi/LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
BingliangLi/MP-HOI.github.io
BingliangLi/OLAPH-old
OLAPH: Improving Factuality in Biomedical Long-form Question Answering
BingliangLi/OmniControl
OmniControl: Control Any Joint at Any Time for Human Motion Generation, arXiv 2023
BingliangLi/OneTrainer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
BingliangLi/pcpnet
Pytorch implementation of PCPNet
BingliangLi/ProTrix_unofficial
Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context
BingliangLi/pyramid-discrete-diffusion
Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)
BingliangLi/R2-Talker-code
R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning
BingliangLi/sd-scripts
BingliangLi/Seeing-and-Hearing
[CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
BingliangLi/SpecVQGAN
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
BingliangLi/videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability