JinhuaLiang
A Ph.D. student at the Centre for Digital Music (C4DM), Queen Mary University of London.
London
JinhuaLiang's Stars
lucidrains/video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
haoheliu/youtube-8m-videos-downloader
Download videos from YouTube-8M dataset for testing
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
roudimit/MUSIC_dataset
MUSIC Dataset from The Sound of Pixels (ECCV '18)
demoray/azure-pim-cli
Unofficial CLI to list and enable Azure Privileged Identity Management (PIM) roles
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
mhamilton723/DenseAV
Official code for the CVPR 2024 paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language
DragonLiu1995/CVPR-2024-Speech_Audio_Music-Papers
A curated collection of papers related to speech, audio and music in CVPR 2024.
jlegewie/zotfile
Zotero plugin to manage your attachments: automatically rename, move, and attach PDFs (or other files) to Zotero items, sync PDFs from your Zotero library to your (mobile) PDF reader (e.g. an iPad, Android tablet, etc.), and extract PDF annotations.
yukara-ikemiya/friendly-stable-audio-tools
Refactored / updated version of `stable-audio-tools`, the open-source code for audio/music generative models originally by Stability AI.
abyildirim/inst-inpaint
A novel inpainting framework that can remove objects from images based on the instructions given as text prompts.
ChenyangSi/FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
anusfoil/DExter
DExter: Learning and Controlling Performance Expression through Diffusion models
TheAlgorithms/Python
All Algorithms implemented in Python
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
soulmachine/machine-learning-cheat-sheet
Classical equations and diagrams in machine learning
andrewekhalel/MLQuestions
Machine Learning and Computer Vision Engineer - Technical Interview Questions
Avik-Jain/100-Days-Of-ML-Code
100 Days of ML Coding
chiphuyen/machine-learning-systems-design
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
khangich/machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
Sara-Ahmed/SiT
Self-supervised vIsion Transformer (SiT)
jiwoogit/StyleID
[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer
leffff/adversarial-diffusion-distillation
My Implementation of Adversarial Diffusion Distillation https://arxiv.org/pdf/2311.17042.pdf
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
EmoryMLIP/OT-Flow
PyTorch implementation of the OT-Flow approach in arXiv:2006.00104
dmse4tts/DMSE4TTS
c4dm/dcase-few-shot-bioacoustic
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.