JinhuaLiang

A Ph.D. student from Centre for Digial Music (C4DM), Queen Mary University of London.

London

JinhuaLiang's Stars

lucidrains/video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
Language:Python1.2k125
haoheliu/youtube-8m-videos-downloader
Download videos from YouTube-8M dataset for testing
Language:Python6
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Language:Python1.8k109
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language:Python1.7k113
roudimit/MUSIC_dataset
MUSIC Dataset from The Sound of Pixels (ECCV '18)
11726
demoray/azure-pim-cli
Unofficial CLI to list and enable Azure Privileged Identity Management (PIM) roles
Language:Rust272
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Language:Python2.3k111
mhamilton723/DenseAV
Offical code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language
Language:Jupyter Notebook589
DragonLiu1995/CVPR-2024-Speech_Audio_Music-Papers
A curated collections of papers related to speech, audio and music in CVPR 2024.
6
jlegewie/zotfile
Zotero plugin to manage your attachments: automatically rename, move, and attach PDFs (or other files) to Zotero items, sync PDFs from your Zotero library to your (mobile) PDF reader (e.g. an iPad, Android tablet, etc.), and extract PDF annotations.
Language:Java4k280
yukara-ikemiya/friendly-stable-audio-tools
Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.
Language:Python13010
abyildirim/inst-inpaint
A novel inpainting framework that can remove objects from images based on the instructions given as text prompts.
Language:Python35626
ChenyangSi/FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
1.7k73
gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Language:Python79591
anusfoil/DExter
DExter: Learning and Controlling Performance Expression through Diffusion models
Language:Python141
TheAlgorithms/Python
All Algorithms implemented in Python
Language:Python193k45.5k
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Language:Python273k46k
soulmachine/machine-learning-cheat-sheet
Classical equations and diagrams in machine learning
Language:TeX7.4k1.3k
andrewekhalel/MLQuestions
Machine Learning and Computer Vision Engineer - Technical Interview Questions
3k497
Avik-Jain/100-Days-Of-ML-Code
100 Days of ML Coding
45.2k10.6k
chiphuyen/machine-learning-systems-design
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
Language:HTML9k1.4k
khangich/machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
9.6k1.6k
Sara-Ahmed/SiT
Self-supervised vIsion Transformer (SiT)
Language:Python32249
jiwoogit/StyleID
[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer
Language:Python19511
leffff/adversarial-diffusion-distillation
My Implementation of Adversarial Diffusion Distillation https://arxiv.org/pdf/2311.17042.pdf
Language:Jupyter Notebook414
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Language:Python35.7k3.4k
EmoryMLIP/OT-Flow
PyTorch implementation of the OT-Flow approach in arXiv:2006.00104
Language:Python4916
dmse4tts/DMSE4TTS
Language:Python222
c4dm/dcase-few-shot-bioacoustic
Language:Python4836
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
Language:Jupyter Notebook70769

JinhuaLiang

JinhuaLiang's Stars

lucidrains/video-diffusion-pytorch

haoheliu/youtube-8m-videos-downloader

facebookresearch/chameleon

cambrian-mllm/cambrian

roudimit/MUSIC_dataset

demoray/azure-pim-cli

UX-Decoder/Semantic-SAM

mhamilton723/DenseAV

DragonLiu1995/CVPR-2024-Speech_Audio_Music-Papers

jlegewie/zotfile

yukara-ikemiya/friendly-stable-audio-tools

abyildirim/inst-inpaint

ChenyangSi/FreeU

gemelo-ai/vocos

anusfoil/DExter

TheAlgorithms/Python

donnemartin/system-design-primer

soulmachine/machine-learning-cheat-sheet

andrewekhalel/MLQuestions

Avik-Jain/100-Days-Of-ML-Code

chiphuyen/machine-learning-systems-design

khangich/machine-learning-interview

Sara-Ahmed/SiT

jiwoogit/StyleID

leffff/adversarial-diffusion-distillation

XingangPan/DragGAN

EmoryMLIP/OT-Flow

dmse4tts/DMSE4TTS

c4dm/dcase-few-shot-bioacoustic

teticio/audio-diffusion