Hannieliao

Tsinghua UniversityShen Zhen, China

Hannieliao's Stars

krahets/hello-algo
《Hello 算法》：动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新，English version ongoing
Language:Java98.3k 537 22612.4k
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language:Jupyter Notebook32.6k 350 1023.9k
datawhalechina/self-llm
《开源大模型食用指南》基于Linux环境快速部署开源大模型，更适合**宝宝的部署教程
Language:Jupyter Notebook9.3k 73 1631.1k
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Jupyter Notebook7.6k 78 189562
ashleve/lightning-hydra-template
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
Language:Python4.3k 28 245654
andrewekhalel/MLQuestions
Machine Learning and Computer Vision Engineer - Technical Interview Questions
3k 28 2504
Stability-AI/stable-audio-tools
Generative models for conditional audio generation
Language:Python2.7k 43 97258
awesome-mlss/awesome-mlss
🤖 Machine Learning Summer School deadlines
Language:HTML2.7k 254 49296
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
1.8k 53 1490
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Language:Python1.2k 14 9065
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Language:Python752 71 14110
LAION-AI/audio-dataset
Audio Dataset for training CLAP and other models
Language:Python632 21 5953
yangdongchao/UniAudio
The Open Source Code of UniAudio
Language:Python521 37 3332
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language:HTML348 13 420
haoheliu/audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
Language:Python302 6 931
haoheliu/AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
Language:Python209 17 3740
zhenye234/CoMoSpeech
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
Language:Python183 12 1120
yk7333/d3po
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
Language:Python170 7 1616
luosiallen/Diff-Foley
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
Language:Python160 9 3019
yzxing87/Seeing-and-Hearing
[CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
Language:Python127 11 107
MontrealCorpusTools/mfa-models
Collection of pretrained models for the Montreal Forced Aligner
Language:Python115 7 2220
speedyseal/audiosetdl
Scripts for download AudioSet
Language:Jupyter Notebook67 1 045
VincentHancoder/REPARO
The official implementation of work "REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment".
Language:Python49 3 2
dlrudco/Fast-Audioset-Download
Download audioset data super fastly with youtube-dl, ffmpeg and python multiprocessing
Language:Python30 1 61
MorenoLaQuatra/audiocaps-download
This package aims at simplifying the download of the AudioCaps dataset.
Language:Python30 2 44
ExplainableML/ImageSelect
Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"
Language:Python27 7 21
wei-zeng98/piano-a2s
End-to-end real-world polyphonic piano audio-to-score transcription with hierarchical decoding (IJCAI 2024)
Language:Python25 3 10
Hannieliao/Baton
Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"
Language:Python18 1 00
DragonLiu1995/CVPR-2024-Speech_Audio_Music-Papers
A curated collections of papers related to speech, audio and music in CVPR 2024.
6 1 00
yash1994/youtube-8m-videos-downloader
Download videos from YouTube-8M dataset for testing
Language:Python6 0 03

Hannieliao

Hannieliao's Stars

krahets/hello-algo

rasbt/LLMs-from-scratch

datawhalechina/self-llm

open-mmlab/Amphion

ashleve/lightning-hydra-template

andrewekhalel/MLQuestions

Stability-AI/stable-audio-tools

awesome-mlss/awesome-mlss

ChenHsing/Awesome-Video-Diffusion-Models

THUDM/ImageReward

Text-to-Audio/Make-An-Audio

LAION-AI/audio-dataset

yangdongchao/UniAudio

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

haoheliu/audioldm_eval

haoheliu/AudioLDM-training-finetuning

zhenye234/CoMoSpeech

yk7333/d3po

luosiallen/Diff-Foley

yzxing87/Seeing-and-Hearing

MontrealCorpusTools/mfa-models

speedyseal/audiosetdl

VincentHancoder/REPARO

dlrudco/Fast-Audioset-Download

MorenoLaQuatra/audiocaps-download

ExplainableML/ImageSelect

wei-zeng98/piano-a2s

Hannieliao/Baton

DragonLiu1995/CVPR-2024-Speech_Audio_Music-Papers

yash1994/youtube-8m-videos-downloader