p0p4k's Stars
charlax/professional-programming
A collection of learning resources for curious software engineers
amix/vimrc
The ultimate Vim configuration (vimrc)
mhinz/vim-galore
🎓 All things Vim!
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
cuda-mode/lectures
Material for cuda-mode lectures
microsoft/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
pytorch/ao
PyTorch native quantization and sparsity for training and inference
NVlabs/FasterViT
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
NVlabs/edm2
Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)
Jokeren/Awesome-GPU
Awesome resources for GPUs
hubertsiuzdak/snac
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
louaaron/Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
dongzhuoyao/awesome-flow-matching
A summary of related work on flow matching and stochastic interpolants
DNA-Rendering/DNA-Rendering
DNA-RENDERING: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering
pytorch-labs/float8_experimental
This repository contains the experimental PyTorch native float8 training UX
jishengpeng/Languagecodec
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models
locuslab/ect
Consistency Models Made Easy
hayeong0/DDDM-VC
Official PyTorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)
roedoejet/g2p
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
AudiogenAI/agc
Audiogen Codec
nii-yamagishilab/ZMM-TTS
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
tadeephuy/GradientReversal
Gradient Reversal Layer for Domain Adaptation
malbergo/stochastic-interpolants
HannesStark/FlowSite
Implementation of FlowSite and HarmonicFlow from the paper "Harmonic Self-Conditioned Flow Matching for Multi-Ligand Docking and Binding Site Design"
resemble-ai/monotonic_align
Monotonic Alignment Search
j-min/MoChA-pytorch
PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)
duchenzhuang/FSQ-pytorch
A PyTorch Implementation of Finite Scalar Quantization
speechnovateur/languagecodec_tmp
Temporary anonymous version