p0p4k

medically diagnosed with imposter syndrome

p0p4k's Stars

charlax/professional-programming
A collection of learning resources for curious software engineers
Language: Python 46.5k 993 283.7k
amix/vimrc
The ultimate Vim configuration (vimrc)
Language: Vim Script 30.6k 778 5107.3k
mhinz/vim-galore
All things Vim!
Language: Vim Script 16.8k 322 94604
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language: Jupyter Notebook 7.5k 89 128739
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
Language: Python 5.2k 39 37503
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
3.2k 133 18193
cuda-mode/lectures
Material for cuda-mode lectures
Language: Jupyter Notebook 2.5k 35 7252
microsoft/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Language: Python 1.2k 24 86113
pytorch/ao
PyTorch native quantization and sparsity for training and inference
Language: Python 1.1k 39 219114
NVlabs/FasterViT
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
Language: Python 768 18 4862
NVlabs/edm2
Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)
Language: Python 489 12 519
Jokeren/Awesome-GPU
Awesome resources for GPUs
467 24 047
hubertsiuzdak/snac
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
Language: Python 372 7 2221
louaaron/Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
Language: Python 366 7 1134
dongzhuoyao/awesome-flow-matching
A summary of related works about flow matching, stochastic interpolants
282 11 210
DNA-Rendering/DNA-Rendering
DNA-RENDERING: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering
Language: Python 218 14 164
pytorch-labs/float8_experimental
This repository contains the experimental PyTorch native float8 training UX
Language: Python 212 25 4720
jishengpeng/Languagecodec
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models
Language: Python 207 8 716
locuslab/ect
Consistency Models Made Easy
Language: Python 196 6 137
hayeong0/DDDM-VC
Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)
Language: Python 178 15 1719
roedoejet/g2p
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
Language: Python 128 10 12427
AudiogenAI/agc
Audiogen Codec
Language: Python 118 3 111
nii-yamagishilab/ZMM-TTS
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
Language: C 116 5 68
tadeephuy/GradientReversal
Gradient Reversal Layer for Domain Adaptation
Language: Python 103 1 110
malbergo/stochastic-interpolants
Language: Jupyter Notebook 94 5 511
HannesStark/FlowSite
Implementation of FlowSite and HarmonicFlow from the paper "Harmonic Self-Conditioned Flow Matching for Multi-Ligand Docking and Binding Site Design"
Language: Python 86 2 66
resemble-ai/monotonic_align
Monotonic Alignment Search
Language: Cython 83 6 114
j-min/MoChA-pytorch
PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)
Language: Python 76 10 520
duchenzhuang/FSQ-pytorch
A Pytorch Implementation of Finite Scalar Quantization
Language: Python 70 5 44
speechnovateur/languagecodec_tmp
Temporary anonymous version
Language: Python 22 4 01
