Ryu1845's Stars
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
biomejs/biome
A toolchain for web projects, aimed to provide functionalities to maintain them. Biome offers formatter and linter, usable via CLI and LSP.
unslothai/unsloth
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
beeware/toga
A Python native, OS native GUI toolkit.
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
grantjenks/python-sortedcontainers
Python Sorted Container Types: Sorted List, Sorted Dict, and Sorted Set
hao-ai-lab/LookaheadDecoding
bracesdev/errtrace
An alternative to stack traces for your Go errors
kmille/freetar
freetar - an alternative frontend for ultimate-guitar.com
f-dangel/cockpit
Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
toyxyz/ComfyUI_toyxyz_test_nodes
Custom node and script for sending webcam to ComfyUI
regeirk/pycwt
A Python module for continuous wavelet spectral analysis. It includes a collection of routines for wavelet transform and statistical analysis via FFT algorithm. In addition, the module also includes cross-wavelet transforms, wavelet coherence tests and sample scripts.
garibida/cross-image-attention
Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"
furiousteabag/doppelganger
Fine-tuning LLM on my Telegram chats
microsoft/ResiDual
ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802
wmmae/wmma_extension
An extension library of WMMA API (Tensor Core API)
voidful/AudioDecBenchmark
Audio Codec Benchmark
pytorch-labs/torchfix
TorchFix - a linter for PyTorch-using code with autofix support
insuhan/hyper-attn
Christoph-Lauer/Sonogram-Visible-Speech
A speech and sound anallysis tool.
Speech-Interaction-Technology-Aalto-U/itsp
Introduction to Speech Processing
0417keito/JEN-1-pytorch
Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.04729)
bhoov/energy-transformer-jax
The Energy Transformer block, in JAX
jzmzhong/Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
vvvm23/TchAIkovsky
Using JAX to generate piano music as MIDI
marcojira/fls
PyTorch code for FLS, FID, KID, Precision, Recall, etc. using DINOv2, InceptionV3, CLIP, etc.
JuanPZuluaga/accent-recog-slt2022
Repository for Accent Recognition (Hackathon @SLT2022)