Mikezz1

Moscow

Mikezz1's Stars

svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python26.3k 180 1304.9k
yandexdataschool/Practical_RL
A course in reinforcement learning in the wild
Language:Jupyter Notebook6k 210 1861.7k
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Language:Python5k 58 236431
neuralmagic/deepsparse
Sparsity-aware deep learning inference runtime for CPUs
Language:Python3.1k 58 142179
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language:Python2.7k 32 142218
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python2.3k 52 223427
lucidrains/lion-pytorch
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
Language:Python2.1k 15 2452
homebrewltd/ichigo
Local realtime voice AI
Language:Python1.9k 19 6991
QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python1.6k 26 68112
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Language:Python1.3k 28 80120
adefossez/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Language:Python1.1k 32 0109
bytedance/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
Language:Python1.1k 29 7088
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
Language:Python972 43 433223
KanatnikovMax/znanie-drevnix
Language:C++884 18 2543
lucidrains/mixture-of-experts
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
Language:Python668 6 1150
lucidrains/rotary-embedding-torch
Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch
Language:Python609 11 3247
justinjohn0306/so-vits-svc-4.0-v2
SoftVC VITS Singing Voice Conversion
Language:Python569 5 1888
lucidrains/BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
Language:Python477 13 3118
IvanDrokin/torch-conv-kan
This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implementations of 1D, 2D, and 3D convolutions with different kernels, ResNet-like and DenseNet-like models, training code based on accelerate/PyTorch, as well as scripts for experiments with CIFAR-10 and Tiny ImageNet.
Language:Python458 7 1434
DmitryRyumin/ICASSP-2023-24-Papers
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
Language:Python425 29 417
JorisCos/LibriMix
An open source dataset for source separation
Language:Python391 10 2667
sleepymalc/VSCode-LaTeX-Inkscape
✍️ A way to integrate LaTeX, VS Code, and Inkscape in macOS
Language:Python389 4 1227
ruizhecao96/CMGAN
Conformer-based Metric GAN for speech enhancement
Language:Python335 9 4760
apple/ml-sigma-reparam
Language:Python296 13 014
Srijith-rkr/Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
Language:Jupyter Notebook243 5 1316
Hypotheses-Paradise/Hypo2Trans
Single-blind supplementary materials for NeurIPS 2023 submission
Language:Python96 7 25
alibabasglab/MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
89 3 78
georgygospodinov/speech_course
Deep Learning for Speech
Language:Jupyter Notebook82 3 08
ischurov/scientific-computing-2024
Bridging the gap between mathematical courses and ML
Language:Jupyter Notebook56 17 08
fattorib/fusedswiglu
Fused SwiGLU Triton kernels
Language:Python3 1 10

Mikezz1

Mikezz1's Stars

svc-develop-team/so-vits-svc

yandexdataschool/Practical_RL

lucidrains/x-transformers

neuralmagic/deepsparse

Doubiiu/DynamiCrafter

asteroid-team/asteroid

lucidrains/lion-pytorch

homebrewltd/ichigo

QwenLM/Qwen-Audio

descriptinc/descript-audio-codec

adefossez/demucs

bytedance/SALMONN

lhotse-speech/lhotse

KanatnikovMax/znanie-drevnix

lucidrains/mixture-of-experts

lucidrains/rotary-embedding-torch

justinjohn0306/so-vits-svc-4.0-v2

lucidrains/BS-RoFormer

IvanDrokin/torch-conv-kan

DmitryRyumin/ICASSP-2023-24-Papers

JorisCos/LibriMix

sleepymalc/VSCode-LaTeX-Inkscape

ruizhecao96/CMGAN

apple/ml-sigma-reparam

Srijith-rkr/Whispering-LLaMA

Hypotheses-Paradise/Hypo2Trans

alibabasglab/MossFormer

georgygospodinov/speech_course

ischurov/scientific-computing-2024

fattorib/fusedswiglu