DogeFlow

DogeFlow's Stars

jishengpeng/WavChat
A Survey of Spoken Dialogue Models (60 pages)
23813
FunAudioLLM/CosyVoice
A multilingual large voice generation model with full-stack support for inference, training, and deployment.
Language: Python, 9.2k stars, 883 forks
coreweave/tensorizer
Module, Model, and Tensor Serialization/Deserialization
Language:Python20733
ohmyzsh/ohmyzsh
🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python, etc), 140+ themes to spice up your morning, and an auto-update tool that makes it easy to keep up with the latest updates from the community.
Language: Shell, 175k stars, 26k forks
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python, 33.2k stars, 5k forks
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Language: Python, 2.2k stars, 249 forks
Project-MONAI/MONAI
AI Toolkit for Healthcare Imaging
Language: Python, 6k stars, 1.1k forks
Shubhamsaboo/awesome-llm-apps
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini, and open-source models.
Language: Python, 11.1k stars, 1.2k forks
allenai/open-instruct
Language: Python, 2.2k stars, 256 forks
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python, 10.5k stars, 1.4k forks
pytorch/torchtitan
A PyTorch native library for large model training
Language: Python, 2.9k stars, 233 forks
modelscope/ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Language: Python, 1.9k stars, 135 forks
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language: Python, 2.3k stars, 425 forks
dmlguq456/SepReformer
Official repository of SepReformer for speech separation
Language:Python16214
huggingface/smol-course
A course on aligning smol models.
Language: Jupyter Notebook, 3.8k stars, 1.2k forks
JusperLee/Dual-Path-RNN-Pytorch
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation, implemented in PyTorch
Language:Python42766
JusperLee/Conv-TasNet
A PyTorch implementation of Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Language:Python44577
resemble-ai/Resemblyzer
A Python package to analyze and compare voices with deep learning
Language: Python, 2.8k stars, 433 forks
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Language: Python, 7.1k stars, 533 forks
juanmc2005/diart
A Python package to build AI-powered real-time audio applications
Language: Python, 1.1k stars, 90 forks
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Jupyter Notebook, 6.6k stars, 803 forks
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Language: Python, 783 stars, 126 forks
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language: Python, 1.4k stars, 119 forks
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Language: Jupyter Notebook, 4k stars, 355 forks
yeyupiaoling/VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.
Language: Python, 864 stars, 128 forks
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Language: Python, 8.5k stars, 1.1k forks
TowerYsable/speech_enhancement_awesome
Language:Python203
alibabasglab/MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
898
coder/code-server
VS Code in the browser
Language: TypeScript, 69.2k stars, 5.7k forks
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
Language: Python, 23.9k stars, 5.5k forks