IMYBo

NWPUChina

IMYBo's Stars

LC044/WeChatMsg
提取微信聊天记录，将其导出成HTML、Word、Excel文档永久保存，对聊天记录进行分析生成年度聊天报告，用聊天数据训练专属于个人的AI聊天助手
Language:Python33.5k 171 4033.5k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python21.7k 185 4872.1k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python11.6k 206 2.2k2.4k
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.3k 160 3001k
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Language:Jupyter Notebook7.6k 106 291471
state-spaces/s4
Structured state space sequence models
Language:Jupyter Notebook2.4k 52 134285
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
1.6k 79 7224
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Language:Python1.1k 27 74103
vivo-ai-lab/BlueLM
BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab
Language:Python827 14 2755
microsoft/MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
Language:HTML477 20 15146
subhadarship/kmeans_pytorch
kmeans using PyTorch
Language:Jupyter Notebook472 7 3775
metame-ai/awesome-audio-plaza
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
309 30 211
BUTSpeechFIT/VBx
Variational Bayes HMM over x-vectors diarization
Language:Python251 21 6357
rishikksh20/hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Language:Python202 10 1045
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
197 11 03
slp-rl/aero
This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)
Language:Python196 6 2726
pythad/nider
Python package to add text to images, textures and different backgrounds
Language:Python151 2 819
marianne-m/brouhaha-vad
Predicts the level of noise and reverberation on your audiofiles
Language:Jupyter Notebook137 10 1624
facebookresearch/ears_dataset
Expressive Anechoic Recordings of Speech (EARS)
Language:Python126 6 57
haoheliu/SemantiCodec-inference
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.
Language:Python125 5 78
f-dangel/unfoldNd
(N=1,2,3)-dimensional unfold (im2col) and fold (col2im) in PyTorch
Language:Python83 4 196
desh2608/dover-lap
Python package for combining diarization system outputs.
Language:Python73 4 713
liyunlongaaa/NSD-MS2S
CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture
Language:Shell62 3 84
yuguochencuc/BAE-Net
BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION
Language:Python55 8 102
AkojimaSLP/Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming
speech-enhacement
Language:Python48 4 016
BUTSpeechFIT/EEND_dataprep
Language:Shell47 5 87
dmlguq456/NeXt_TDNN_ASV
Official repository of NeXt-TDNN for speaker verification
Language:Python47 2 54
Kuray107/S4ND-U-Net_speech_enhancement
Language:Python28 2 13
JusperLee/S4M
Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models
Language:Python18 1 22
wq2012/VB_diarization
VB Diarization with Eigenvoice and HMM Priors, refactored
Language:Python14 4 23

IMYBo

IMYBo's Stars

LC044/WeChatMsg

hpcaitech/Open-Sora

NVIDIA/NeMo

PKU-YuanGroup/Open-Sora-Plan

01-ai/Yi

state-spaces/s4

wq2012/awesome-diarization

descriptinc/descript-audio-codec

vivo-ai-lab/BlueLM

microsoft/MS-SNSD

subhadarship/kmeans_pytorch

metame-ai/awesome-audio-plaza

BUTSpeechFIT/VBx

rishikksh20/hifigan-denoiser

DongKeon/Awesome-Speaker-Diarization

slp-rl/aero

pythad/nider

marianne-m/brouhaha-vad

facebookresearch/ears_dataset

haoheliu/SemantiCodec-inference

f-dangel/unfoldNd

desh2608/dover-lap

liyunlongaaa/NSD-MS2S

yuguochencuc/BAE-Net

AkojimaSLP/Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming

BUTSpeechFIT/EEND_dataprep

dmlguq456/NeXt_TDNN_ASV

Kuray107/S4ND-U-Net_speech_enhancement

JusperLee/S4M

wq2012/VB_diarization