aqzlpm11

aqzlpm11's Stars

HqWu-HITCS/Awesome-Chinese-LLM
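A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, covering base models, domain-specific fine-tuning and applications, datasets, and tutorials.
15.2k stars, 1.4k forks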
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
11.9k stars, 769 forks
cwx-worst-one/EAT
[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
Language:Python993
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
61127
EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
55531
RoyChao19477/SEMamba
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
Language:Python12111
mosaicml/streaming
A Data Streaming Library for Efficient Neural Network Training
Language: Python, 1.1k stars, 136 forks
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
Language: Python, 2.5k stars, 236 forks
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language: Python, 6.1k stars, 536 forks
speechnovateur/languagecodec_tmp
Temporary anonymous version
Language:Python221
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python, 33.2k stars, 3.8k forks
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
Language: Python, 1.2k stars, 134 forks
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python, 4.5k stars, 383 forks
OlaWod/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Language:Python590108
Azure/MS-AMP
Microsoft Automatic Mixed Precision Library
Language:Python51042
Vaibhavs10/ml-with-audio
HF's ML for Audio study group
Language:Jupyter Notebook18129
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Language: Python, 1.1k stars, 103 forks
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Language: Python, 35.7k stars, 3.4k forks
datawhalechina/llm-cookbook
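An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large language model course series.
Language: Jupyter Notebook, 11.5k stars, 1.4k forks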
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
1.9k stars, 67 forks
FastGitORG/nginx-conf
⚙️ Nginx conf of FastGit, core part of fastgit web booster module
Language:Shell10833
ehabets/RIR-Generator
Generating room impulse responses
Language:C++420145
nico-zck/zotero-scholar-citations
Language:JavaScript772
jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Language:Python660151
maxielee/ctrl-space-ime
AutoHotkey script to easily toggle Han/En input mode in Windows 10
Language:AutoHotkey22
sukumo28/vscode-audio-preview
VS Code extension that allows you to preview and play audio files.
Language:TypeScript14016
PaperCutSoftware/pdfsearch
A full text search library for PDFs.
Language:Go634
P3TERX/aria2.sh
Enhanced one-click installation and management script for Aria2
Language: Shell, 2.9k stars, 754 forks
NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language: Jupyter Notebook, 5.1k stars, 1.4k forks
agarden/remove-pdf-watermark
Short script for removing watermarks from PDF files. Requires pdftk.
Language:Python5721