Pinned Repositories
AIR-Bench
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension
allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
arctic_shift
Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web interface.
asr2k
audio-dataset
Audio Dataset for training CLAP and other models
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
audio-flamingo
PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.
audio-slicer
A simple GUI application that slices audio with silence detection
AudioGPT
GuangkeChen's Repositories
GuangkeChen/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
GuangkeChen/awesome-llm-security
A curation of awesome tools, documents and projects about LLM Security.
GuangkeChen/Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
GuangkeChen/CPAD
The official dataset of paper "Goal-Oriented Prompt Attack and Safety Evaluation for LLMs".
GuangkeChen/CRoSS
[NeurIPS 2023] Official PyTorch implementation for the paper "CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography"
GuangkeChen/Diff-PGD
[NeurIPS 2023] Official code repo: Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability
GuangkeChen/EasyNMT
Easy to use, state-of-the-art Neural Machine Translation for 100+ languages
GuangkeChen/fast-detect-gpt
Code base for "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".
GuangkeChen/GPTFuzz
Official repo for GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
GuangkeChen/Grad-SVC
Diffusion Singing Voice Conversion based on Grad-TTS from Huawei
GuangkeChen/JailbreakingLLMs
GuangkeChen/leon
🧠 Leon is your open-source personal assistant.
GuangkeChen/llark
Code for the paper "LLark: A Multimodal Foundation Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.
GuangkeChen/LLaSM
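The first open-source, commercially usable dialogue model to support bilingual Chinese-English speech-text multimodal conversation. Convenient speech input greatly improves the experience of using large models that take text as input, while avoiding the cumbersome pipeline of ASR-based solutions and the errors they may introduce.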
GuangkeChen/MOSS
An open-source tool-augmented conversational language model from Fudan University
GuangkeChen/musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in PyTorch
GuangkeChen/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
GuangkeChen/Pengi
An Audio Language model for Audio Tasks
GuangkeChen/persuasive_jailbreaker
Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!
GuangkeChen/promptbench
A robustness evaluation framework for large language models on adversarial prompts
GuangkeChen/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
GuangkeChen/Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
GuangkeChen/Singing-Voice-Conversion
Project of Singing Voice Conversion.
GuangkeChen/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
GuangkeChen/Speech-Editing-Toolkit
It's a repository for implementations of neural speech editing algorithms.
GuangkeChen/stable_signature
Official implementation of the paper "The Stable Signature: Rooting Watermarks in Latent Diffusion Models"
GuangkeChen/synthea
Synthetic Patient Population Simulator
GuangkeChen/UnIVAL
[TMLR23] Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.
GuangkeChen/vall-e-a
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
GuangkeChen/WavJourney
WavJourney: Compositional Audio Creation with LLMs