Pinned Repositories
audio-slicer
Python script that slices audio with silence detection
AudioSlicer
Audio Slicer that uses silence detection to split .wav audio files into several .wav samples.
bark
🔊 Text-Prompted Generative Audio Model
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
chatgpt-mirror
A mirror of ChatGPT based on the gpt-3.5-turbo model.
chatgpt-web
用 Express 和 Vue3 搭建的 ChatGPT 演示网页
datasetapi
规范化管理labelme数据集并生成coco数据集
DDSP-SVC
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
scottsln's Repositories
scottsln/audio-slicer
Python script that slices audio with silence detection
scottsln/AudioSlicer
Audio Slicer that uses silence detection to split .wav audio files into several .wav samples.
scottsln/bark
🔊 Text-Prompted Generative Audio Model
scottsln/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
scottsln/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
scottsln/chatgpt-mirror
A mirror of ChatGPT based on the gpt-3.5-turbo model.
scottsln/chatgpt-web
用 Express 和 Vue3 搭建的 ChatGPT 演示网页
scottsln/DDSP-SVC
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
scottsln/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
scottsln/HairCLIPv2
[ICCV 2023] HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending
scottsln/langchain-ChatGLM
langchain-ChatGLM, local knowledge based ChatGLM with langchain | 基于本地知识库的 ChatGLM 问答
scottsln/mmdetection
OpenMMLab Detection Toolbox and Benchmark
scottsln/MoeVoiceStudio
一个使用C++编写的音频处理软件
scottsln/Openai-whisper
Robust Speech Recognition via Large-Scale Weak Supervision
scottsln/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
scottsln/piper
A fast, local neural text to speech system
scottsln/rustdesk
An open-source remote desktop, and alternative to TeamViewer.
scottsln/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
scottsln/so-vits-svc
SoftVC VITS Singing Voice Conversion
scottsln/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
scottsln/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
scottsln/stylegan2
StyleGAN2 - Official TensorFlow Implementation
scottsln/stylegan2-ada-pytorch
StyleGAN2-ADA - Official PyTorch implementation
scottsln/stylegan3
Official PyTorch implementation of StyleGAN3
scottsln/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
scottsln/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
scottsln/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
scottsln/voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
scottsln/Yi
A series of large language models trained from scratch by developers @01-ai
scottsln/yt-dlp
A youtube-dl fork with additional features and fixes