zqlsnr

Audio Signal Processing

Nanjing, China

Pinned Repositories

ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
Language:Python0 0 00
all-in-one
All-In-One Music Structure Analyzer
Language:Python0 0 00
AnyEdit
Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"
Language:Jupyter Notebook00
audio_tagging_onnx
Easy to use Audio Tagging in Onnx
Language:Python1 1 00
automatic
SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models
Language:Python0 0 00
DPCRN
real-time speech enhance
Language:C++12 1 03
DTLN
real-time speech enhance
Language:Python3 1 00
filtfilt
voice-blur with filtfilt，forward-pass backward-pass
Language:C++1 1 00
speech-music-detection
tensorflow for speech-music-detection task，acc 96%+
Language:Python2 2 10
train-kolors-inpainting-lora
Language:Python10

zqlsnr's Repositories

zqlsnr/DPCRN
real-time speech enhance
Language:C++12 1 03
zqlsnr/DTLN
real-time speech enhance
Language:Python3 1 00
zqlsnr/speech-music-detection
tensorflow for speech-music-detection task，acc 96%+
Language:Python2 2 10
zqlsnr/audio_tagging_onnx
Easy to use Audio Tagging in Onnx
Language:Python1 1 00
zqlsnr/filtfilt
voice-blur with filtfilt，forward-pass backward-pass
Language:C++1 1 00
zqlsnr/ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
Language:Python0 0 00
zqlsnr/all-in-one
All-In-One Music Structure Analyzer
Language:Python0 0 00
zqlsnr/automatic
SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models
Language:Python0 0 00
zqlsnr/beat_tracker
Beat tracker assignment for Music Informatics
Language:Python0 0 00
zqlsnr/Bert-VITS2
vits2 backbone with bert
Language:Python0 0 00
zqlsnr/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python00
zqlsnr/OpenVoice
Instant voice cloning by MyShell
Language:Python0 0 00
zqlsnr/Sentiment-classification
LSTM Sentiment-classification
Language:Python0 2 00
zqlsnr/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Language:Python0 0
zqlsnr/CED_audiotagging
Source code for Consistent ensemble distillation for audio tagging
Language:Python0 0
zqlsnr/chorus-detection
A machine learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that allows users to input a YouTube link and utilize a pre-trained CRNN model to detect chorus sections from a song on YouTube
Language:Jupyter Notebook0 0
zqlsnr/FLUX-Controlnet-Inpainting
zqlsnr/FoleyCrafter
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师，给你的无声视频添加生动而且同步的音效 😝
Language:Python0 0
zqlsnr/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Language:Python0 0
zqlsnr/grok-1
Grok open release
Language:Python0 0
zqlsnr/gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Language:Python0 0
zqlsnr/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
zqlsnr/IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
zqlsnr/Kolors-TensorRT-libtorch
Kolors with TensorRT and libtorch
Language:C++0 0
zqlsnr/MoneyPrinter
Automate Creation of YouTube Shorts using MoviePy.
Language:Python0 0
zqlsnr/odeval
Benchmarking the accelerated generation quality of OneDiff.
zqlsnr/PaddleVideo
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.
Language:Python0 0
zqlsnr/PowerPaint
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一个高质量多功能的图像修补模型，可以同时支持插入物体、移除物体、图像扩展、形状可控的物体生成，只需要一个模型
zqlsnr/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
zqlsnr/TVSM-dataset
Language:Python0 0