Pinned Repositories
Blog
Python机器学习算法技术博客,有原创干货!有code实践!
ChatTTS_colab
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
controllable_evc_code
This is the code for controllable EVC framework for seen and unseen emotion generation.
conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
demucs
Code for the paper Music Source Separation in the Waveform Domain
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
llama
Inference code for LLaMA models
one-python-craftsman
来自一位 Pythonista 的编程经验分享,内容涵盖编码技巧、最佳实践与思维模式等方面。
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
hildazzz's Repositories
hildazzz/Blog
Python机器学习算法技术博客,有原创干货!有code实践!
hildazzz/ChatTTS_colab
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
hildazzz/controllable_evc_code
This is the code for controllable EVC framework for seen and unseen emotion generation.
hildazzz/conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
hildazzz/DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
hildazzz/demucs
Code for the paper Music Source Separation in the Waveform Domain
hildazzz/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
hildazzz/llama
Inference code for LLaMA models
hildazzz/one-python-craftsman
来自一位 Pythonista 的编程经验分享,内容涵盖编码技巧、最佳实践与思维模式等方面。
hildazzz/PaddleSpeech
Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
hildazzz/prepare_detection_dataset
convert dataset to coco/voc format
hildazzz/pytorch_kmeans
Implementation of the k-means algorithm in PyTorch that works for large datasets
hildazzz/Top-AI-Conferences-Paper-with-Code
Top-Conferences-Paper-with-Code (ACL、EMNLP、NAACL、COLING、AAAI、IJCAI、NeurIPS、ICLR and etc)
hildazzz/transferlearning
Everything about Transfer Learning and Domain Adaptation--迁移学习
hildazzz/vits-simple-api
A simple VITS HTTP API, developed by extending Moegoe with additional features.
hildazzz/voxceleb_trainer
In defence of metric learning for speaker recognition
hildazzz/yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
hildazzz/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/