Pinned Repositories
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Algorithm_Interview_Notes-Chinese
2018/2019 / campus recruitment / spring recruitment / autumn recruitment / algorithms / machine learning / deep learning / natural language processing (NLP) / C/C++ / Python / interview notes
AlgorithmInterview
baos_Wireless_Lib
chatgpt_academic
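A ChatGPT extension built for research work, specially optimized for polishing academic papers. Supports custom shortcut buttons, Markdown table rendering, dual display of TeX formulas, and full-featured code display, with new local Python project analysis and self-analysis features.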
Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
WechatBot
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
wireless
Wireless implementation of spinal codes over AWGN without itpp.
wireless3
Spinal Code over AWGN
zwglory's Repositories
zwglory/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
zwglory/chatgpt_academic
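A ChatGPT extension built for research work, specially optimized for polishing academic papers. Supports custom shortcut buttons, Markdown table rendering, dual display of TeX formulas, and full-featured code display, with new local Python project analysis and self-analysis features.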
zwglory/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
zwglory/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
zwglory/bark
🔊 Text-Prompted Generative Audio Model
zwglory/EmoGator
zwglory/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
zwglory/fish-speech
Brand new TTS solution
zwglory/FlipSketch
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations
zwglory/g1
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
zwglory/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
zwglory/gpt4free
Decentralising the AI industry, just some language model APIs...
zwglory/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
zwglory/MediaCrawler
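Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, and Weibo posts and comments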
zwglory/moshi
zwglory/MOSS
An open-source tool-augmented conversational language model from Fudan University
zwglory/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
zwglory/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
zwglory/OpenVoice
Instant voice cloning by MyShell.
zwglory/QAnything
Question and Answer based on Anything.
zwglory/RealtimeTTS
Converts text to speech in realtime
zwglory/roomGPT
Upload a photo of your room to generate your dream room with AI.
zwglory/sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
zwglory/so-vits-svc
SoftVC VITS Singing Voice Conversion
zwglory/so-vits-svc-5.0
Core Engine of Singing Voice Conversion & Singing Voice Clone
zwglory/stable-diffusion-webui
Stable Diffusion web UI
zwglory/Telechat
zwglory/video-subtitle-extractor
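Hard-coded video subtitle extraction with no third-party API required; text recognition runs locally. A deep-learning (CTPN+CRNN) video extraction framework that includes subtitle region detection and subtitle content extraction.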
zwglory/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
zwglory/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild