zwglory

University of Chinese Academy of ScienceBeijing in China

Pinned Repositories

wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:Python4.2k 89 1k1.1k
Algorithm_Interview_Notes-Chinese
2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记
Language:Python1 0 00
AlgorithmInterview
Language:Python1 1 00
baos_Wireless_Lib
Language:Python1 1 00
chatgpt_academic
科研工作专用ChatGPT拓展，特别优化学术Paper润色体验，支持自定义快捷按钮，支持markdown表格显示，Tex公式双显示，代码显示功能完善，新增本地Python工程剖析功能/自我剖析功能
Language:Python1 0 00
Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python3 0 00
WechatBot
Language:TypeScript1 1 00
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:C++0 0 00
wireless
Wireless implement of spinal codes over AWGN without itpp.
Language:C++3 1 01
wireless3
Spinal Code over AWGN
Language:C++5 1 00

zwglory's Repositories

zwglory/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python3 0 00
zwglory/chatgpt_academic
科研工作专用ChatGPT拓展，特别优化学术Paper润色体验，支持自定义快捷按钮，支持markdown表格显示，Tex公式双显示，代码显示功能完善，新增本地Python工程剖析功能/自我剖析功能
Language:Python1 0 00
zwglory/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:C++0 0 00
zwglory/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python0 0
zwglory/bark
🔊 Text-Prompted Generative Audio Model
Language:Python0 0
zwglory/EmoGator
Language:Python0 0
zwglory/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language:Python0 0
zwglory/fish-speech
Brand new TTS solution
Language:Python0 0
zwglory/FlipSketch
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations
zwglory/g1
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
zwglory/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python0 0
zwglory/gpt4free
decentralising the Ai Industry, just some language model api's...
Language:Python0 0
zwglory/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Language:Python0 0
zwglory/MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频｜评论爬虫、微博帖子｜评论爬虫
Language:Python
zwglory/moshi
Language:Python0 0
zwglory/MOSS
An open-source tool-augmented conversational language model from Fudan University
Language:Python0 0
zwglory/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Language:Python0 0
zwglory/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Language:Python0 0
zwglory/OpenVoice
Instant voice cloning by MyShell.
Language:Python0 0
zwglory/QAnything
Question and Answer based on Anything.
Language:Python0 0
zwglory/RealtimeTTS
Converts text to speech in realtime
Language:Python0 0
zwglory/roomGPT
Upload a photo of your room to generate your dream room with AI.
Language:TypeScript0 0
zwglory/sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
Language:Python0 0
zwglory/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python0 0
zwglory/so-vits-svc-5.0
Core Engine of Singing Voice Conversion & Singing Voice Clone
Language:Python0 0
zwglory/stable-diffusion-webui
Stable Diffusion web UI
Language:Python0 0
zwglory/Telechat
zwglory/video-subtitle-extractor
视频硬字幕提取，无需申请第三方API，本地实现文本识别。基于深度学习(CTPN+CRNN)的视频提取框架，包含字幕区域检测、字幕内容提取
Language:Python0 0
zwglory/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
Language:Python0 0
zwglory/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild