chenpaopao's Stars
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
mistralai/mistral-inference
Official inference library for Mistral models
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
YaoFANGUK/video-subtitle-extractor
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
CLUEbenchmark/CLUEDatasetSearch
搜索所有中文NLP数据集,附常用英文NLP数据集
ChineseSubFinder/ChineseSubFinder
自动化中文字幕下载。字幕网站支持 shooter、xunlei、arrst、a4k、SubtitleBest 。支持 Emby、Jellyfin、Plex、Sonarr、Radarr、TMM
thu-coai/CDial-GPT
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
srx-2000/spider_collection
python爬虫,目前库存:网易云音乐歌曲爬取,B站视频爬取,知乎问答爬取,壁纸爬取,xvideos视频爬取,有声书爬取,微博爬虫,安居客信息爬取+数据可视化,哔哩哔哩视频封面提取器,ip代理池封装,知乎百万级用户爬虫+数据分析,github用户爬虫
libukai/Awesome-ChatTTS
官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
lartpang/PyTorchTricks
Some tricks of pytorch... :star:
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
OpenMOSS/AnyGPT
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
SpeechColab/GigaSpeech
Large, modern dataset for speech recognition
6drf21e/ChatTTS_Speaker
ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview
thu-coai/CharacterGLM-6B
[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
double22a/speech_dataset
The dataset of Speech Recognition
yangjianxin1/QQMusicSpider
基于Scrapy的QQ音乐爬虫(QQ Music Spider),爬取歌曲信息、歌词、精彩评论等,并且分享了QQ音乐中排名前6400名的内地和港台歌手的49万+的音乐语料
XinhaoMei/WavCaps
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
robmsmt/ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
common-voice/cv-dataset
Metadata and versioning details for the Common Voice dataset
YouTaoBaBa/Chinese-Dialogue-Dataset
用于汇总目前的开源中文对话数据集
0nutation/SpeechAgents
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
xiayongtao/aidatatang_1505zh
abcabc2020/subtitles_search
字幕匹配
lu5je0/SubtitlesDownloader
使用射手字幕网的API,快速下载字幕
USTB-WZL/migu-music-spider
咪咕音乐爬虫,按照类别爬取歌手、歌名、id、歌曲信息,保存到csv文件
youngkuan/subtitle
subtitle downloader using webmagic (使用webmagic爬取字幕网站电影字幕以及相关信息)