wbgxx333

Beijing, China

wbgxx333's Stars

f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
Language: HTML 114k 1.5k 015.6k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language: Python 30.2k 217 2543k
opendatalab/MinerU
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats.
Language: Python 22.1k 115 7761.6k
fishaudio/fish-speech
SOTA Open Source TTS
Language: Python 17.8k 110 4721.3k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Language: Jupyter Notebook 15.5k 115 3951.4k
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
13.3k 257 128839
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
Language: Python 12k 122 3541.1k
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)
Language: HTML 8k 108 442762
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language: Python 7k 55 2071.3k
wgwang/awesome-LLMs-In-China
** large models
5.7k 107 27474
modelscope/FunClip
Open-source, accurate, and easy-to-use video speech recognition and clipping tool, with LLM-based AI clipping integrated.
Language: Python 3.9k 37 98438
nghuyong/WeiboSpider
An actively maintained Sina Weibo scraping tool 🚀🚀🚀
Language: Python 3.7k 68 315831
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language: Python 3.5k 58 71310
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.7k 50 3175
taishi-i/awesome-ChatGPT-repositories
A curated list of resources dedicated to open source GitHub repositories related to ChatGPT
2.2k 57 12256
lifeiteng/vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
Language: Python 2.1k 49 127323
TapXWorld/ChinaTextbook
All PDF textbooks for primary school, junior high school, senior high school, and university.
Language: Roff 1.9k 35 16486
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
Language: Python 1.3k 46 5587
PyThaiNLP/pythainlp
Thai natural language processing in Python
Language: Python 994 46 367273
jiayev/GPT4V-Image-Captioner
Language: Python 799 13 5358
jianzhnie/awesome-instruction-datasets
A collection of awesome-prompt-datasets and awesome-instruction-datasets for training chat LLMs such as ChatGPT. It gathers a wide variety of instruction datasets for training ChatLLM models.
570 6 031
facebookresearch/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Language: Python 489 15 2361
Yuliang-Liu/MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
Language: Python 485 15 3032
langgptai/Awesome-Multimodal-Prompts
Prompts for GPT-4V & DALL-E3 to fully utilize their multimodal abilities. GPT4V Prompts, DALL-E3 Prompts.
229 2 016
CLUEbenchmark/SuperCLUElyb
SuperCLUE Langya Leaderboard: an anonymous head-to-head evaluation benchmark for general-purpose Chinese large models
143 5 76
micbuffa/WasabiDataset
Repo for the Wasabi datasets
Language: Jupyter Notebook 102 6 510
FreedomIntelligence/MLLM-Bench
MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria
Language: Python 58 10 33
R1ckShi/AESRC2020
[ICASSP2021] Data preparation scripts, training pipeline, and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).
Language: Python 55 4 418
attapol/tltk
Thai Language Toolkit
Language: Python 24 2 75
ag1988/mel-asr
The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George Saon, Brian Kingsbury. Interspeech 2024).
Language: Python 15 6 20
