taikai-zz

taikai-zz's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python70.4k 574 08.3k
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++67.1k 551 3.9k9.6k
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language:Python43.8k 444 9.3k7.8k
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
Language:Python40.4k 328 3.7k5.3k
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook35.9k 329 4414.2k
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C++35.3k 312 1.4k3.6k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python35k 289 1.1k4.3k
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Language:Python24.3k 316 9983.1k
PromtEngineer/localGPT
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
Language:Python20k 166 5392.2k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python20k 157 1.5k2.2k
CodePhiliaX/Chat2DB
🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
Language:Java15.4k 104 1k1.7k
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python13.9k 103 1.1k1.1k
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language:Jupyter Notebook13.1k 174 5171.8k
ggerganov/ggml
Tensor library for machine learning
Language:C++11.1k 128 4121k
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Language:Python10.6k 172 6622.3k
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook9.8k 99 661967
abetlen/llama-cpp-python
Python bindings for llama.cpp
Language:Python8k 73 1.1k951
THUDM/CodeGeeX2
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Language:Python7.6k 65 249532
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language:Jupyter Notebook7k 59 138481
v2fly/domain-list-community
Community managed domain list. Generate geosite.dat for V2Ray.
Language:Go4.9k 61 241889
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Language:Python2.9k 28 183212
ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
Language:Python2.1k 30 156374
divamgupta/stable-diffusion-tensorflow
Stable Diffusion in TensorFlow / Keras
Language:Python1.6k 25 49228
BlackSamorez/tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
Language:Python623 8 6640
swoole/phpy
Connecting the Python and PHP ecosystems together
Language:Python536 14 3744
monatis/clip.cpp
CLIP inference in plain C/C++ with no extra dependencies
Language:C++451 16 5130
SALT-NLP/LLaVAR
Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"
Language:Python257 5 2112
camenduru/LLaVA-colab
Language:Jupyter Notebook207 6 631
hellojixian/StableDiffusionParallelPipeline
this pipeline allow stable diffusion to use multi-GPU resources to speed up single image generation
Language:Python23 1 32
gerardrbentley/easyocr-server
FastAPI + EasyOCR Server
Language:Python7 3 11