taikai-zz's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
ggerganov/llama.cpp
LLM inference in C/C++
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded and IoT devices)
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
PromtEngineer/localGPT
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
CodePhiliaX/Chat2DB
🔥🔥🔥AI-driven database tool and SQL client, the hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, SQLite, H2, ClickHouse, and more.
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
ggerganov/ggml
Tensor library for machine learning
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
abetlen/llama-cpp-python
Python bindings for llama.cpp
THUDM/CodeGeeX2
CodeGeeX2: A More Powerful Multilingual Code Generation Model
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
v2fly/domain-list-community
Community managed domain list. Generate geosite.dat for V2Ray.
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
divamgupta/stable-diffusion-tensorflow
Stable Diffusion in TensorFlow / Keras
BlackSamorez/tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
swoole/phpy
Connecting the Python and PHP ecosystems together
monatis/clip.cpp
CLIP inference in plain C/C++ with no extra dependencies
SALT-NLP/LLaVAR
Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"
camenduru/LLaVA-colab
hellojixian/StableDiffusionParallelPipeline
This pipeline allows Stable Diffusion to use multiple GPUs to speed up single-image generation
gerardrbentley/easyocr-server
FastAPI + EasyOCR Server