vshanyiao

vshanyiao's Stars

lucidrains/magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
Language:Python53735
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
Language:Python93842
dingdang-robot/dingdang-robot
🤖 叮当是一款可以工作在 Raspberry Pi 上的中文语音对话机器人/智能音箱项目。
Language:Python1.3k404
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language:Python7.2k616
adamcohenhillel/ADeus
An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI.
Language:TypeScript2.9k290
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Language:HTML8.4k693
netease-youdao/QAnything
Question and Answer based on Anything.
Language:Python11.4k1.1k
fishaudio/fish-speech
Brand new TTS solution
Language:Python10.6k829
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
Language:Python1.3k72
unit-mesh/auto-dev
🧙‍AutoDev: The AI-powered coding wizard（AI 驱动编程助手） with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀
Language:Kotlin2.7k315
carlrobertoh/CodeGPT
JetBrains extension providing access to state-of-the-art LLMs, such as GPT-4, Claude 3, Code Llama, and others, all for free
Language:Java983206
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Language:Jupyter Notebook13.2k3.2k
tensorflow/serving
A flexible, high-performance serving system for machine learning models
Language:C++6.2k2.2k
YYuX-1145/Bert-VITS2-Integration-package
vits2 backbone with bert
Language:Python33230
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Language:Python10.8k838
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python32.5k3.8k
mpc001/auto_avsr
Auto-AVSR: Lip-Reading Sentences Project
Language:Python16240
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language:Python11.4k950
allenai/papermage
library supporting NLP and CV research on scientific papers
Language:Python66852
ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
Language:Python1.8k217
KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Language:Jupyter Notebook2.7k390
AnyaCoder/Bert-VITS2
vits2 backbone with bert
Language:Python848
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language:Python7.8k1.1k
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
Language:Python1.8k243
bram2w/baserow
The official repository is hosted on https://gitlab.com/bramw/baserow. Baserow is an open source no-code database tool and Airtable alternative.
Language:Python2.3k261
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:Python7.6k444
facebookresearch/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
Language:Python2.7k250
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language:Python28.4k2.8k
alibaba/rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Language:C++51047
open-compass/MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
Language:Python76381