vshanyiao's Stars
lucidrains/magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
dingdang-robot/dingdang-robot
🤖 Dingdang is a Chinese-language voice-dialogue robot / smart speaker project that runs on the Raspberry Pi.
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
adamcohenhillel/ADeus
An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI.
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
netease-youdao/QAnything
Question and Answer based on Anything.
fishaudio/fish-speech
Brand new TTS solution
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
unit-mesh/auto-dev
🧙AutoDev: The AI-powered coding wizard (AI-driven programming assistant) with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀
carlrobertoh/CodeGPT
JetBrains extension providing access to state-of-the-art LLMs, such as GPT-4, Claude 3, Code Llama, and others, all for free
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
tensorflow/serving
A flexible, high-performance serving system for machine learning models
YYuX-1145/Bert-VITS2-Integration-package
vits2 backbone with bert
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
RVC-Boss/GPT-SoVITS
Just 1 minute of voice data can be enough to train a good TTS model! (few-shot voice cloning)
mpc001/auto_avsr
Auto-AVSR: Lip-Reading Sentences Project
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
allenai/papermage
library supporting NLP and CV research on scientific papers
ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
AnyaCoder/Bert-VITS2
vits2 backbone with bert
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
bram2w/baserow
The official repository is hosted on https://gitlab.com/bramw/baserow. Baserow is an open source no-code database tool and Airtable alternative.
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
facebookresearch/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
alibaba/rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
open-compass/MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI