Pinned Repositories
ASR
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
impala
hbase-clouder-0.94.6
jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
LLaSM
第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验,同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。
PunctuationModel
中文标点符号模型,可以给文本添加标点符号。
Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials
self-llm
《开源大模型食用指南》基于AutoDL快速部署开源大模型,更适合**宝宝的部署教程
tensorflow-with-kenlm
Tensorflow with KenLM integrated for beam search scoring
VTuberTalk
shiyuzh2007's Repositories
shiyuzh2007/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
shiyuzh2007/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
shiyuzh2007/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
shiyuzh2007/ParroT
The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.
shiyuzh2007/fast-DiT
Fast Diffusion Models with Transformers
shiyuzh2007/UniAudio
The official source code of UniAudio
shiyuzh2007/bark
🔊 Text-Prompted Generative Audio Model
shiyuzh2007/LLaSM
第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验,同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。
shiyuzh2007/FindTheChatGPTer
ChatGPT爆火,开启了通往AGI的关键一步,本项目旨在汇总那些ChatGPT的开源平替们,包括文本大模型、多模态大模型等,为大家提供一些便利
shiyuzh2007/Anything2Image
Generate image from anything with ImageBind and Stable Diffusion
shiyuzh2007/PunctuationModel
中文标点符号模型,可以给文本添加标点符号。
shiyuzh2007/multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
shiyuzh2007/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
shiyuzh2007/Whisper-Finetune
微调Whisper语音识别模型和加速推理
shiyuzh2007/faster-whisper
Faster Whisper transcription with CTranslate2
shiyuzh2007/gradio
Create UIs for your machine learning model in Python in 3 minutes
shiyuzh2007/pytorchvideo
A deep learning library for video understanding research.
shiyuzh2007/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
shiyuzh2007/FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
shiyuzh2007/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
shiyuzh2007/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
shiyuzh2007/GPT-4-LLM
Instruction Tuning with GPT-4
shiyuzh2007/Plan4MC
Reinforcement learning and planning for Minecraft.
shiyuzh2007/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
shiyuzh2007/InstructGLM
ChatGLM-6B 指令学习|指令数据|Instruct
shiyuzh2007/shap
A game theoretic approach to explain the output of any machine learning model.
shiyuzh2007/PhySO
Physical Symbolic Optimization
shiyuzh2007/pomegranate
Fast, flexible and easy to use probabilistic modelling in Python.
shiyuzh2007/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
shiyuzh2007/lime
Lime: Explaining the predictions of any machine learning classifier