shiyuzh2007

Pinned Repositories

ASR
Language:Python55 5 727
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python10
impala
hbase-clouder-0.94.6
Language:Java1 1 00
jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Language:Jupyter Notebook1 0 00
LLaSM
第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验，同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。
Language:Python1 0 00
PunctuationModel
中文标点符号模型，可以给文本添加标点符号。
Language:Python30
Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials
Language:Python1 0 00
self-llm
《开源大模型食用指南》基于AutoDL快速部署开源大模型，更适合**宝宝的部署教程
Language:Jupyter Notebook10
tensorflow-with-kenlm
Tensorflow with KenLM integrated for beam search scoring
Language:C++1 0 00
VTuberTalk
Language:Python10

shiyuzh2007's Repositories

shiyuzh2007/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
shiyuzh2007/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
shiyuzh2007/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
shiyuzh2007/ParroT
The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.
shiyuzh2007/fast-DiT
Fast Diffusion Models with Transformers
shiyuzh2007/UniAudio
The official source code of UniAudio
shiyuzh2007/bark
🔊 Text-Prompted Generative Audio Model
shiyuzh2007/LLaSM
第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验，同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。
1
shiyuzh2007/FindTheChatGPTer
ChatGPT爆火，开启了通往AGI的关键一步，本项目旨在汇总那些ChatGPT的开源平替们，包括文本大模型、多模态大模型等，为大家提供一些便利
shiyuzh2007/Anything2Image
Generate image from anything with ImageBind and Stable Diffusion
shiyuzh2007/PunctuationModel
中文标点符号模型，可以给文本添加标点符号。
3
shiyuzh2007/multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
shiyuzh2007/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
shiyuzh2007/Whisper-Finetune
微调Whisper语音识别模型和加速推理
shiyuzh2007/faster-whisper
Faster Whisper transcription with CTranslate2
shiyuzh2007/gradio
Create UIs for your machine learning model in Python in 3 minutes
shiyuzh2007/pytorchvideo
A deep learning library for video understanding research.
shiyuzh2007/BELLE
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
shiyuzh2007/FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
shiyuzh2007/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
shiyuzh2007/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
shiyuzh2007/GPT-4-LLM
Instruction Tuning with GPT-4
shiyuzh2007/Plan4MC
Reinforcement learning and planning for Minecraft.
shiyuzh2007/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
shiyuzh2007/InstructGLM
ChatGLM-6B 指令学习|指令数据|Instruct
shiyuzh2007/shap
A game theoretic approach to explain the output of any machine learning model.
shiyuzh2007/PhySO
Physical Symbolic Optimization
shiyuzh2007/pomegranate
Fast, flexible and easy to use probabilistic modelling in Python.
shiyuzh2007/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
shiyuzh2007/lime
Lime: Explaining the predictions of any machine learning classifier