Pinned Repositories
ai-research-code
aishell-3-baseline-fc
The code for aishell-3 baseline acoustic model
aistplusplus_api
API to support AIST++ Dataset: https://google.github.io/aistplusplus_dataset
ChatGLM-Finetuning
基于ChatGLM-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning等
cyclevae-vc-neuralvoco
MNET
Official PyTorch implementation of the paper "A Brand New Dance Partner:Music-Conditioned Pluralistic Dancing Synthesized by Multiple Dance Genres", CVPR 2022
onnx_runtime_cpp
small c++ library to quickly deploy models using onnxruntime
PoseGPT
reward-modeling
seeprettyface-face_editor
这是一个基于StyleGAN的人脸属性编辑器
zhangsanfeng86's Repositories
zhangsanfeng86/AMchat
AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics, and their solutions. AM (Advanced Mathematics) chat 高等数学大模型。一个集成数学知识和高等数学习题及其解答的大语言模型。
zhangsanfeng86/ASR-2Pass
ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).
zhangsanfeng86/AudioClassification-Pytorch
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
zhangsanfeng86/camel
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society (NeruIPS'2023) https://www.camel-ai.org
zhangsanfeng86/curobo
CUDA Accelerated Robot Library
zhangsanfeng86/funasr_seaco_paraformer_onnx_with_timestamp
修复funasr中seaco-paraformer导出onnx后没有时间戳的bug
zhangsanfeng86/gdGPT
Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.
zhangsanfeng86/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
zhangsanfeng86/JoyHallo
JoyHallo: Digital human model for Mandarin
zhangsanfeng86/KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
zhangsanfeng86/PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
zhangsanfeng86/parler-tts
Inference and training library for high-quality TTS models.
zhangsanfeng86/qwen1.5-convertor
export qwen1.5 to onnx or tflite
zhangsanfeng86/robotics-transformer-x.github.io
zhangsanfeng86/RT-2
Democratization of RT-2 "RT-2: New model translates vision and language into action"
zhangsanfeng86/RT-X
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
zhangsanfeng86/RVT
Official Code for RVT: Robotic View Transformer for 3D Object Manipulation
zhangsanfeng86/SadTalkerTriton
zhangsanfeng86/SenseVoice
Multilingual Voice Understanding Model
zhangsanfeng86/SenseVoice.cpp
Port of Funasr's Sense-voice model in C/C++
zhangsanfeng86/sherpa-onnx
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter
zhangsanfeng86/streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
zhangsanfeng86/transvip
zhangsanfeng86/Ultralight-Digital-Human
一个超轻量级、可以在移动端实时运行的数字人模型
zhangsanfeng86/VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
zhangsanfeng86/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
zhangsanfeng86/wav2lip384
zhangsanfeng86/wav2lip_data_preprocessing
zhangsanfeng86/YeAudio
Python的音频工具
zhangsanfeng86/YouDub-webui