zhangsanfeng86

Pinned Repositories

ai-research-code
Language:Python0 0 00
aishell-3-baseline-fc
The code for aishell-3 baseline acoustic model
Language:Jupyter Notebook0 0 00
aistplusplus_api
API to support AIST++ Dataset: https://google.github.io/aistplusplus_dataset
Language:Python0 0 00
ChatGLM-Finetuning
Fine-tuning of the ChatGLM-6B model for specific downstream tasks, covering Freeze, LoRA, P-tuning, and other methods
Language:Python0 0 00
cyclevae-vc-neuralvoco
Language:Python1 0 00
MNET
Official PyTorch implementation of the paper "A Brand New Dance Partner: Music-Conditioned Pluralistic Dancing Synthesized by Multiple Dance Genres", CVPR 2022
Language:Python1 0 00
onnx_runtime_cpp
Small C++ library to quickly deploy models using onnxruntime
Language:C++1 0 00
PoseGPT
Language:Python0 0 00
reward-modeling
Language:Python1 0 00
seeprettyface-face_editor
A StyleGAN-based face attribute editor
Language:Python1 0 00

zhangsanfeng86's Repositories

zhangsanfeng86/AMchat
AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics, and their solutions.
Language:Python0 0
zhangsanfeng86/ASR-2Pass
ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR).
Language:HTML0 0
zhangsanfeng86/AudioClassification-Pytorch
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
Language:Python0 0
zhangsanfeng86/camel
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society (NeurIPS 2023) https://www.camel-ai.org
Language:Python0 0
zhangsanfeng86/curobo
CUDA Accelerated Robot Library
Language:Python0 0
zhangsanfeng86/funasr_seaco_paraformer_onnx_with_timestamp
Fixes the bug in funasr where seaco-paraformer has no timestamps after being exported to ONNX
Language:Python0 0
zhangsanfeng86/gdGPT
Train LLMs (bloom, llama, baichuan2-7b, chatglm3-6b) with DeepSpeed pipeline mode. Faster than ZeRO/ZeRO++/FSDP.
zhangsanfeng86/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Language:Python0 0
zhangsanfeng86/JoyHallo
JoyHallo: Digital human model for Mandarin
zhangsanfeng86/KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
Language:Python0 0
zhangsanfeng86/PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
zhangsanfeng86/parler-tts
Inference and training library for high-quality TTS models.
Language:Python0 0
zhangsanfeng86/qwen1.5-convertor
Export Qwen1.5 to ONNX or TFLite
Language:Python0 0
zhangsanfeng86/robotics-transformer-x.github.io
Language:HTML0 0
zhangsanfeng86/RT-2
Democratization of RT-2 "RT-2: New model translates vision and language into action"
Language:Python0 0
zhangsanfeng86/RT-X
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
Language:Python0 0
zhangsanfeng86/RVT
Official Code for RVT: Robotic View Transformer for 3D Object Manipulation
Language:Python0 0
zhangsanfeng86/SadTalkerTriton
Language:Python0 011
zhangsanfeng86/SenseVoice
Multilingual Voice Understanding Model
Language:Python0 0
zhangsanfeng86/SenseVoice.cpp
Port of FunASR's SenseVoice model in C/C++
Language:C0 0
zhangsanfeng86/sherpa-onnx
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter
Language:C++0 0
zhangsanfeng86/streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
zhangsanfeng86/transvip
zhangsanfeng86/Ultralight-Digital-Human
An ultra-lightweight digital human model that can run in real time on mobile devices
Language:Python0 0
zhangsanfeng86/VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
Language:Python0 0
zhangsanfeng86/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0
zhangsanfeng86/wav2lip384
Language:Python0 0
zhangsanfeng86/wav2lip_data_preprocessing
Language:Python0 0
zhangsanfeng86/YeAudio
Audio tools for Python
Language:Python0 0
zhangsanfeng86/YouDub-webui
Language:Python0 0