ZJLin2oo1's Stars
ybchen97/soc_cluster_guide
Quick guide to using GPUs and PyTorch on NUS SoC Compute Cluster
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Mashiro009/slidespeech_dl
datawhalechina/llm-universe
本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/
huggingface/course
The Hugging Face course on Transformers
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
LiuyinYang1101/Sea-Wave
Official Implementation of ICASSP 2023 EEG-Auditory Challenge Regression Task: Sea-Wave
xiciliu/Awesome-ChatTTS-2
官方推荐的 ChatTTS 最佳入门指南,整理和汇总了常见问题和相关资源
6drf21e/ChatTTS_colab
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
libukai/Awesome-ChatTTS
官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
jimbozhang/yesno-example-for-undergraduates
wzpan/wukong-robot
🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。
superYong2020/eeg_attention
神念脑电专注度算法
exporl/auditory-eeg-challenge-2024-code
bbaaii/DreamDiffusion
Implementation of “DreamDiffusion: Generating High-Quality Images from Brain EEG Signals”