ZF1546's Stars
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
2noise/ChatTTS
A generative speech model for daily dialogue.
Stability-AI/generative-models
Generative Models by Stability AI
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
mayooear/gpt4-pdf-chatbot-langchain
GPT4 & LangChain Chatbot for large PDF docs
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
aigc-apps/sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
SizheAn/PanoHead
Code Repository for CVPR 2023 Paper "PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360 degree"
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
yakami129/VirtualWife
VirtualWife是一个虚拟数字人项目,支持B站直播,支持openai、ollama
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
PantoMatrix/PantoMatrix
PantoMatrix: Co-Speech Talking Head and Gestures Generation
apachecn/apachecn-dl-zh
ApacheCN 深度学习译文集
yeyupiaoling/VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
fighting41love/zhvoice
Chinese voice corpus. 中文语音语料,语音更加清晰自然,包含8个开源数据集,3200个说话人,900小时语音,1300万字。
RichardoMrMu/yolov5-deepsort-tensorrt
A c++ implementation of yolov5 and deepsort
IHe-KaiI/DressCode
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance.
jiaxilv/GPT4Motion
xiaoyou-bilibili/voice_recognize
声纹识别(动漫声优识别)
JiejiangWu/FaceG2E
Official code for CVPR2024 paper "text-guided 3d face synthesis - from generation to editing"