Pinned Repositories
BARTNER
CLIP-Chinese
中文CLIP预训练模型
ContinueTrainingBERT
Continue Training BERT with transformers 在垂直领域的预料下继续训练BERT
diffusers-webui
This is a Gradio WebUI working with the Diffusers format of Stable Diffusion(diffusers实现的webui)
diffuzers
a web ui & api for 🤗 diffusers
LLM2CLIP
LLM2CLIP makes SOTA pretrained CLIP model more SOTA ever.
onnx-in-NLP
QAnything
Question and Answer based on Anything.
Quark
控制文本生成
Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.(超分辨率)
hongdangshao's Repositories
hongdangshao/QAnything
Question and Answer based on Anything.
hongdangshao/aiortc
WebRTC and ORTC implementation for Python using asyncio(WebRTC推流)
hongdangshao/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation(声音驱动数字人合成)
hongdangshao/Bunny
A family of lightweight multimodal models. (多模态模型)
hongdangshao/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
hongdangshao/Chinese-LLaMA-Alpaca-3
中文羊驼大模型三期项目 developed from Meta Llama 3
hongdangshao/ConsistentID
Customized ID Consistent for human
hongdangshao/Easy-Wav2Lip
Colab for making Wav2Lip high quality and easy to use
hongdangshao/EAT_code
(带情绪的数字人)Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".
hongdangshao/edge-tts-ms-
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
hongdangshao/FreeAskInternet
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a multi engine search and combine the search result to LLM and generate the answer based on search results. It's all FREE to use. (免费联网搜索+LLM)
hongdangshao/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
hongdangshao/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
hongdangshao/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
hongdangshao/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
hongdangshao/MACRec
基于智能体的推荐系统
hongdangshao/metahuman-stream
Real time streaming digital human based on nerf(实时流数字人)
hongdangshao/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
hongdangshao/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
hongdangshao/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
hongdangshao/python_rtmpstream
python库,实现推送实时rtmp音视频流
hongdangshao/pytvzhen
最快油管英文视频转中文
hongdangshao/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
hongdangshao/RAGQnASystem
本项目设计了一个基于 RAG 与大模型技术的医疗问答系统,利用 DiseaseKG 数据集与 Neo4j 构 建知识图谱,结合 BERT 的命名实体识别和 34b 大模型的意图识别,通过精确的知识检索和问答生成, 提升系统在医疗咨询中的性能,解决大模型在医疗领域应用的可靠性问题。
hongdangshao/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code(图片和声音驱动数字人)
hongdangshao/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
hongdangshao/text-generation-webui
A Gradio web UI for Large Language Models.
hongdangshao/Vach
Real time streaming talking head
hongdangshao/video-subtitle-remover
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
hongdangshao/vits-simple-api
A simple VITS HTTP API, developed by extending Moegoe with additional features.