hongdangshao

Hangzhou, Zhejiang Province

Pinned Repositories

BARTNER
Language:Python1 0 00
CLIP-Chinese
Chinese CLIP pretrained model
Language:Python1 0 00
ContinueTrainingBERT
Continue training BERT with transformers on domain-specific corpora
Language:Python10
diffusers-webui
This is a Gradio WebUI working with the Diffusers format of Stable Diffusion (a WebUI implemented with diffusers)
Language:Python1 0 00
diffuzers
a web ui & api for 🤗 diffusers
Language:Python1 0 00
LLM2CLIP
LLM2CLIP makes SOTA pretrained CLIP models more SOTA than ever.
Language:Python1 0 00
onnx-in-NLP
Language:Python1 1 00
QAnything
Question and Answer based on Anything.
Language:Python1 0 00
Quark
Controlled text generation
Language:Python1 0 00
Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration. (super-resolution)
Language:Python1 0 00

hongdangshao's Repositories

hongdangshao/QAnything
Question and Answer based on Anything.
Language:Python1 0 00
hongdangshao/aiortc
WebRTC and ORTC implementation for Python using asyncio (WebRTC stream pushing)
Language:Python0 0
hongdangshao/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation (audio-driven digital human synthesis)
Language:Python0 0
hongdangshao/Bunny
A family of lightweight multimodal models.
hongdangshao/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Language:Jupyter Notebook0 0
hongdangshao/Chinese-LLaMA-Alpaca-3
Chinese LLaMA-Alpaca large language model project, phase 3, developed from Meta Llama 3
0 0
hongdangshao/ConsistentID
Customized ID Consistent for human
hongdangshao/Easy-Wav2Lip
Colab for making Wav2Lip high quality and easy to use
Language:Jupyter Notebook0 0
hongdangshao/EAT_code
(Digital human with emotions) Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".
Language:Python0 0
hongdangshao/edge-tts-ms-
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python0 0
hongdangshao/FreeAskInternet
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generator using MULTI LLMs, without GPU needed. The user can ask a question and the system will run a multi-engine search, pass the combined search results to an LLM, and generate the answer based on those results. It's all FREE to use. (free web search + LLM)
hongdangshao/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
Language:Python0 0
hongdangshao/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language:Python0 0
hongdangshao/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Language:Python0 0
hongdangshao/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
hongdangshao/MACRec
Agent-based recommender system
hongdangshao/metahuman-stream
Real-time streaming digital human based on NeRF
Language:Python0 0
hongdangshao/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
Language:Python0 0
hongdangshao/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
hongdangshao/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python0 0
hongdangshao/python_rtmpstream
Python library for pushing real-time RTMP audio/video streams
Language:C++0 0
hongdangshao/pytvzhen
The fastest way to convert English YouTube videos to Chinese
hongdangshao/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language:Python0 0
hongdangshao/RAGQnASystem
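This project designs a medical question-answering system based on RAG and large-model technology. It builds a knowledge graph from the DiseaseKG dataset with Neo4j, combines BERT-based named entity recognition with intent recognition by a 34B large model, and uses precise knowledge retrieval and answer generation to improve performance in medical consultation and address the reliability problems of large models in the medical domain.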
hongdangshao/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code (image- and audio-driven digital human)
hongdangshao/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
hongdangshao/text-generation-webui
A Gradio web UI for Large Language Models.
hongdangshao/Vach
Real-time streaming talking head
Language:Python0 0
hongdangshao/video-subtitle-remover
AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos, producing subtitle-free and watermark-free files at the original resolution. No third-party API is required; everything runs locally.
hongdangshao/vits-simple-api
A simple VITS HTTP API, developed by extending Moegoe with additional features.