Pinned Repositories
BARTNER
CLIP-Chinese
中文CLIP预训练模型
ContinueTrainingBERT
Continue Training BERT with transformers 在垂直领域的预料下继续训练BERT
diffusers-webui
This is a Gradio WebUI working with the Diffusers format of Stable Diffusion(diffusers实现的webui)
diffuzers
a web ui & api for 🤗 diffusers
LLM2CLIP
LLM2CLIP makes SOTA pretrained CLIP model more SOTA ever.
onnx-in-NLP
QAnything
Question and Answer based on Anything.
Quark
控制文本生成
Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.(超分辨率)
hongdangshao's Repositories
hongdangshao/LLM2CLIP
LLM2CLIP makes SOTA pretrained CLIP model more SOTA ever.
hongdangshao/agents
Build real-time multimodal AI applications 🤖🎙️📹
hongdangshao/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
hongdangshao/BCEmbedding
Netease Youdao's open-source embedding and reranker models for RAG products.(网易开源的支持中英双语嵌入模型)
hongdangshao/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
hongdangshao/DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
hongdangshao/DH_live
每个人都能用的数字人
hongdangshao/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
hongdangshao/elasticsearch
Free and Open, Distributed, RESTful Search Engine
hongdangshao/examples
hongdangshao/fish-speech
Brand new TTS solution
hongdangshao/Freeze-Omni
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
hongdangshao/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
hongdangshao/GLM-4-Voice
GLM-4-Voice | 端到端中英语音对话模型
hongdangshao/GPU-Puzzles
Solve puzzles. Learn CUDA.(CUDA编程)
hongdangshao/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
hongdangshao/kotaemon
An open-source RAG-based tool for chatting with your documents.
hongdangshao/lagent
A lightweight framework for building LLM-based agents(轻量级agent框架)
hongdangshao/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"(GraphRAG的轻量级解决方案)
hongdangshao/LitServe
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
hongdangshao/LivePortrait
Make one portrait alive!
hongdangshao/LLaMA-O1
Large Reasoning Models
hongdangshao/MaxKB
🚀 基于 LLM 大语言模型的知识库问答系统。开箱即用、模型中立、灵活编排,支持快速嵌入到第三方业务系统。
hongdangshao/MinerU
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
hongdangshao/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model(端到端大模型)
hongdangshao/pipecat
Open Source framework for voice and multimodal conversational AI
hongdangshao/searxng
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
hongdangshao/SenseVoice
Multilingual Voice Understanding Model(ASR)
hongdangshao/swarm
Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.(OpenAI多智能体框架)
hongdangshao/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production