Robin021's Stars
ZhengPeng7/BiRefNet
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
adithya-s-k/marker-api
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
FunAudioLLM/FunAudioLLM-APP
lhl/voicechat2
Local SRT/LLM/TTS Voicechat
atlasunified/Templates-ComfyUI-
Templates to view the variety of a prompt based on the samplers available in ComfyUI. Variety of sizes, with singular-seed and random-seed templates.
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
songquanpeng/message-pusher
Build your own message push service. Supports multiple push methods and Markdown, is written in Golang as a single executable, and works out of the box.
VisActor/VMind
Not only automatic, but also intelligent. An Intelligent data Visualization System, based on LLM.
VisActor/VChart
VChart is not just a cross-platform charting library, but also an expressive data storyteller.
FullStackWithLawrence/aws-openai
Example ChatGPT chatbots using Langchain and OpenAI
Menghuan1918/pdfdeal
A Python wrapper for the Doc2X API that also comes with native PDF processing (to improve PDF recall in RAG).
hzeyuan/bookmarksAI
GPT automatically organizes your browser bookmarks
KwaiVGI/LivePortrait
Bring portraits to life!
Flomp/wanderer
wanderer is a self-hosted trail database. Save your adventures!
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
ruanyf/weekly
Tech Enthusiast Weekly, published every Friday
rtvi-ai/rtvi-web-demo
Example UI implementing the RTVI web client
black-forest-labs/flux
Official inference repo for FLUX.1 models
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
opendatalab/MinerU
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
mem0ai/mem0
The Memory layer for your AI apps
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
babysor/MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
modelscope/modelscope-classroom
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
BadToBest/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Chanzhaoyu/chatgpt-web
A ChatGPT demo web page built with Express and Vue3