dy1901's Stars
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS/Plugins/Artifacts). One-click FREE deployment of your private ChatGPT/ Claude application.
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Pythagora-io/gpt-pilot
The first real AI developer
hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
danny-avila/LibreChat
Enhanced ChatGPT Clone: Features Agents, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
seemoo-lab/openhaystack
Build your own 'AirTags' 🏷 today! Framework for tracking personal Bluetooth devices via Apple's massive Find My network.
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
MrNeRF/awesome-3D-gaussian-splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
bensadeh/tailspin
🌀 A log file highlighter
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
astriaai/headshots-starter
philz1337x/clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
yerfor/GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
zju3dv/4K4D
[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution
mkkellogg/GaussianSplats3D
Three.js-based implementation of 3D Gaussian splatting
linyiLYi/voice-assistant
A simple toy demo of a local voice assistant with whisper and large language model.
wxywb/history_rag
allwefantasy/auto-coder
x-dr/tts
微软azure文本转语音 音频下载
rongardF/tvdatafeed
A simple TradingView historical Data Downloader
RVC-Project/Retrieval-based-Voice-Conversion
in preparation...
aifartist/ArtSpew
An infinite number of monkeys randomly throwing paint at a canvas
mithril-security/blind_chat
A fully in-browser privacy solution to make Conversational AI privacy-friendly
dave1010/pandora
ChatGPT Coding Unleashed! Pandora gives ChatGPT the ability to read and write files and run commands on your machine.
dhbloo/gomoku-calculator
An easy-to-use gomoku/renju web interface
dy1901/GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code