Spicybird's Stars
catid/self-discover
Implementation of Google's SELF-DISCOVER
zhangfaen/finetune-Qwen2-VL
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
modelscope/modelscope-classroom
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
HalcyonAzure/lsky-pro-docker
☁️兰空图床(Lsky Pro) - Docker自动构建,支持多平台
Alvin9999/new-pac
翻墙-科学上网、自由上网、免费科学上网、免费翻墙、油管youtube、fanqiang、软件、VPN、一键翻墙浏览器,vps一键搭建翻墙服务器脚本/教程,免费shadowsocks/ss/ssr/v2ray/goflyway账号/节点,翻墙梯子,电脑、手机、iOS、安卓、windows、Mac、Linux、路由器翻墙、科学上网、youtube视频下载、美区apple id共享账号
luohenyueji/Python-Study-Notes
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Martlgap/FaceIDLight
A lightweight face-recognition toolbox and pipeline based on tensorflow-lite
2noise/ChatTTS
A generative speech model for daily dialogue.
codefuse-ai/CodeFuse-muAgent
An Innovative Agent Framework Driven by KG Engine
Maplemx/Agently
[AI Agent Application Development Framework] - 🚀 Build AI agent native application in very few code 💬 Easy to interact with AI agent in code using structure data and chained-calls syntax 🧩 Enhance AI Agent using plugins instead of rebuild a whole new agent
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
LynanBreeze/simple-resume
Generate your own responsive resume page.
CosmosShadow/gptpdf
Using GPT to parse PDF
agiresearch/AIOS
AIOS: LLM Agent Operating System
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
songquanpeng/one-api
OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.
mack-a/v2ray-agent
Xray、Tuic、hysteria2、sing-box 八合一一键脚本
CosmosShadow/GeneralAgent
A python native agent framework
fengwang/LLaMA-Factory-docker
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
CCBP/TangdouDownloader
糖豆广场舞(tangdou.com)视频下载器,可以实现视频的自动下载以及简单的剪辑,以及可以将视频转换为音频格式。
open-webui/open-webui
User-friendly WebUI for AI (Formerly Ollama WebUI)
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
e2b-dev/awesome-ai-agents
A list of AI autonomous agents
LLaVA-VL/LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
PJLab-ADG/awesome-knowledge-driven-AD
A curated list of awesome knowledge-driven autonomous driving (continually updated)