moyans's Stars
ultralytics/ultralytics
Ultralytics YOLO11 🚀
2noise/ChatTTS
A generative speech model for daily dialogue.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
harry0703/MoneyPrinterTurbo
Generate high-definition short videos with one click using AI large language models.
SocialSisterYi/bilibili-API-collect
A collected and organized set of Bilibili APIs (continuously updated).
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI algorithm for making ID photos.
KwaiVGI/LivePortrait
Bring portraits to life!
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
AutoGPTQ/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
karpathy/build-nanogpt
Video+code lecture on building nanoGPT from scratch
modelscope/data-juicer
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Ikaros-521/AI-Vtuber
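AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda (闻达)/Qwen (千问)/kimi/ollama]. It can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine. It uses TTS technology [edge-tts/VITS/elevenlabs/bark/bert-vits2/Reecho (睿声)] to generate spoken replies, with optional voice conversion via [so-vits-svc/DDSP-SVC]; commands can also trigger image generation with SD.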
LLaVA-VL/LLaVA-NeXT
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
kijai/ComfyUI-SUPIR
SUPIR upscaling wrapper for ComfyUI
wangzhaode/mnn-llm
LLM deployment project based on MNN.
biubug6/Face-Detector-1MB-with-landmark
1 MB face detection model (with landmarks)
ninehills/blog
CBLUEbenchmark/CBLUE
CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark for Chinese medical information processing
derronqi/yolov8-face
YOLOv8 face detection with landmarks
sparkfish/augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Gmgge/TrOCR-Seal-Recognition
Transformer-based OCR, extended to official seal (stamp) recognition
TencentARC/mllm-npu
mllm-npu: training multimodal large language models on Ascend NPUs
DanielSarmiento04/yolov10cpp
Implementation of YOLOv10 in C++17 using OpenCV and ONNX Runtime
up-up-up-up/yolov5_Monocular_ranging
FeiGeChuanShu/ncnn_ppstructure
PP-Structure deployment with ncnn
hpc203/CoupledTPS-opencv-dnn
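Deploys CoupledTPS with OpenCV, covering three models: portrait correction, rectangling of images with irregular boundaries, and rotated-image correction. As before, both C++ and Python versions are included.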
Ucas-HaoranWei/Fox
Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"