sdpscnc's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
NaiboWang/EasySpider
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
labelmeai/labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
bluenviron/mediamtx
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS media server and media proxy that allows to read, publish, proxy, record and playback video and audio streams.
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
PaddlePaddle/PaddleX
All-in-One Development Tool based on PaddlePaddle(飞桨低代码开发工具)
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
sdushantha/wifi-password
Quickly fetch your WiFi password and if needed, generate a QR code of your WiFi to allow phones to easily connect
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
lyuchenyang/Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
yatengLG/ISAT_with_segment_anything
Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具
latentcat/latentbox
A collection of awesome-lists for AI, creativity and art. AI、创意和艺术领域的精选合集。https://latentbox.com
OpenGVLab/VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
PKU-YuanGroup/Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
PKU-YuanGroup/LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Vision-CAIR/MiniGPT4-video
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
SunOner/sunone_aimbot
Aim-bot based on AI for all FPS games
ibaiGorordo/ONNX-YOLOv8-Object-Detection
Python scripts performing object detection using the YOLOv8 model in ONNX.
phellonchen/X-LLM
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
airockchip/ultralytics_yolov8
NEW - YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite
eProsima/Fast-DDS-python
Python bindings for Fast DDS. Looking for commercial support? Contact info@eprosima.com
DEVILENMO/DeadEye-Auto-Aiming-System
基于Yolo的辅助瞄准系统,高度模块化编写,也可以用于光学控制、监控物体追踪等领域,持续更新中,仅供交流学习使用。An auto-aiming system, or aim bot, which is mainly based on Yolo. Can be easily used in different areas such as optical control, monitoring object tracking. I will keep fixing bugs and adding new features.
atolab/pydds
Python API for DDS
Ton-lee/RTSP-Streaming
Stream H.264 encoded live video (frames) through RTSP by FFmpeg and in Python
yangrudan/FastDDS
python C++