lyming531's Stars
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
wailsapp/wails
Create beautiful applications using Go
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
X-PLUG/mPLUG-Owl
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
PaddlePaddle/Research
novel deep learning research works with PaddlePaddle
Yuliang-Liu/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
sipsorcery-org/sipsorcery
A WebRTC, SIP and VoIP library for C# and .NET. Designed for real-time communications apps.
hszhao/semseg
Semantic Segmentation in Pytorch
FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
tianrun-chen/SAM-Adapter-PyTorch
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
NirAharon/BoT-SORT
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
open-compass/MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
agsh/onvif
ONVIF node.js implementation
henghuiding/ReLA
[CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation
apple/ml-aim
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
dvlab-research/LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
HarborYuan/ovsam
[arXiv preprint] The official code of paper "Open-Vocabulary SAM".
PaddlePaddle/Paddle3D
A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D object detection models.
SunzeY/AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
quatanium/python-onvif
ONVIF Client Implementation in Python
shenyunhang/APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
airockchip/rknn-toolkit2
use-go/onvif
full and enhanced onvif protocol stack in golang.
HuKai97/YOLOv5-LPRNet-Licence-Recognition
使用YOLOv5和LPRNet进行车牌检测+识别(CCPD数据集)
orhir/PoseAnything
opendatalab/labelU
Data annotation toolbox supports image, audio and video data.
Stability-AI/StableCode
Code Assistance/ Developer Productivity suite of Models