lyming531

lyming531's Stars

coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python30.6k 267 1k3.7k
wailsapp/wails
Create beautiful applications using Go
Language:Go23k 140 1.6k1.1k
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python12k 96 1k973
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python8.1k 128 1k1.3k
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language:Jupyter Notebook4.4k 34 188580
pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Language:Python2.4k 73 914635
X-PLUG/mPLUG-Owl
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
Language:Python2k 26 204155
PaddlePaddle/Research
novel deep learning research works with PaddlePaddle
Language:Python1.7k 48 149794
Yuliang-Liu/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Language:Python1.5k 19 87103
sipsorcery-org/sipsorcery
A WebRTC, SIP and VoIP library for C# and .NET. Designed for real-time communications apps.
Language:C#1.4k 68 669409
hszhao/semseg
Semantic Segmentation in Pytorch
Language:Python1.3k 21 84244
FoundationVision/GLEE
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Language:Python939 44 3175
tianrun-chen/SAM-Adapter-PyTorch
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
Language:Python834 10 7372
NirAharon/BoT-SORT
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
Language:Jupyter Notebook819 12 92413
open-compass/MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
Language:Python763 9 1681
agsh/onvif
ONVIF node.js implementation
Language:JavaScript675 47 178230
henghuiding/ReLA
[CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation
Language:Python653 5 2315
apple/ml-aim
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
Language:Python647 20 540
dvlab-research/LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
Language:Python609 11 9438
HarborYuan/ovsam
[arXiv preprint] The official code of paper "Open-Vocabulary SAM".
Language:Python609 13 2221
PaddlePaddle/Paddle3D
A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D object detection models.
Language:Python547 18 198136
SunzeY/AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Language:Jupyter Notebook539 10 3828
quatanium/python-onvif
ONVIF Client Implementation in Python
Language:Python464 40 102308
shenyunhang/APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
Language:Python445 6 4927
airockchip/rknn-toolkit2
Language:C419 12 5949
use-go/onvif
full and enhanced onvif protocol stack in golang.
Language:Go376 16 28178
HuKai97/YOLOv5-LPRNet-Licence-Recognition
使用YOLOv5和LPRNet进行车牌检测+识别（CCPD数据集）
Language:Python362 4 973
orhir/PoseAnything
Language:Python271 4 1015
opendatalab/labelU
Data annotation toolbox supports image, audio and video data.
Language:Python195 9 1826
Stability-AI/StableCode
Code Assistance/ Developer Productivity suite of Models
Language:Jupyter Notebook117 5 113