shuyikong

shuyikong's Stars

milvus-io/milvus-helm
The helm chart to deploy Milvus
Language:Mustache8177
TXH-mercury/VAST
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Language:Jupyter Notebook24517
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Language:Python888125
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language:Python1.4k89
OpenGVLab/InternVideo2
2122
BradyFU/Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
41313
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:Python4.8k433
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python6.2k481
ktrk115/const_layout
Official implementation of the MM'21 paper "Constrained Graphic Layout Generation via Latent Optimization" (LayoutGAN++, CLG-LO, and Layout evaluation)
Language:Python13827
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
Language:Go31.1k3k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
12.9k825
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Language:Python12.9k2.9k
PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Language:Python12.2k2.9k
zilliztech/VectorDBBench
A Benchmark Tool for VectorDB
Language:Python567156
zilliztech/milvus-helm
Language:Smarty6043
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language:Python44.8k7.9k
sstzal/STAR-FC
The implementation of the CVPR2021 paper "Structure-Aware Face Clustering on a Large-Scale Graph with 10^7 Nodes"
Language:Python10015
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language:Python8.9k569
xingyizhou/ExtremeNet
Bottom-up Object Detection by Grouping Extreme and Center Points
Language:Python1k174
WongKinYiu/yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Language:Jupyter Notebook13.4k4.2k
HumanSignal/labelImg
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
Language:Python22.9k6.3k
airob0t/idcardgenerator
身份证图片生成工具 generate an id card picture
Language:Python1.4k465
danielgatis/rembg
Rembg is a tool to remove images background
Language:Python17.2k1.9k
debidatta/syndata-generation
Code used to generate synthetic scenes and bounding box annotations for object detection. This was used to generate data used in the Cut, Paste and Learn paper
Language:Python28972
CompVis/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook68.6k10.2k
Huntersxsx/GAN-Learning
李宏毅老师GAN的课程作业
Language:Python62
lucidrains/stylegan2-pytorch
Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
Language:Python3.7k588
namdo281/SynthText
Language:C++41
ankush-me/SynthText
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
Language:Python2k622
WenmuZhou/OCR_DataSet
收集并整理有关OCR的数据集并统一标注格式，以便实验需要
Language:Python883191