yousongzhu's Stars
jefferyZhan/Griffon
【ECCV2024】The official repo of Griffon series
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
zhenyuw16/UniDetector
Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
dyabel/detpro
Sense-X/AGVM
Large-batch Optimization for Dense Visual Predictions (NeurIPS 2022)
ksOAn6g5/TaiSu
TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)
FlagAI-Open/FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.
CASIA-IVA-Lab/Obj2Seq
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)
kakaobrain/coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
chufengt/ViRReq
Code for the paper "Visual Recognition by Request".
WongKinYiu/yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
haltakov/natural-language-image-search
Search photos on Unsplash using natural language
cnyvfang/labelGo-Yolov5AutoLabelImg
YOLOV5 semi-automatic annotation tool (Based on labelImg)
facebookresearch/metaseq
Repo for external large-scale work
happycaoyue/JSPL
hustvl/MIMDet
[ICCV 2023] You Only Look at One Partial Sequence
EPFL-VILAB/MultiMAE
MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022
facebookresearch/asym-siam
PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)
Alibaba-MIIL/Solving_ImageNet
Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022)
jina-ai/clip-as-service
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
lucasjinreal/yolov7_d2
🔥🔥🔥🔥 (Earlier YOLOv7 not official one) YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥
xyupeng/ContrastiveCrop
[CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning
google-research/pix2seq
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
amazon-science/bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
volcengine/veGiantModel
zhanxlin/Product1M
Product1M
OpenGVLab/gv-benchmark
General Vision Benchmark, GV-B, a project from OpenGVLab
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.