zk1009's Stars
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
YueFan1014/VideoAgent
This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)
mistralai/mistral-common
fuy3/editions
tinyvision/SOLIDER-REID
tinyvision/SOLIDER
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximum extent
breezedeus/CnOCR
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】
Qualcomm-AI-research/FP8-quantization
Zhen-Dong/Awesome-Quantization-Papers
List of papers related to neural network quantization in recent AI conferences and journals.
bharath5673/StrongSORT-YOLO
Real-time multi-camera multi-object tracker using (YOLOv5, YOLOv7,YOLOv8) and StrongSORT with OSNet
billryan/resume
An elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
kenjihiranabe/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Anything-of-anything/Anything-3D
Segment-Anything + 3D. Let's lift anything to 3D.
pku-minic/sysy-cmake-template
Template for CMake based SysY compiler projects.
changh95/visual-slam-roadmap
Roadmap to become a Visual-SLAM developer in 2023
zhengjingwei/machine-learning-interview
算法工程师-机器学习面试题总结
CS-BAOYAN/CS-BAOYAN-2023
mli/paper-reading
深度学习经典、新论文逐段精读
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
IDEA-Research/detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
IDEA-Research/MaskDINO
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
junyanz/CatPapers
Cool vision, learning, and graphics papers on Cats!
guochengqian/Magic123
[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
anl13/animal_papers
Awesome papers for markerless animal motion capture and 3D reconstruction.
threestudio-project/threestudio
A unified framework for 3D content generation.
yangjiheng/nerf_and_beyond_docs
xianglin226/scMDC
Single Cell Multi-omics deep clustering
scverse/scvi-tools
Deep probabilistic analysis of single-cell and spatial omics data