zk1009

zk1009's Stars

PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Language: Python, 2.9k stars, 208 forks
YueFan1014/VideoAgent
This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)
Language:Python1095
mistralai/mistral-common
Language:Python63357
fuy3/editions
Language:Python1
tinyvision/SOLIDER-REID
Language:Python6212
tinyvision/SOLIDER
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximum extent
Language: Python, 1.9k stars, 344 forks
breezedeus/CnOCR
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. [A Chinese/English OCR Python package based on PyTorch/MXNet.]
Language: Python, 3.2k stars, 498 forks
Qualcomm-AI-research/FP8-quantization
Language:Python1148
Zhen-Dong/Awesome-Quantization-Papers
List of papers related to neural network quantization in recent AI conferences and journals.
42637
bharath5673/StrongSORT-YOLO
Real-time multi-camera multi-object tracker using (YOLOv5, YOLOv7, YOLOv8) and StrongSORT with OSNet
Language:Python27270
billryan/resume
An elegant \LaTeX\ résumé template. Mainland China mirror: https://gods.coding.net/p/resume/git
Language: TeX, 9.2k stars, 2.6k forks
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
Language: Python, 3.8k stars, 442 forks
kenjihiranabe/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Language: PostScript, 17.8k stars, 2.2k forks
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Language: Jupyter Notebook, 14.9k stars, 1.4k forks
Anything-of-anything/Anything-3D
Segment-Anything + 3D. Let's lift anything to 3D.
Language: Python, 1.5k stars, 75 forks
pku-minic/sysy-cmake-template
Template for CMake based SysY compiler projects.
Language:CMake87
changh95/visual-slam-roadmap
Roadmap to become a Visual-SLAM developer in 2023
1.4k stars, 142 forks
zhengjingwei/machine-learning-interview
Algorithm engineer: a summary of machine learning interview questions
1.3k stars, 188 forks
CS-BAOYAN/CS-BAOYAN-2023
Language: HTML, 1k stars, 108 forks
mli/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
26.5k stars, 2.4k forks
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Language: Python, 2.3k stars, 109 forks
IDEA-Research/detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
Language: Python, 2k stars, 206 forks
IDEA-Research/MaskDINO
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
Language: Python, 1.2k stars, 103 forks
junyanz/CatPapers
Cool vision, learning, and graphics papers on Cats!
Language: Python, 1.1k stars, 89 forks
guochengqian/Magic123
[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
Language: Jupyter Notebook, 1.5k stars, 95 forks
anl13/animal_papers
Awesome papers for markerless animal motion capture and 3D reconstruction.
Language:Python23913
threestudio-project/threestudio
A unified framework for 3D content generation.
Language: Python, 6.2k stars, 474 forks
yangjiheng/nerf_and_beyond_docs
1k stars, 36 forks
xianglin226/scMDC
Single Cell Multi-omics deep clustering
Language:Python234
scverse/scvi-tools
Deep probabilistic analysis of single-cell and spatial omics data
Language: Python, 1.2k stars, 344 forks