chen-si-jia

PhD，Huazhong University of Science and Technology

Huazhong University of Science and Technology

chen-si-jia's Stars

HZAI-ZJNU/Mamba-YOLO
the official pytorch implementation of “Mamba-YOLO：SSMs-based for Object Detection”
Language:Python27734
Boyiliee/LLaDA-AV
Driving Everywhere with Large Language Model Policy Adaptation
91
FeipengMa6/VLoRA
[NeurIPS 2024] Visual Perception by Large Language Model’s Weights
Language:Python281
Fei-Long121/DeepBDC
The Pytorch code of "Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification", CVPR 2022 (Oral).
Language:Python17225
UARK-AICV/TrackGUI
Language:Python10
Mark12Ding/SAM2Long
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
Language:Jupyter Notebook28310
chengche6230/ReST
[ICCV 2023] ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking
Language:Python14315
hou-yz/MVDet
[ECCV 2020] Codes and MultiviewX dataset for "Multiview Detection with Feature Perspective Transformation".
Language:Python16930
sungonce/SENet
Official PyTorch Implementation of Revisiting Self-Similarity: Structural Embedding for Image Retrieval, CVPR 2023
Language:Python622
YehLi/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Language:Python1k111
PyRetri/PyRetri
Open source deep learning based unsupervised image retrieval toolbox built on PyTorch🔥
Language:Python1.2k179
willard-yuan/awesome-cbir-papers
📝Awesome and classical image retrieval papers
1.7k293
abewley/sort
Simple, online, and realtime tracking of multiple objects in a video sequence.
Language:Python4k1.1k
WangzcBruce/DHD
Language:Python335
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Language:Python12.8k2.9k
Ruzim/NSFC-application-template-latex
国家自然科学基金申请书正文（面上项目）LaTeX 模板（非官方）
Language:TeX846207
Zplusdragon/CION_ReIDZoo
[NeurIPS2024] Cross-video Identity Correlating for Person Re-identification Pre-training
Language:Python653
Nightmare-n/DepthAnyVideo
Depth Any Video with Scalable Synthetic Data
Language:Python39626
Datacastle-Algorithm-Department/images
31
timesler/facenet-pytorch
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
Language:Python4.5k952
NExT-ChatV/NExT-Chat
The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".
Language:Python2188
ml-research/deictic-segment-anything
Segment Anything with Deictic Prompting
Language:Python201
vision4robotics/PRL-Track
Language:Python258
ayesha-ishaq/Open3DTrack
Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
Language:Python191
jyrao/MatchTime
[EMNLP 2024 Oral] MatchTime: Towards Automatic Soccer Game Commentary Generation
Language:Python413
leiurayer/downkyi
哔哩下载姬downkyi，哔哩哔哩网站视频下载工具，支持批量下载，支持8K、HDR、杜比视界，提供工具箱（音视频提取、去水印等）。
Language:C#21.2k2.3k
academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript12.4k43.9k
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Python3.1k189
mbzuai-oryx/GeoChat
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
Language:Python44636
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language:Python5k385

chen-si-jia

chen-si-jia's Stars

HZAI-ZJNU/Mamba-YOLO

Boyiliee/LLaDA-AV

FeipengMa6/VLoRA

Fei-Long121/DeepBDC

UARK-AICV/TrackGUI

Mark12Ding/SAM2Long

chengche6230/ReST

hou-yz/MVDet

sungonce/SENet

YehLi/xmodaler

PyRetri/PyRetri

willard-yuan/awesome-cbir-papers

abewley/sort

WangzcBruce/DHD

PaddlePaddle/PaddleDetection

Ruzim/NSFC-application-template-latex

Zplusdragon/CION_ReIDZoo

Nightmare-n/DepthAnyVideo

Datacastle-Algorithm-Department/images

timesler/facenet-pytorch

NExT-ChatV/NExT-Chat

ml-research/deictic-segment-anything

vision4robotics/PRL-Track

ayesha-ishaq/Open3DTrack

jyrao/MatchTime

leiurayer/downkyi

academicpages/academicpages.github.io

QwenLM/Qwen2-VL

mbzuai-oryx/GeoChat

QwenLM/Qwen-VL