FishAndWasabi

Master Student @ Nankai University, VCIP @MCG-NKU.

Nankai University, VCIPTianjin

FishAndWasabi's Stars

jbwang1997/OPUS
[Neurips 2024] OPUS: Occupancy Prediction Using a Sparse Set
Language:Python471
jingyaogong/minimind
【大模型】3小时完全从0训练一个仅有26M的小参数GPT，最低仅需2G显卡即可推理训练！
Language:Python2.1k249
Becomebright/GroundVQA
Official PyTorch code of "Grounded Question-Answering in Long Egocentric Videos", accepted by CVPR 2024.
Language:Python491
hzwer/WritingAIPaper
Writing AI Conference Papers: A Handbook for Beginners
1k29
hassony2/useful-computer-vision-phd-resources
Lists of resources useful for my PhD in computer vision
53198
fiveai/MoCaE
The official implementation of "MoCaE: Mixture of Calibrated Experts Significantly Improves Accuracy in Object Detection"
Language:Python181
yongliu20/SCAN
[CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"
Language:Jupyter Notebook493
yzslab/gaussian-splatting-lightning
A 3D Gaussian Splatting framework with various derived algorithms and an interactive web viewer
Language:Jupyter Notebook45538
caojiaolong/RGBDBenchmark
This repository contains various RGBD models and aims to provide a benchmark for evaluating their FLOPs, MACs, and the number of parameters. We will continue to add more functionalities in the future
Language:Jupyter Notebook2
pytorch/torchtune
A Native-PyTorch Library for LLM Fine-tuning
Language:Python4.1k377
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Python2.4k138
mc-lan/ProxyCLIP
[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
Language:Python466
yixuan730/DetToolChain
Dettoolchain: A new prompting paradigm to unleash detection ability of MLLM
17
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language:Python2.2k165
mims-harvard/UniTS
A unified multi-task time series model.
Language:Python41154
baaivision/DIVA
Diffusion Feedback Helps CLIP See Better
Language:Python20511
LisaAnne/Hallucination
Language:Python607
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook11.3k965
NK-JittorCV/nk-diffusion
Language:Jupyter Notebook12
zhengyuan-xie/ECCV24_NeST
Language:Python17
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Language:Python6.9k998
Atten4Vis/LW-DETR
This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".
Language:Python21913
mc-lan/ClearCLIP
[ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
Language:Python411
nku-zhichengzhang/ExtDM
[CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"
Language:Python28
hhaAndroid/llama3
The official Meta Llama 3 GitHub site
1
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python6.1k539
yang-0201/MAF-YOLO
Implementation of paper - Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detection.
Language:Python495
jbwang1997/StabilityIndex
[ECCV 2024] Towards Stable 3D Object Detection
Language:Python391
Luo-Z13/pointobb
[CVPR2024] PointOBB: Learning Oriented Object Detection via Single Point Supervision
Language:Python463
TencentARC/Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
Language:Python62924

FishAndWasabi

FishAndWasabi's Stars

jbwang1997/OPUS

jingyaogong/minimind

Becomebright/GroundVQA

hzwer/WritingAIPaper

hassony2/useful-computer-vision-phd-resources

fiveai/MoCaE

yongliu20/SCAN

yzslab/gaussian-splatting-lightning

caojiaolong/RGBDBenchmark

pytorch/torchtune

QwenLM/Qwen2-VL

mc-lan/ProxyCLIP

yixuan730/DetToolChain

baaivision/EVA

mims-harvard/UniTS

baaivision/DIVA

LisaAnne/Hallucination

facebookresearch/sam2

NK-JittorCV/nk-diffusion

zhengyuan-xie/ECCV24_NeST

EleutherAI/gpt-neox

Atten4Vis/LW-DETR

mc-lan/ClearCLIP

nku-zhichengzhang/ExtDM

hhaAndroid/llama3

facebookresearch/DiT

yang-0201/MAF-YOLO

jbwang1997/StabilityIndex

Luo-Z13/pointobb

TencentARC/Open-MAGVIT2