Pinned Repositories
Uni3D
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
Awesome-BEV-Perception-Multi-Cameras
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird's-Eye-View, such as DETR3D, BEVDet, BEVFormer, BEVDepth, UniAD
Awesome-BEV-Perception-Multi-Cameras
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird-Eye-View, such as DETR3D, BEVDet, BEVFormer
awesome-NeRF
A curated list of awesome neural radiance fields papers
bevfusion
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
detr3d
Emu
Emu: An Open Multimodal Generalist
EVA
EVA Series: Visual Representation Fantasies from BAAI
GANet
A Keypoint-based Global Association Network for Lane Detection. Accepted by CVPR 2022
Wolfwjs's Repositories
Wolfwjs/GANet
A Keypoint-based Global Association Network for Lane Detection. Accepted by CVPR 2022
Wolfwjs/detr3d
Wolfwjs/Awesome-BEV-Perception-Multi-Cameras
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird-Eye-View, such as DETR3D, BEVDet, BEVFormer
Wolfwjs/awesome-NeRF
A curated list of awesome neural radiance fields papers
Wolfwjs/bevfusion
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Wolfwjs/Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
Wolfwjs/Emu
Emu: An Open Multimodal Generalist
Wolfwjs/EVA
EVA Series: Visual Representation Fantasies from BAAI
Wolfwjs/ImageBind
ImageBind One Embedding Space to Bind Them All
Wolfwjs/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Wolfwjs/LLaVA
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
Wolfwjs/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, image/video restoration/enhancement, etc.
Wolfwjs/open_clip
An open source implementation of CLIP.
Wolfwjs/Wolfwjs
Wolfwjs/Megatron-LM
Ongoing research training transformer models at scale
Wolfwjs/ONE-PEACE
A general representation modal across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Wolfwjs/open_flamingo
An open-source framework for training large multimodal models.
Wolfwjs/OpenShape_code
Wolfwjs/Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
Wolfwjs/PandaGPT
PandaGPT: One Model To Instruction-Follow Them All
Wolfwjs/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Wolfwjs/ULIP
Wolfwjs/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Wolfwjs/ViT-Lens
[Preprint] ViT-Lens: Towards Omni-modal Representations