jungle-gym-ac

Nanjing University

Pinned Repositories

awesome-detection-transformer
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
10
awesome-multiple-object-tracking
Resources for Multiple Object Tracking (MOT)
0 0 00
awesome-open-vocabulary-object-detection
0 0 00
Awesome-Token-Compress
A paper list of some recent works about Token Compress for Vit and VLM
00
cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language:Python0 0 00
CDN
Code for "Mining the Benefits of Two-stage and One-stage HOI Detection"
Language:Python0 0 00
deeplearning_ai_books
deeplearning.ai（吴恩达老师的深度学习课程笔记及资源）
Language:HTML1 0 00
NJU-Big-Data
Course Repo for Big Data Processing: Comprehensive Experiments
Language:Java2 1 00
The-Phoenix-Proiect
凤凰项目：一个 IT运维的传奇故事
2 0 00
p-MoD
p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
Language:Python31 2 42

jungle-gym-ac's Repositories

jungle-gym-ac/NJU-Big-Data
Course Repo for Big Data Processing: Comprehensive Experiments
Language:Java2 1 00
jungle-gym-ac/awesome-detection-transformer
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
10
jungle-gym-ac/awesome-multiple-object-tracking
Resources for Multiple Object Tracking (MOT)
0 0 00
jungle-gym-ac/Awesome-Token-Compress
A paper list of some recent works about Token Compress for Vit and VLM
00
jungle-gym-ac/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language:Python0 0 00
jungle-gym-ac/CDN
Code for "Mining the Benefits of Two-stage and One-stage HOI Detection"
Language:Python0 0 00
jungle-gym-ac/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
Language:TypeScript0 0 00
jungle-gym-ac/detr
End-to-End Object Detection with Transformers
Language:Python0 0 00
jungle-gym-ac/NJUCS-Courses
Course Materials from NJUCS
0 1 00
jungle-gym-ac/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
jungle-gym-ac/copilot-gpt4-service
Convert Github Copilot to ChatGPT, free to use the GPT-4 model
Language:Go
jungle-gym-ac/DeepStack-VL
jungle-gym-ac/FastV
Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
Language:Python
jungle-gym-ac/FlexAttention
Language:Python0 0
jungle-gym-ac/HiRED
An early token dropping algorithm to improve inference efficiency for Vision-Lanauge Models with high-resolution images under resource constraints.
Language:Python0 0
jungle-gym-ac/HOI-Learning-List
A list of Human-Object Interaction Learning.
0 0
jungle-gym-ac/HOI-Transformer
HOI Detection Transformer Architecture, Based on CVPR2021 paper "QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information"
Language:Python
jungle-gym-ac/InternVideo
Video Foundation Models & Data for Multimodal Understanding
Language:Python0 0
jungle-gym-ac/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python0 0
jungle-gym-ac/Linux-Config
My Linux Configuration Scripts, Oh-My-Zsh, etc.
Language:Shell1 0
jungle-gym-ac/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python0 0
jungle-gym-ac/LLaVA-NeXT
Language:Python
jungle-gym-ac/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
Language:Python
jungle-gym-ac/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Language:Python0 0
jungle-gym-ac/NJU-DisSys-Go-RPC
RPC Distributed System implemented in GO
Language:Go1 0
jungle-gym-ac/Open-LLaVA-NeXT
An open-source implementation of LLaVA-NeXT.
jungle-gym-ac/p-MoD
p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
jungle-gym-ac/vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
Language:Python0 0
jungle-gym-ac/webvid
Large-scale text-video dataset. 10 million captioned short videos.
Language:Python0 0
jungle-gym-ac/zotero-bridge
Obsidian plugin to integrate with Zotero through ZotServer
Language:TypeScript0 0