LuoXinjiee's Stars
CrystalSixone/DSRG
Code for "A Dual Semantic-Aware Recurrent Global-Adaptive Network for Vision-and-Language Navigation"
YicongHong/Recurrent-VLN-BERT
Code for the CVPR 2021 Oral paper "A Recurrent Vision-and-Language BERT for Navigation"
CircleRadon/TokenPacker
The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".
agnJason/PianoMotion10M
Code release for PianoMotion10M
cshizhe/VLN-DUET
Official implementation of "Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation" (CVPR'22 Oral).
songw-zju/HASSC
The official implementation of "Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation" (CVPR 2024)
peteanderson80/Matterport3DSimulator
AI Research Platform for Reinforcement Learning from Real Panoramic Images.
sinahmr/NACLIP
PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation"
ctjacobs/sudoku-genetic-algorithm
Solves a Sudoku puzzle using a genetic algorithm.
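Not the repository's code, but a minimal hedged sketch of the same idea: each candidate keeps every row a permutation of 1-9, fitness counts column and box conflicts, and mutation swaps two free cells within a row. The puzzle grid, population size, and loop bounds below are illustrative assumptions.

```python
# Hedged sketch of a genetic-algorithm Sudoku solver (not ctjacobs' implementation).
import random

GIVENS = [[0] * 9 for _ in range(9)]  # hypothetical puzzle: 0 marks an empty cell

def random_candidate():
    # Fill each row's empty cells with a shuffle of its missing digits,
    # so every row is always a permutation of 1..9.
    rows = []
    for r in range(9):
        missing = [v for v in range(1, 10) if v not in GIVENS[r]]
        random.shuffle(missing)
        it = iter(missing)
        rows.append([GIVENS[r][c] or next(it) for c in range(9)])
    return rows

def conflicts(board):
    # Fitness: duplicate values in columns and 3x3 boxes (rows are conflict-free).
    bad = 0
    for i in range(9):
        bad += 9 - len({board[r][i] for r in range(9)})
    for br in range(0, 9, 3):
        for bc in range(0, 9, 3):
            box = [board[br + r][bc + c] for r in range(3) for c in range(3)]
            bad += 9 - len(set(box))
    return bad

def mutate(board):
    # Swap two non-given cells within one row, preserving the row permutation.
    child = [row[:] for row in board]
    r = random.randrange(9)
    free = [c for c in range(9) if GIVENS[r][c] == 0]
    if len(free) >= 2:
        a, b = random.sample(free, 2)
        child[r][a], child[r][b] = child[r][b], child[r][a]
    return child

population = [random_candidate() for _ in range(200)]
for gen in range(5000):
    population.sort(key=conflicts)
    if conflicts(population[0]) == 0:
        break  # solved
    survivors = population[:100]  # keep the best half, refill with mutated survivors
    population = survivors + [mutate(random.choice(survivors)) for _ in range(100)]
```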
LiWentomng/gradio-osprey-demo
Gradio demo used in our paper "Osprey: Pixel Understanding with Visual Instruction Tuning".
xiaolul2/MGMap
[CVPR 2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction"
mhamilton723/FeatUp
Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" (ICLR 2024)
bytedance/fc-clip
[NeurIPS 2023] This repo contains the code for our paper "Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP"
facebookresearch/ConvNeXt
Code release for the ConvNeXt model
mlfoundations/open_clip
An open source implementation of CLIP.
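A minimal zero-shot classification sketch following the usage pattern published in open_clip's README; the model/pretrained pair ('ViT-B-32' / 'laion2b_s34b_b79k') and the image path are illustrative assumptions, not recommendations from this list.

```python
# Hedged open_clip usage sketch; model tag and image path are illustrative.
import torch
from PIL import Image
import open_clip

model, _, preprocess = open_clip.create_model_and_transforms(
    'ViT-B-32', pretrained='laion2b_s34b_b79k')
model.eval()
tokenizer = open_clip.get_tokenizer('ViT-B-32')

image = preprocess(Image.open('example.jpg')).unsqueeze(0)  # hypothetical image
text = tokenizer(['a diagram', 'a dog', 'a cat'])

with torch.no_grad():
    image_features = model.encode_image(image)
    text_features = model.encode_text(text)
    # L2-normalize, then take softmax over scaled cosine similarities
    image_features /= image_features.norm(dim=-1, keepdim=True)
    text_features /= text_features.norm(dim=-1, keepdim=True)
    probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)

print(probs)  # per-text match probabilities for the image
```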
linyq2117/TagCLIP
Qinying-Liu/Awesome-Open-Vocabulary-Semantic-Segmentation
A curated publication list on open-vocabulary semantic segmentation and related areas (e.g., zero-shot semantic segmentation).
open-mmlab/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
wkentaro/labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
wangf3014/SCLIP
Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
xmed-lab/CLIP_Surgery
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
wysoczanska/clip_dinoiser
Official implementation of the paper 'CLIP-DINOiser: Teaching CLIP a few DINO tricks'.
CircleRadon/Osprey
[CVPR 2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
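A hedged sketch of loading a DINOv2 backbone through the repo's published torch.hub entrypoints; the variant name ('dinov2_vits14') and input size are illustrative choices.

```python
# Hedged DINOv2 feature-extraction sketch via torch.hub.
import torch

model = torch.hub.load('facebookresearch/dinov2', 'dinov2_vits14')
model.eval()

# ViT-S/14 uses 14x14 patches, so input sides should be multiples of 14
# (224 = 16 * 14); the dummy tensor stands in for a preprocessed image.
dummy = torch.randn(1, 3, 224, 224)
with torch.no_grad():
    features = model(dummy)  # global image embedding (384-dim for ViT-S/14)

print(features.shape)
```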
chongzhou96/MaskCLIP
Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 2022 Oral)
PVIT-official/PVIT
Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models
inuwamobarak/KOSMOS-2
KOSMOS-2 is designed to handle text and images simultaneously, redefining how we perceive and interact with multimodal data. It is built on a Transformer-based causal language model architecture, similar to other well-known models such as LLaMA-2 and Mistral 7B.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
lllyasviel/ControlNet
Let us control diffusion models!
shikras/shikra