RunsenXu

Ph.D. Student@MMLab, CUHK

The Chinese University of Hong KongHong Kong

RunsenXu's Stars

HCPLab-SYSU/Embodied_AI_Paper_List
[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI
53234
shreyansh26/Attention-Mask-Patterns
Using FlexAttention to compute attention with different masking patterns
Language:Python31
TangYuan96/MiniGPT-3D
[MM 2024] [Need a RTX 3090] MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors
Language:Python604
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Language:Python10.1k965
ZiyuGuo99/SAM2Point
The Most Faithful Implementation of Segment Anything (SAM) in 3D
Language:Python25712
open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
Language:Python1.1k146
zubair-irshad/Awesome-Robotics-3D
A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites
47024
google-deepmind/tapnet
Tracking Any Point (TAP)
Language:Jupyter Notebook1.3k120
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
Language:Python36.1k5.1k
jinlinyi/PerspectiveFields
[CVPR 2023 Highlight] Perspective Fields for Single Image Camera Calibration
Language:Jupyter Notebook19319
opendatalab/MinerU
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具，支持PDF/网页/多格式电子书提取。
Language:Python11.5k864
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
Language:Java42.6k3.4k
facebookresearch/vggsfm
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Language:Python84553
karpathy/LLM101n
LLM101n: Let's build a Storyteller
28.8k1.6k
EurekaLabsAI/mlp
The Multilayer Perceptron Language Model
Language:Python50445
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python8.4k594
OpenRobotLab/GRUtopia
GRUtopia: Dream General Robots in a City at Scale
Language:Python46421
IDEA-Research/Grounding-DINO-1.5-API
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Language:Python71821
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language:HTML30216
microsoft/vscode
Visual Studio Code
Language:TypeScript163k28.7k
OpenRobotLab/Grounded_3D-LLM
Code&Data for Grounded 3D-LLM with Referent Tokens
Language:Python761
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
Language:Python2k33
bertjiazheng/Structured3D
[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
Language:Python52462
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language:Python6.5k572
kornia/kornia
Geometric Computer Vision Library for Spatial AI
Language:Python9.8k958
verlab/accelerated_features
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
Language:Jupyter Notebook90192
zju3dv/pats
Code for "PATS: Patch Area Transportation with Subdivision for Local Feature Matching", CVPR 2023
Language:C++947
facebookresearch/lightplane
Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from images to 3D grids.
Language:Python2527
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python26.2k3k
jwasham/coding-interview-university
A complete computer science study plan to become a software engineer.
305k76.4k

RunsenXu

RunsenXu's Stars

HCPLab-SYSU/Embodied_AI_Paper_List

shreyansh26/Attention-Mask-Patterns

TangYuan96/MiniGPT-3D

Zeyi-Lin/HivisionIDPhotos

ZiyuGuo99/SAM2Point

open-compass/VLMEvalKit

zubair-irshad/Awesome-Robotics-3D

google-deepmind/tapnet

hacksider/Deep-Live-Cam

jinlinyi/PerspectiveFields

opendatalab/MinerU

Stirling-Tools/Stirling-PDF

facebookresearch/vggsfm

karpathy/LLM101n

EurekaLabsAI/mlp

facebookresearch/xformers

OpenRobotLab/GRUtopia

IDEA-Research/Grounding-DINO-1.5-API

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

microsoft/vscode

OpenRobotLab/Grounded_3D-LLM

yuweihao/MambaOut

bertjiazheng/Structured3D

huggingface/lerobot

kornia/kornia

verlab/accelerated_features

zju3dv/pats

facebookresearch/lightplane

meta-llama/llama3

jwasham/coding-interview-university