SJTU-MKH's Stars
Mark12Ding/SAM2Long
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
Nightmare-n/DepthAnyVideo
Depth Any Video with Scalable Synthetic Data
KMnP/vpt
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
ZCMax/LLaVA-3D
A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
TianhaoFu/Awesome-3D-Semantic-Segmentation
Papers, code and datasets about deep learning for 3D Semantic Segmentation.
jingGM/GND
luxonis/depthai-experiments
Experimental projects we've done with DepthAI.
Benature/bib-catcher
Get BibTeX entries for multiple references from a single line of text by scraping Google Scholar with Python.
jingGM/DTG
chvmp/champ
MIT Cheetah I Implementation
MediaBrain-SJTU/LED
[CVPR2023] Leapfrog Diffusion Model for Stochastic Trajectory Prediction
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Visualize-ML/Book2_Beauty-of-Data-Visualization
Book_2_"Beauty of Data Visualization" | Iris Book series: From Basic Arithmetic to Machine Learning; criticism and corrections welcome
eric-ai-lab/awesome-vision-language-navigation
A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"
ActiveVisionLab/Awesome-LLM-3D
Awesome-LLM-3D: a curated list of resources on multi-modal large language models in the 3D world
robotics-survey/Awesome-Robotics-Foundation-Models
1989Ryan/llm-mcts
[NeurIPS 2023] We use large language models as a commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
dx118/dynaip
Code for Dynamic Inertial Poser (DynaIP): Part-Based Motion Dynamics Learning for Enhanced Human Pose Estimation with Sparse Inertial Sensors
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
rezazzr/Probing-Representation-Forgetting
chatanywhere/GPT_API_free
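Free ChatGPT API key. Free ChatGPT API with GPT4 API support (free); a free forwarding API for ChatGPT that is usable within mainland China, with a direct connection and no proxy needed. Can be used with software and plugins such as ChatBox, greatly reducing API usage costs. Chat freely and without restrictions from within China.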
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
opendilab/LMDrive
[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
goodfeli/adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
HCPLab-SYSU/Embodied_AI_Paper_List
[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI
dtuzi123/DEMC
EnnengYang/Awesome-Forgetting-in-Deep-Learning
A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning. TPAMI, 2024.
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
dtuzi123/OVAE
The implementation of Continual Variational Autoencoder Learning via Online Cooperative Memorization
YichiZhang98/SAM4MIS
SAM & SAM 2 for Medical Image Segmentation: Open-Source Project Summary