ShaohuaDong2021

PhD student @ UNT

ShaohuaDong2021's Stars

geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Language:Python41.7k 876 5785k
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.2k 220 4502.9k
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language:Python15.1k 104 9631.4k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook14.2k 116 3741.3k
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Language:C++7.7k 75 149407
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++7.5k 87 1.6k811
UZ-SLAMLab/ORB_SLAM3
ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM
Language:C++6.2k 128 7932.5k
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Language:Python5.9k 67 269506
open-mmlab/mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
Language:Python5k 63 1.6k1.5k
vikhyat/moondream
tiny vision language model
Language:Jupyter Notebook4.5k 52 97403
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language:Python3.6k 47 172273
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Language:Python3.1k 60 90310
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Language:Python2.9k 38 192234
mit-biomimetics/Cheetah-Software
Language:C++2.4k 124 91903
mit-han-lab/bevfusion
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Language:Python2.2k 42 606383
traveller59/second.pytorch
SECOND for KITTI/NuScenes object detection
Language:Python1.7k 47 477720
DLYuanGod/TinyGPT-V
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Language:Python1.2k 19 3275
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
971 25 154
CVPR2023-3D-Occupancy-Prediction/CVPR2023-3D-Occupancy-Prediction
CVPR2023-Occupancy-Prediction-Challenge
Language:Python759 19 5856
cwchenwang/awesome-3d-diffusion
A collection of papers on diffusion models for 3D generation.
655 33 529
allenai/unified-io-2
Language:Python540 15 1625
Azure/MS-AMP
Microsoft Automatic Mixed Precision Library
Language:Python483 11 5735
vasgaowei/BEV-Perception
Bird's Eye View Perception
332 9 220
llm-efficiency-challenge/neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
Language:Python245 16 1656
zhanghm1995/Forge_VFM4AD
A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.
207 7 17
wudongming97/RMOT
[CVPR2023] Referring Multi-Object Tracking
Language:Python108 4 2111
liangxuy/Inter-X
[CVPR 2024] Official implementation of the paper "Towards Versatile Human-Human Interaction Analysis"
Language:Python103 7 92
jjwang/HanOS
Microkernel-based General Purpose Operating System #Hobby OS#
Language:C42 3 14
zyrant/SPGroup3D
[AAAI 2024] SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection
Language:Python31 2 11
YingLv1106/CAINet
This is a multimodal semantic segmentation method, named CAINet: Context-Aware Interaction Network for RGB-T Semantic Segmentation.
Language:Python18 2 42

ShaohuaDong2021

ShaohuaDong2021's Stars

geekan/MetaGPT

Vision-CAIR/MiniGPT-4

huggingface/peft

IDEA-Research/Grounded-Segment-Anything

SJTU-IPADS/PowerInfer

NVIDIA/TensorRT-LLM

UZ-SLAMLab/ORB_SLAM3

Lightning-AI/lit-llama

open-mmlab/mmdetection3d

vikhyat/moondream

mlfoundations/open_flamingo

NExT-GPT/NExT-GPT

OpenGVLab/Ask-Anything

mit-biomimetics/Cheetah-Software

mit-han-lab/bevfusion

traveller59/second.pytorch

DLYuanGod/TinyGPT-V

yunlong10/Awesome-LLMs-for-Video-Understanding

CVPR2023-3D-Occupancy-Prediction/CVPR2023-3D-Occupancy-Prediction

cwchenwang/awesome-3d-diffusion

allenai/unified-io-2

Azure/MS-AMP

vasgaowei/BEV-Perception

llm-efficiency-challenge/neurips_llm_efficiency_challenge

zhanghm1995/Forge_VFM4AD

wudongming97/RMOT

liangxuy/Inter-X

jjwang/HanOS

zyrant/SPGroup3D

YingLv1106/CAINet