matthewdm0816

PhD Student for AI | Otaku Coder

Peking University

Deneb

matthewdm0816's Stars

camenduru/stable-diffusion-webui-colab
stable diffusion webui colab
Language: Jupyter Notebook 15.6k 189 3532.6k
Rem0o/FanControl.Releases
This is the release repository for Fan Control, a highly customizable fan controlling software for Windows.
13.8k 126 2k437
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language: Python 11.9k 100 507834
BradyFU/Awesome-Multimodal-Large-Language-Models
Latest Advances on Multimodal Large Language Models
11.7k 269 108758
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language: Python 6.8k 49 210518
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language: Python 4.8k 49 430360
Blealtan/efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
Language: Python 3.9k 33 37341
mseitzer/pytorch-fid
Compute FID scores with PyTorch.
Language: Python 3.3k 15 86500
jettify/pytorch-optimizer
torch-optimizer -- collection of optimizers for Pytorch
Language: Python 3k 33 64294
torch-points3d/torch-points3d
Pytorch framework for doing deep learning on point clouds.
Language: Python 2.5k 53 365390
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.4k 46 3157
rom1504/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
Language: Jupyter Notebook 2.4k 23 231208
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language: Python 2k 19 80164
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
Language: Python 2k 6 24333
zhoubolei/bolei_awesome_posters
CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!
1.4k 10 0126
scito/extract_otp_secrets
Extract one time password (OTP) secrets from QR codes exported by two-factor authentication (2FA) apps such as "Google Authenticator". The exported QR codes from authentication apps can be captured by camera, read from images, or read from text files. The secrets can be exported to JSON or CSV, or printed as QR codes to console.
Language: Python 1.1k 8 30136
ActiveVisionLab/Awesome-LLM-3D
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
972 37 465
OpenRobotLab/PointLLM
[ECCV 2024 Oral] PointLLM: Empowering Large Language Models to Understand Point Clouds
Language: Python 505 12 3523
ChanganVR/awesome-embodied-vision
Reading list for research topics in embodied vision
493 15 165
GraphPKU/PiSSA
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models
Language: Jupyter Notebook 245 4 219
Open3DA/LL3DA
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.
Language: Python 221 6 259
zaibacu/thesaurus
Offline database of synonyms/thesaurus
Language: Python 183 4 440
RUCAIBox/POPE
The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''
Language: Python 166 1 06
hako-mikan/sd-webui-traintrain
LoRA training extension for Stable Diffusion Web-UI
Language: Python 140 2 195
ch3cook-fdu/Vote2Cap-DETR
[CVPR 2023] Vote2Cap-DETR and [T-PAMI 2024] Vote2Cap-DETR++; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning methods
Language: Python 82 2 225
chenguolin/InstructScene
[ICLR 2024 spotlight] Official implementation of "InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior".
Language: Python 79 4 1310
matthewdm0816/BridgeQA
[AAAI 24] Official Codebase for BridgeQA: Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA
Language: Python 13 1 01
idejie/3DSyn
Language: Python 5 2 00
TerryLiu18/image-captioning-for-celebrities
image captioning with face recognition for celebrities
Language: Jupyter Notebook 4 1 00
idejie/KAD
Language: Python 3 1 00
