MindLostGuy's Stars
CAD-MLLM/CAD-MLLM
CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM
Open-Cascade-SAS/OCCT
Open CASCADE Technology (OCCT) is an open-source software development platform for 3D CAD, CAM, CAE.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
samxuxiang/BrepGen
[SIGGRAPH 2024] Official PyTorch Implementation of "BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry".
mtli/HTML4Vision
A simple HTML visualization tool for computer vision research 🛠️
QiujieDong/NeurCADRecon
NeurCADRecon: Neural Representation for Reconstructing CAD Surfaces by Enforcing Zero Gaussian Curvature
hzxie/CityDreamer
The official implementation of "CityDreamer: Compositional Generative Model of Unbounded 3D Cities". (Xie et al., CVPR 2024)
weixi-feng/LayoutGPT
Official repo for LayoutGPT
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.
princeton-vl/infinigen
Infinite Photorealistic Worlds using Procedural Generation
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
prs-eth/point2cad
Code for "Point2CAD: Reverse Engineering CAD Models from 3D Point Clouds"
modelscope/ms-swift
Use PEFT or full-parameter training to fine-tune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, GOT-OCR2, ...).
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
deepseek-ai/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
microsoft/Graphormer
Graphormer is a general-purpose deep learning backbone for molecular modeling.
zorzi-s/PolyWorldPretrainedNetwork
PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite Images
newhouseb/clownfish
Constrained Decoding for LLMs against JSON Schema
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
HobbitLong/shape2prog
[ICLR 2019] Learning to Infer and Execute 3D Shape Programs
PrincetonLIPS/vitruvion
Code and data for Vitruvion: A Generative Model of Parametric CAD Sketches (ICLR 2022)
autonomousvision/shape_as_points
[NeurIPS'21] Shape As Points: A Differentiable Poisson Solver
1zb/3DShape2VecSet
NVlabs/edm
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
CatOnly/CrashNotes
Notes on Computer Graphics