yanweifu's Stars
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
kornia/kornia
Geometric Computer Vision Library for Spatial AI
voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
ChaoningZhang/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
emcf/engshell
An English-language shell for any OS, powered by LLMs
nywang16/Pixel2Mesh
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images. In ECCV2018.
mkazhdan/PoissonRecon
Poisson Surface Reconstruction
pixop/video-compare
Split screen video comparison tool using FFmpeg and SDL2
christophschuhmann/improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
Shilin-LU/TF-ICON
[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)
youmi-zym/GO-SLAM
[ICCV2023] GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction
robopen/roboagent
Repository to train and evaluate RoboAgent
ewrfcas/MVSFormer
Codes of MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth (TMLR2023)
natowi/CameraCalibTools
List of Camera Calibration Tools + Patterns
YanjieZe/GNFactor
[CoRL 2023 Oral] GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields
FVPLab/Argus-3D
hetolin/SAR-Net
Code for "SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation" CVPR2022
ewrfcas/ZITS-PlusPlus
ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors (TPAMI2023)
MCG-NJU/TemporalPerceiver
[T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
R-Mahmoudi/Real-Time-Object-Counting-on-Jetson-Nano
hetolin/PourIt
Code for "PourIt!: Weakly-supervised Liquid Perception from a Single Image for Visual Closed-Loop Robotic Pouring" ICCV2023
whhamber/Oracle-50K
Oracle Character Recognition Dataset - Oracle-50K
sipposip/simple-gcm-deep-learning
code developed for the paper "Toward Data‐Driven Weather and Climate Forecasting: Approximating a Simple General Circulation Model With Deep Learning"