yanweifu

yanweifu's Stars

facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook46.9k 305 6625.6k
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Language:Python35.7k 996 1883.4k
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Language:Python30.1k 386 3.5k7.4k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python13.6k 115 1k1.2k
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Language:TypeScript12.3k 184 4.1k3k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
12k 270 109769
kornia/kornia
Geometric Computer Vision Library for Spatial AI
Language:Python9.8k 127 926959
voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
Language:Python8.1k 55 1.5k544
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python5.6k 50 559439
ChaoningZhang/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
Language:Jupyter Notebook4.7k 44 123485
amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
Language:Python3.8k 55 52309
emcf/engshell
An English-language shell for any OS, powered by LLMs
Language:Python2.2k 31 17190
nywang16/Pixel2Mesh
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images. In ECCV2018.
Language:Python1.6k 51 120294
mkazhdan/PoissonRecon
Poisson Surface Reconstruction
Language:C++1.6k 70 274425
pixop/video-compare
Split screen video comparison tool using FFmpeg and SDL2
Language:C++965 12 6744
christophschuhmann/improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
Language:Python876 6 1087
Shilin-LU/TF-ICON
[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)
Language:Python787 35 25102
youmi-zym/GO-SLAM
[ICCV2023] GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction
Language:Python359 12 4331
robopen/roboagent
Repository to train and evaluate RoboAgent
Language:Python291 28 2223
ewrfcas/MVSFormer
Codes of MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth (TMLR2023)
Language:Python181 3 3710
natowi/CameraCalibTools
List of Camera Calibration Tools + Patterns
153 3 323
YanjieZe/GNFactor
[CoRL 2023 Oral] GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields
Language:Python113 2 109
FVPLab/Argus-3D
Language:Python94 7 136
hetolin/SAR-Net
Code for "SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation" CVPR2022
Language:Python66 3 217
ewrfcas/ZITS-PlusPlus
ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors (TPAMI2023)
Language:Python59 4 115
MCG-NJU/TemporalPerceiver
[T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Language:Python34 1 11
R-Mahmoudi/Real-Time-Object-Counting-on-Jetson-Nano
Language:Python26 1 08
hetolin/PourIt
Code for "PourIt!: Weakly-supervised Liquid Perception from a Single Image for Visual Closed-Loop Robotic Pouring" ICCV2023
Language:Python14 3 12
whhamber/Oracle-50K
Oracle Character Recognition Dataset - Oracle-50K
12 1 10
sipposip/simple-gcm-deep-learning
code developed for the paper "Toward Data‐Driven Weather and Climate Forecasting: Approximating a Simple General Circulation Model With Deep Learning"
Language:Python9 1 04