computer-vision

There are 35044 repositories under computer-vision topic.

  • open_clip

    An open source implementation of CLIP.

    Language:Python12.6k
  • CVPR2024-Paper-Code-Interpretation

    cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集,极市团队整理

  • fashion-mnist

    A MNIST-like fashion product database. Benchmark :point_down:

    Language:Python12.4k
  • Meshroom

    Meshroom

    Node-based Visual Programming Toolbox

    Language:QML12.2k
  • pytorch-grad-cam

    Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

    Language:Python12.2k
  • ludwig

    ludwig

    Low-code framework for building custom LLMs, neural networks, and other AI models

    Language:Python11.6k
  • segmentation_models.pytorch

    Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

    Language:Python10.9k
  • kornia

    🐍 Geometric Computer Vision Library for Spatial AI

    Language:Python10.7k
  • nerfstudio

    nerfstudio

    A collaboration friendly studio for NeRFs

    Language:Python10.6k
  • pcl

    Point Cloud Library (PCL)

    Language:C++10.6k
  • pix2pix

    Image-to-image translation with conditional adversarial nets

    Language:Lua10.5k
  • caire

    Content aware image resize library

    Language:Go10.4k
  • openFrameworks

    openFrameworks is a community-developed cross platform toolkit for creative coding in C++.

    Language:C++10.2k
  • fiftyone

    fiftyone

    Refine high-quality datasets and visual AI models

    Language:Python9.9k
  • colmap

    COLMAP - Structure-from-Motion and Multi-View Stereo

    Language:C++9.8k
  • computervision-recipes

    computervision-recipes

    Best Practices, code samples, and documentation for Computer Vision.

    Language:Jupyter Notebook9.8k
  • autogluon

    autogluon

    Fast and Accurate ML in 3 Lines of Code

    Language:Python9.4k
  • U-2-Net

    The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

    Language:Python9.4k
  • rerun

    rerun

    Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.

    Language:Rust9.2k
  • lama

    lama

    🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

    Language:Jupyter Notebook9.2k
  • RobustVideoMatting

    RobustVideoMatting

    Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

    Language:Python9.1k
  • openvino

    OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

    Language:C++8.8k
  • deeplake

    Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

    Language:Python8.8k
  • jetson-inference

    jetson-inference

    Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.

    Language:C++8.5k
  • Deep-Learning-Interview-Book

    深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)

  • notebooks

    A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.

    Language:Jupyter Notebook8.4k
  • librealsense

    Intel® RealSense™ SDK

    Language:C++8.2k
  • introtodeeplearning

    Lab Materials for MIT 6.S191: Introduction to Deep Learning

    Language:Jupyter Notebook8.1k
  • javacv

    Java interface to OpenCV, FFmpeg, and more

    Language:Java8.1k
  • ailab

    Experience, Learn and Code the latest breakthrough innovations with Microsoft AI

    Language:C#7.8k
  • awesome-object-detection

    Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html

  • mmagic

    OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

    Language:Jupyter Notebook7.3k
  • gocv

    Go package for computer vision using OpenCV 4 and beyond. Includes support for DNN, CUDA, OpenCV Contrib, and OpenVINO.

    Language:Go7.2k
  • BackgroundMattingV2

    BackgroundMattingV2

    Real-Time High-Resolution Background Matting

    Language:Python7.1k
  • models

    Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

    Language:Python6.9k
  • SerpentAI

    Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!

    Language:Python6.9k