vision

There are 1902 repositories under the vision topic.

  • ravens

    Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.

    Language: Python · 607
  • myvision

    Computer-vision-based ML training data generation tool 🚀

    Language: JavaScript · 603
  • vector-python-sdk

    Anki Vector Python SDK

    Language: Python · 578
  • pytorch-dense-correspondence

    Code for "Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation"

    Language: Python · 573
  • neural-motifs

    Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018)

    Language: Python · 536
  • rewriting

    Rewriting a Deep Generative Model, ECCV 2020 (oral). Interactive tool to directly edit the rules of a GAN to synthesize scenes with objects added, removed, or altered. Change StyleGANv2 to make extravagant eyebrows, or horses wearing hats.

    Language: Python · 534
  • LLaVA-Mini

    LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.

    Language: Python · 524
  • SAPC-APCA

    APCA (Accessible Perceptual Contrast Algorithm) is a new method for predicting contrast, intended for use in emerging web standards (WCAG 3) to determine readability contrast. APCA is derived from SAPC (S-LUV Advanced Predictive Color), an accessibility-oriented color appearance model designed for self-illuminated displays.

    Language: CSS · 523
  • HRFormer

    [NeurIPS 2021] This is an official implementation of our paper "HRFormer: High-Resolution Transformer for Dense Prediction".

    Language: Python · 515
  • cliport

    CLIPort: What and Where Pathways for Robotic Manipulation

    Language: Jupyter Notebook · 514
  • nodejs-vision

    This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node.

  • FaceCropper

    ✂️ Crop faces inside your image with the iOS 11 Vision API.

    Language: Swift · 484
  • arucogen

    Online ArUco markers generator

    Language: JavaScript · 479
  • r2c

    Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)

    Language: Python · 468
  • PythonFromSpace

    Python Examples for Remote Sensing

    Language: Jupyter Notebook · 464
  • OpticalFlow_Visualization

    Python optical flow visualization following Baker et al. (ICCV 2007) as used by the MPI-Sintel challenge

    Language: Python · 450
  • visualization

    A collection of visualization functions

    Language: Python · 441
  • TokenLabeling

    PyTorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"

    Language: Jupyter Notebook · 433
  • apriltag_ros

    A ROS wrapper of the AprilTag 3 visual fiducial detector

    Language: C++ · 404
  • GRIP

    Program for rapidly developing computer vision applications

    Language: Java · 384
  • PaperVision

    Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quickly prototype your vision as you edit.

    Language: Kotlin · 377
  • rowfill

    Open-source platform for processing unstructured data (PDFs, images, audio files), built for knowledge workers

    Language: TypeScript · 363
  • photonvision

    PhotonVision is the free, fast, and easy-to-use computer vision solution for the FIRST Robotics Competition.

    Language: Java · 359
  • VectorDB-Plugin

    Plugin that lets you ask questions about your documents, including audio and video files.

    Language: Python · 348
  • Amazing-ARkit

    A collection of ARKit-related resources. Group: 326705018

  • awesome-deep-vision-web-demo

    A curated list of awesome deep vision web demos

  • vision-based-prediction

    Deep Learning for Vision-based Prediction

    Language: TeX · 344
  • Stream-Omni

    Stream-Omni is a GPT-4o-like language-vision-speech chatbot that simultaneously supports interaction across various modality combinations.

    Language: Python · 343
  • FacesVisionDemo

    👀 iOS 11 demo application for age and gender classification of facial images.

    Language: Swift · 324
  • arc-robot-vision

    MIT-Princeton Vision Toolbox for Robotic Pick-and-Place at the Amazon Robotics Challenge 2017 - Robotic Grasping and One-shot Recognition of Novel Objects with Deep Learning.

    Language: Lua · 315
  • dirt

    DIRT: a fast differentiable renderer for TensorFlow

    Language: C++ · 314
  • cs231a-notes

    The course notes for Stanford's CS231A course on computer vision

    Language: TeX · 310
  • apc-vision-toolbox

    MIT-Princeton Vision Toolbox for the Amazon Picking Challenge 2016 - RGB-D ConvNet-based object segmentation and 6D object pose estimation.

    Language: C++ · 307
  • ImageDetect

    ✂️ Detect and crop faces, barcodes, and text in images with the iOS 11 Vision API.

    Language: Swift · 306
  • ChatGPT-OpenAI-Smart-Speaker

    This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice and vision-driven conversations, with additional web search capabilities via OpenAI and Langchain agents.

    Language: Python · 300
  • SAM-DETR

    [CVPR'2022] SAM-DETR & SAM-DETR++: Official PyTorch Implementation

    Language: Python · 297