vision

There are 1902 repositories under vision topic.

ravens
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Language:Python607
myvision
Computer vision based ML training data generation tool :rocket:
Language:JavaScript603
vector-python-sdk
Anki Vector Python SDK
Language:Python578
pytorch-dense-correspondence
Code for "Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation"
Language:Python573
neural-motifs
Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018)
Language:Python536
rewriting
Rewriting a Deep Generative Model, ECCV 2020 (oral). Interactive tool to directly edit the rules of a GAN to synthesize scenes with objects added, removed, or altered. Change StyleGANv2 to make extravagant eyebrows, or horses wearing hats.
Language:Python534
LLaVA-Mini
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
Language:Python524
SAPC-APCA
APCA (Accessible Perceptual Contrast Algorithm) is a new method for predicting contrast for use in emerging web standards (WCAG 3) for determining readability contrast. APCA is derived form the SAPC (S-LUV Advanced Predictive Color) which is an accessibility-oriented color appearance model designed for self-illuminated displays.
Language:CSS523
HRFormer
[ NeurIPS2021] This is an official implementation of our paper "HRFormer: High-Resolution Transformer for Dense Prediction".
Language:Python515
cliport
CLIPort: What and Where Pathways for Robotic Manipulation
Language:Jupyter Notebook514
nodejs-vision
This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node.
500
FaceCropper
:scissors: Crop faces, inside of your image, with iOS 11 Vision api.
Language:Swift484
arucogen
Online ArUco markers generator
Language:JavaScript479
r2c
Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)
Language:Python468
PythonFromSpace
Python Examples for Remote Sensing
Language:Jupyter Notebook464
OpticalFlow_Visualization
Python optical flow visualization following Baker et al. (ICCV 2007) as used by the MPI-Sintel challenge
Language:Python450
visualization
a collection of visualization function
Language:Python441
TokenLabeling
Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
Language:Jupyter Notebook433
apriltag_ros
A ROS wrapper of the AprilTag 3 visual fiducial detector
Language:C++404
GRIP
Program for rapidly developing computer vision applications
Language:Java384
PaperVision
Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quickly prototype your vision as you edit.
Language:Kotlin377
rowfill
Open-source unstructured data (PDFs, Images, Audiofiles) processing platform built for knowledge workers
Language:TypeScript363
photonvision
PhotonVision is the free, fast, and easy-to-use computer vision solution for the FIRST Robotics Competition.
Language:Java359
VectorDB-Plugin
Plugin that lets you ask questions about your documents including audio and video files.
Language:Python348
Amazing-ARkit
ARKit相关资源汇总群：326705018
348
awesome-deep-vision-web-demo
A curated list of awesome deep vision web demo
348
vision-based-prediction
Deep Learning for Vision-based Prediction
Language:TeX344
Stream-Omni
Stream-Omni is a GPT-4o-like language-vision-speech chatbot that simultaneously supports interaction across various modality combinations.
Language:Python343
FacesVisionDemo
👀 iOS11 demo application for age and gender classification of facial images.
Language:Swift324
arc-robot-vision
MIT-Princeton Vision Toolbox for Robotic Pick-and-Place at the Amazon Robotics Challenge 2017 - Robotic Grasping and One-shot Recognition of Novel Objects with Deep Learning.
Language:Lua315
dirt
DIRT: a fast differentiable renderer for TensorFlow
Language:C++314
cs231a-notes
The course notes for Stanford's CS231A course on computer vision
Language:TeX310
apc-vision-toolbox
MIT-Princeton Vision Toolbox for the Amazon Picking Challenge 2016 - RGB-D ConvNet-based object segmentation and 6D object pose estimation.
Language:C++307
ImageDetect
✂️ Detect and crop faces, barcodes and texts in image with iOS 11 Vision api.
Language:Swift306
ChatGPT-OpenAI-Smart-Speaker
This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice and vision-driven conversations, with additional web search capabilities via OpenAI and Langchain agents.
Language:Python300
SAM-DETR
[CVPR'2022] SAM-DETR & SAM-DETR++: Official PyTorch Implementation
Language:Python297