yash0307
Computer Vision, Machine Learning.
Czech Technical University, Carnegie Mellon University, IIIT HyderabadPrague, Czech Republic
yash0307's Stars
KevinMusgrave/powerful-benchmarker
A library for ML benchmarking. It's powerful.
yitu-opensource/T2T-ViT
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
scalabel/scalabel
Scalabel: A versatile web-based visual data annotation tool
kashyap7x/QGN
Quadtree Generating Networks for Scene Parsing with Sparse Convolutions (https://arxiv.org/abs/1907.11821)
cvlab-epfl/disk
Disk code release
ChenRocks/UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
movienet/movienet-tools
Tools for movie and video research
fossasia/visdom
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.
voxel51/fiftyone
Refine high-quality datasets and visual AI models
thodan/epos
Code for "EPOS: Estimating 6D Pose of Objects with Symmetries", CVPR 2020.
AakashKumarNain/annotated_research_papers
This repo contains annotated research papers that I found really good and useful
TheAlgorithms/Python
All Algorithms implemented in Python
HaozhiQi/RPIN
Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021)
LuoweiZhou/VLP
Vision-Language Pre-training for Image Captioning and Question Answering
Andrew-Brown1/Smooth_AP
code for the ECCV '20 paper "Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval"
martius-lab/blackbox-backprop
Torch modules that wrap blackbox combinatorial solvers according to the method presented in "Differentiating Blackbox Combinatorial Solvers"
facebookresearch/pytorch3d
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
thodan/bop_toolkit
A Python toolkit of the BOP benchmark for 6D object pose estimation.
tensorflow/compression
Data compression in TensorFlow
HarisIqbal88/PlotNeuralNet
Latex code for making neural networks diagrams
d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
hiroharu-kato/neural_renderer
"Neural 3D Mesh Renderer" (CVPR 2018) by H. Kato, Y. Ushiku, and T. Harada.
MichalBusta/E2E-MLT
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
dmlc/gluon-nlp
NLP made easy
lmb-freiburg/ogn
Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs
griegler/octnet
OctNet: Learning Deep 3D Representations at High Resolutions
MichalBusta/DeepTextSpotter
ankush-me/SynthText
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
debidatta/syndata-generation
Code used to generate synthetic scenes and bounding box annotations for object detection. This was used to generate data used in the Cut, Paste and Learn paper
liuzhuang13/DenseNet
Densely Connected Convolutional Networks, In CVPR 2017 (Best Paper Award).