yash0307
Computer Vision, Machine Learning.
Czech Technical University, Carnegie Mellon University, IIIT HyderabadPrague, Czech Republic
Pinned Repositories
diht
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
TextTopicNet
Self-supervised learning of visual features through embedding images into text topic spaces
E2E-MLT
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
docile
DocILE: Document Information Localization and Extraction Benchmark
differentiable_ransac
PyTorch Implementation of the ICCV 2023 paper: Generalized Differentiable RANSAC ($\nabla$-RANSAC).
3D-R2N2
Single/multi view image(s) to voxel reconstruction using a recurrent neural network
git_DIP
All assignments goes here, of course after deadlines :D
PatchMatching
Patch Matching as Course Project of Digital Image Processing.
RecallatK_surrogate
Code for Recall@k Surrogate Loss with Large Batches and Similarity Mixup, CVPR 2022.
SuperCNN
16-811 Project.
yash0307's Repositories
yash0307/RecallatK_surrogate
Code for Recall@k Surrogate Loss with Large Batches and Similarity Mixup, CVPR 2022.
yash0307/3D-R2N2
Single/multi view image(s) to voxel reconstruction using a recurrent neural network
yash0307/ALBEF
Code for ALBEF: a new vision-language pre-training method
yash0307/awesome-cbir-papers
đź“ťAwesome and classical image retrieval papers
yash0307/bilinear-sampler-pytorch
Pytorch implimentation of STN bilinear sampler
yash0307/CA_Entropy_Model
Repository of the paper "Context-adaptive Entropy Model for End-to-end Optimized Image Compression"
yash0307/compression
Data compression in TensorFlow
yash0307/diht
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
yash0307/DRL_assignment2
yash0307/E2E-MLT
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
yash0307/gluon-cv
Gluon CV Toolkit
yash0307/GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
yash0307/imgcomp-cvpr
TensorFlow implementation of Conditional Probability Models for Deep Image Compression, published in CVPR 2018
yash0307/LearningSamplingPolicies
yash0307/models
Models and examples built with TensorFlow
yash0307/MonoDepth-PyTorch
Unofficial implementation of Unsupervised Monocular Depth Estimation neural network MonoDepth in PyTorch
yash0307/ogn
yash0307/openseg.pytorch
The official Pytorch implementation of OCNet series and SegFix.
yash0307/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
yash0307/pytorch-lars
"Layer-wise Adaptive Rate Scaling" in PyTorch
yash0307/Self-Supervised-Retrieval
yash0307/Smooth_AP-1
code for the ECCV '20 paper "Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval"
yash0307/srgan
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
yash0307/SRGAN-tensorflow
Tensorflow implementation of the SRGAN algorithm for single image super-resolution
yash0307/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
yash0307/SynthText
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
yash0307/TCL
code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022
yash0307/Tensorflow-estimator-multilabel-classification
yash0307/webdataset-examples
Examples for the WebDataset PyTorch Dataset Library
yash0307/wheelzoom
A small script for zooming IMG elements with the mousewheel/trackpad.