yash0307

Computer Vision, Machine Learning.

Czech Technical University, Carnegie Mellon University, IIIT HyderabadPrague, Czech Republic

Pinned Repositories

diht
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Language:Python132 20 54
TextTopicNet
Self-supervised learning of visual features through embedding images into text topic spaces
Language:Python95 9 428
E2E-MLT
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
Language:C++290 16 7584
docile
DocILE: Document Information Localization and Extraction Benchmark
Language:Python119 12 59
differentiable_ransac
PyTorch Implementation of the ICCV 2023 paper: Generalized Differentiable RANSAC ($\nabla$-RANSAC).
Language:Python177 5 1410
3D-R2N2
Single/multi view image(s) to voxel reconstruction using a recurrent neural network
Language:Python0 2 00
git_DIP
All assignments goes here, of course after deadlines :D
Language:HTML1 2 00
PatchMatching
Patch Matching as Course Project of Digital Image Processing.
Language:Matlab3 2 01
RecallatK_surrogate
Code for Recall@k Surrogate Loss with Large Batches and Similarity Mixup, CVPR 2022.
Language:Python58 6 108
SuperCNN
16-811 Project.
Language:TeX10 2 25

yash0307's Repositories

yash0307/RecallatK_surrogate
Code for Recall@k Surrogate Loss with Large Batches and Similarity Mixup, CVPR 2022.
Language:Python58 6 108
yash0307/3D-R2N2
Single/multi view image(s) to voxel reconstruction using a recurrent neural network
Language:Python0 2 00
yash0307/ALBEF
Code for ALBEF: a new vision-language pre-training method
Language:Python0 0 00
yash0307/awesome-cbir-papers
📝Awesome and classical image retrieval papers
0 0
yash0307/bilinear-sampler-pytorch
Pytorch implimentation of STN bilinear sampler
Language:Python2 0
yash0307/CA_Entropy_Model
Repository of the paper "Context-adaptive Entropy Model for End-to-end Optimized Image Compression"
Language:Python1 0
yash0307/compression
Data compression in TensorFlow
Language:Python2 0
yash0307/diht
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Language:Python0 0
yash0307/DRL_assignment2
Language:Python4 0
yash0307/E2E-MLT
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
Language:C++0 0
yash0307/gluon-cv
Gluon CV Toolkit
Language:Python2 0
yash0307/GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
Language:Python0 0
yash0307/imgcomp-cvpr
TensorFlow implementation of Conditional Probability Models for Deep Image Compression, published in CVPR 2018
Language:Python2 0
yash0307/LearningSamplingPolicies
Language:Python2 0
yash0307/models
Models and examples built with TensorFlow
Language:Python2 0
yash0307/MonoDepth-PyTorch
Unofficial implementation of Unsupervised Monocular Depth Estimation neural network MonoDepth in PyTorch
Language:Python2 0
yash0307/ogn
Language:C++2 0
yash0307/openseg.pytorch
The official Pytorch implementation of OCNet series and SegFix.
Language:Python0 0
yash0307/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Language:Python0 0
yash0307/pytorch-lars
"Layer-wise Adaptive Rate Scaling" in PyTorch
Language:Python1 0
yash0307/Self-Supervised-Retrieval
2 0
yash0307/Smooth_AP-1
code for the ECCV '20 paper "Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval"
Language:Python1 0
yash0307/srgan
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
Language:Python2 0
yash0307/SRGAN-tensorflow
Tensorflow implementation of the SRGAN algorithm for single image super-resolution
Language:Python2 0
yash0307/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Language:Python0 0
yash0307/SynthText
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
Language:Python2 0
yash0307/TCL
code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022
Language:Python0 0
yash0307/Tensorflow-estimator-multilabel-classification
Language:Python2 0
yash0307/webdataset-examples
Examples for the WebDataset PyTorch Dataset Library
Language:Python1 0
yash0307/wheelzoom
A small script for zooming IMG elements with the mousewheel/trackpad.
Language:JavaScript2 0