Bridging the Gap Between Value and Policy Based Reinforcement Learning |
NIPS |
code |
46593 |
REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models |
NIPS |
code |
46593 |
Focal Loss for Dense Object Detection |
ICCV |
code |
18356 |
Mask R-CNN |
ICCV |
code |
9493 |
Deep Photo Style Transfer |
CVPR |
code |
8655 |
LightGBM: A Highly Efficient Gradient Boosting Decision Tree |
NIPS |
code |
7536 |
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation |
NIPS |
code |
6449 |
Attention is All you Need |
NIPS |
code |
6288 |
Large Pose 3D Face Reconstruction From a Single Image via Direct Volumetric CNN Regression |
ICCV |
code |
3354 |
Densely Connected Convolutional Networks |
CVPR |
code |
3130 |
A Unified Approach to Interpreting Model Predictions |
NIPS |
code |
3122 |
Deformable Convolutional Networks |
ICCV |
code |
2165 |
ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games |
NIPS |
code |
1823 |
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation |
CVPR |
code |
1523 |
Improved Training of Wasserstein GANs |
NIPS |
code |
1405 |
Fully Convolutional Instance-Aware Semantic Segmentation |
CVPR |
code |
1395 |
Aggregated Residual Transformations for Deep Neural Networks |
CVPR |
code |
1361 |
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network |
CVPR |
code |
1301 |
Unsupervised Image-to-Image Translation Networks |
NIPS |
code |
1205 |
Photographic Image Synthesis With Cascaded Refinement Networks |
ICCV |
code |
1142 |
High-Resolution Image Inpainting Using Multi-Scale Neural Patch Synthesis |
CVPR |
code |
1072 |
SphereFace: Deep Hypersphere Embedding for Face Recognition |
CVPR |
code |
1048 |
Deep Feature Flow for Video Recognition |
CVPR |
code |
966 |
Bayesian GAN |
NIPS |
code |
942 |
Pyramid Scene Parsing Network |
CVPR |
code |
934 |
Efficient Modeling of Latent Information in Supervised Learning using Gaussian Processes |
NIPS |
code |
906 |
Finding Tiny Faces |
CVPR |
code |
856 |
Toward Multimodal Image-to-Image Translation |
NIPS |
code |
794 |
Learning to Discover Cross-Domain Relations with Generative Adversarial Networks |
ICML |
code |
784 |
YOLO9000: Better, Faster, Stronger |
CVPR |
code |
773 |
PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space |
NIPS |
code |
772 |
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks |
ICML |
code |
729 |
FlowNet 2.0: Evolution of Optical Flow Estimation With Deep Networks |
CVPR |
code |
720 |
Channel Pruning for Accelerating Very Deep Neural Networks |
ICCV |
code |
649 |
Dilated Residual Networks |
CVPR |
code |
640 |
Inferring and Executing Programs for Visual Reasoning |
ICCV |
code |
636 |
DSOD: Learning Deeply Supervised Object Detectors From Scratch |
ICCV |
code |
582 |
Arbitrary Style Transfer in Real-Time With Adaptive Instance Normalization |
ICCV |
code |
572 |
Accelerating Eulerian Fluid Simulation With Convolutional Networks |
ICML |
code |
570 |
Learning Disentangled Representations with Semi-Supervised Deep Generative Models |
NIPS |
code |
556 |
Inductive Representation Learning on Large Graphs |
NIPS |
code |
552 |
Regressing Robust and Discriminative 3D Morphable Models With a Very Deep Neural Network |
CVPR |
code |
537 |
How Far Are We From Solving the 2D & 3D Face Alignment Problem? (And a Dataset of 230,000 3D Facial Landmarks) |
ICCV |
code |
526 |
SSH: Single Stage Headless Face Detector |
ICCV |
code |
515 |
Learning From Simulated and Unsupervised Images Through Adversarial Training |
CVPR |
code |
492 |
Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space |
CVPR |
code |
487 |
Video Frame Interpolation via Adaptive Convolution |
CVPR |
code |
482 |
Video Frame Interpolation via Adaptive Separable Convolution |
ICCV |
code |
482 |
GMS: Grid-based Motion Statistics for Fast, Ultra-Robust Feature Correspondence |
CVPR |
code |
460 |
Joint Detection and Identification Feature Learning for Person Search |
CVPR |
code |
459 |
Dual Path Networks |
NIPS |
code |
451 |
Flow-Guided Feature Aggregation for Video Object Detection |
ICCV |
code |
436 |
Deep Image Matting |
CVPR |
code |
434 |
Richer Convolutional Features for Edge Detection |
CVPR |
code |
399 |
Annotating Object Instances With a Polygon-RNN |
CVPR |
code |
397 |
Recurrent Highway Networks |
ICML |
code |
397 |
Detect to Track and Track to Detect |
ICCV |
code |
387 |
RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation |
CVPR |
code |
379 |
Detecting Oriented Text in Natural Images by Linking Segments |
CVPR |
code |
364 |
Deep Lattice Networks and Partial Monotonic Functions |
NIPS |
code |
349 |
Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results |
NIPS |
code |
347 |
RON: Reverse Connection With Objectness Prior Networks for Object Detection |
CVPR |
code |
345 |
Universal Style Transfer via Feature Transforms |
NIPS |
code |
344 |
Residual Attention Network for Image Classification |
CVPR |
code |
329 |
One-Shot Video Object Segmentation |
CVPR |
code |
316 |
Accurate Single Stage Detector Using Recurrent Rolling Convolution |
CVPR |
code |
314 |
Feature Pyramid Networks for Object Detection |
CVPR |
code |
310 |
Efficient softmax approximation for GPUs |
ICML |
code |
304 |
OctNet: Learning Deep 3D Representations at High Resolutions |
CVPR |
code |
302 |
Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution |
CVPR |
code |
301 |
Pixel Recursive Super Resolution |
ICCV |
code |
301 |
Self-Critical Sequence Training for Image Captioning |
CVPR |
code |
299 |
Age Progression/Regression by Conditional Adversarial Autoencoder |
CVPR |
code |
297 |
Style Transfer from Non-Parallel Text by Cross-Alignment |
NIPS |
code |
296 |
Dilated Recurrent Neural Networks |
NIPS |
code |
285 |
Lifting From the Deep: Convolutional 3D Pose Estimation From a Single Image |
CVPR |
code |
280 |
DeepBach: a Steerable Model for Bach Chorales Generation |
ICML |
code |
276 |
The Predictron: End-To-End Learning and Planning |
ICML |
code |
274 |
Convolutional Sequence to Sequence Learning |
ICML |
code |
258 |
OptNet: Differentiable Optimization as a Layer in Neural Networks |
ICML |
code |
245 |
Prototypical Networks for Few-shot Learning |
NIPS |
code |
244 |
Deep Voice: Real-time Neural Text-to-Speech |
ICML |
code |
242 |
Reinforcement Learning with Deep Energy-Based Policies |
ICML |
code |
233 |
Learning Deep CNN Denoiser Prior for Image Restoration |
CVPR |
code |
231 |
GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium |
NIPS |
code |
229 |
A Point Set Generation Network for 3D Object Reconstruction From a Single Image |
CVPR |
code |
228 |
Deeply Supervised Salient Object Detection With Short Connections |
CVPR |
code |
228 |
BlitzNet: A Real-Time Deep Network for Scene Understanding |
ICCV |
code |
227 |
Language Modeling with Gated Convolutional Networks |
ICML |
code |
221 |
Unlabeled Samples Generated by GAN Improve the Person Re-Identification Baseline in Vitro |
ICCV |
code |
215 |
Stacked Generative Adversarial Networks |
CVPR |
code |
215 |
RMPE: Regional Multi-Person Pose Estimation |
ICCV |
code |
215 |
Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning |
CVPR |
code |
214 |
Generative Face Completion |
CVPR |
code |
212 |
VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition |
ICCV |
code |
210 |
The Reversible Residual Network: Backpropagation Without Storing Activations |
NIPS |
code |
210 |
Recurrent Scale Approximation for Object Detection in CNN |
ICCV |
code |
209 |
Learning From Synthetic Humans |
CVPR |
code |
207 |
Spatially Adaptive Computation Time for Residual Networks |
CVPR |
code |
203 |
Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis |
ICCV |
code |
202 |
3D Bounding Box Estimation Using Deep Learning and Geometry |
CVPR |
code |
200 |
Multi-View 3D Object Detection Network for Autonomous Driving |
CVPR |
code |
199 |
Visual Dialog |
CVPR |
code |
199 |
Interpretable Explanations of Black Boxes by Meaningful Perturbation |
ICCV |
code |
192 |
Inverse Compositional Spatial Transformer Networks |
CVPR |
code |
189 |
FastMask: Segment Multi-Scale Object Candidates in One Shot |
CVPR |
code |
189 |
OnACID: Online Analysis of Calcium Imaging Data in Real Time |
NIPS |
code |
189 |
Semantic Scene Completion From a Single Depth Image |
CVPR |
code |
188 |
Learning Efficient Convolutional Networks Through Network Slimming |
ICCV |
code |
186 |
Learning Feature Pyramids for Human Pose Estimation |
ICCV |
code |
185 |
Be Your Own Prada: Fashion Synthesis With Structural Coherence |
ICCV |
code |
183 |
Scene Graph Generation by Iterative Message Passing |
CVPR |
code |
182 |
Fast Image Processing With Fully-Convolutional Networks |
ICCV |
code |
180 |
Learning Multiple Tasks with Multilinear Relationship Networks |
NIPS |
code |
178 |
Learning to Reason: End-To-End Module Networks for Visual Question Answering |
ICCV |
code |
178 |
Single Shot Text Detector With Regional Attention |
ICCV |
code |
176 |
Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment With Limited Resources |
ICCV |
code |
175 |
Deep Feature Interpolation for Image Content Changes |
CVPR |
code |
170 |
On Human Motion Prediction Using Recurrent Neural Networks |
CVPR |
code |
167 |
Image Super-Resolution via Deep Recursive Residual Network |
CVPR |
code |
163 |
Learning Cross-Modal Embeddings for Cooking Recipes and Food Images |
CVPR |
code |
160 |
Input Convex Neural Networks |
ICML |
code |
159 |
Simple Does It: Weakly Supervised Instance and Semantic Segmentation |
CVPR |
code |
159 |
Low-Shot Visual Recognition by Shrinking and Hallucinating Features |
ICCV |
code |
158 |
Oriented Response Networks |
CVPR |
code |
157 |
Soft Proposal Networks for Weakly Supervised Object Localization |
ICCV |
code |
154 |
Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks |
ICML |
code |
147 |
Axiomatic Attribution for Deep Networks |
ICML |
code |
146 |
Gradient Episodic Memory for Continual Learning |
NIPS |
code |
146 |
DSAC - Differentiable RANSAC for Camera Localization |
CVPR |
code |
144 |
Attend to You: Personalized Image Captioning With Context Sequence Memory Networks |
CVPR |
code |
143 |
Conditional Similarity Networks |
CVPR |
code |
142 |
Language Modeling with Recurrent Highway Hypernetworks |
NIPS |
code |
141 |
Triple Generative Adversarial Nets |
NIPS |
code |
138 |
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning |
NIPS |
code |
138 |
One-Sided Unsupervised Domain Mapping |
NIPS |
code |
137 |
Detecting Visual Relationships With Deep Relational Networks |
CVPR |
code |
137 |
Attentive Recurrent Comparators |
ICML |
code |
136 |
Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach |
ICCV |
code |
136 |
Learning a Multi-View Stereo Machine |
NIPS |
code |
135 |
Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model |
NIPS |
code |
134 |
Multi-Context Attention for Human Pose Estimation |
CVPR |
code |
131 |
Controlling Perceptual Factors in Neural Style Transfer |
CVPR |
code |
130 |
Bayesian Compression for Deep Learning |
NIPS |
code |
130 |
Adversarial Discriminative Domain Adaptation |
CVPR |
code |
129 |
Working hard to know your neighbor's margins: Local descriptor learning loss |
NIPS |
code |
128 |
Concrete Dropout |
NIPS |
code |
127 |
SegFlow: Joint Learning for Video Object Segmentation and Optical Flow |
ICCV |
code |
127 |
Segmentation-Aware Convolutional Networks Using Local Attention Masks |
ICCV |
code |
126 |
Detail-Revealing Deep Video Super-Resolution |
ICCV |
code |
126 |
CREST: Convolutional Residual Learning for Visual Tracking |
ICCV |
code |
126 |
Discriminative Correlation Filter With Channel and Spatial Reliability |
CVPR |
code |
124 |
SVDNet for Pedestrian Retrieval |
ICCV |
code |
121 |
Semantic Image Synthesis via Adversarial Learning |
ICCV |
code |
121 |
Spatiotemporal Multiplier Networks for Video Action Recognition |
CVPR |
code |
121 |
PoseTrack: Joint Multi-Person Pose Estimation and Tracking |
CVPR |
code |
121 |
Hierarchical Attentive Recurrent Tracking |
NIPS |
code |
121 |
Good Semi-supervised Learning That Requires a Bad GAN |
NIPS |
code |
120 |
Deep Watershed Transform for Instance Segmentation |
CVPR |
code |
120 |
Associative Domain Adaptation |
ICCV |
code |
119 |
Learning by Association -- A Versatile Semi-Supervised Training Method for Neural Networks |
CVPR |
code |
119 |
Value Prediction Network |
NIPS |
code |
119 |
Unrestricted Facial Geometry Reconstruction Using Image-To-Image Translation |
ICCV |
code |
119 |
MemNet: A Persistent Memory Network for Image Restoration |
ICCV |
code |
119 |
Bayesian Optimization with Gradients |
NIPS |
code |
117 |
TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning |
NIPS |
code |
117 |
Compressed Sensing using Generative Models |
ICML |
code |
116 |
Switching Convolutional Neural Network for Crowd Counting |
CVPR |
code |
116 |
WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation |
CVPR |
code |
116 |
Show, Adapt and Tell: Adversarial Training of Cross-Domain Image Captioner |
ICCV |
code |
115 |
Video Frame Synthesis Using Deep Voxel Flow |
ICCV |
code |
114 |
Multiple Instance Detection Network With Online Instance Classifier Refinement |
CVPR |
code |
113 |
Deep Pyramidal Residual Networks |
CVPR |
code |
112 |
Train longer, generalize better: closing the generalization gap in large batch training of neural networks |
NIPS |
code |
112 |
Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction |
CVPR |
code |
110 |
Unite the People: Closing the Loop Between 3D and 2D Human Representations |
CVPR |
code |
110 |
Learning Combinatorial Optimization Algorithms over Graphs |
NIPS |
code |
109 |
FeUdal Networks for Hierarchical Reinforcement Learning |
ICML |
code |
107 |
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression |
ICCV |
code |
105 |
Learning a Deep Embedding Model for Zero-Shot Learning |
CVPR |
code |
104 |
ECO: Efficient Convolution Operators for Tracking |
CVPR |
code |
103 |
SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning |
CVPR |
code |
102 |
Multi-View Supervision for Single-View Reconstruction via Differentiable Ray Consistency |
CVPR |
code |
100 |
Task-based End-to-end Model Learning in Stochastic Optimization |
NIPS |
code |
100 |
Learning to Compose Domain-Specific Transformations for Data Augmentation |
NIPS |
code |
97 |
Genetic CNN |
ICCV |
code |
97 |
HashNet: Deep Learning to Hash by Continuation |
ICCV |
code |
97 |
Interleaved Group Convolutions |
ICCV |
code |
95 |
Deeply-Learned Part-Aligned Representations for Person Re-Identification |
ICCV |
code |
95 |
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model |
NIPS |
code |
94 |
Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation |
CVPR |
code |
93 |
Octree Generating Networks: Efficient Convolutional Architectures for High-Resolution 3D Outputs |
ICCV |
code |
92 |
Semantic Autoencoder for Zero-Shot Learning |
CVPR |
code |
92 |
Deep Hyperspherical Learning |
NIPS |
code |
92 |
Decoupled Neural Interfaces using Synthetic Gradients |
ICML |
code |
90 |
Geometric Matrix Completion with Recurrent Multi-Graph Neural Networks |
NIPS |
code |
90 |
Practical Bayesian Optimization for Model Fitting with Bayesian Adaptive Direct Search |
NIPS |
code |
90 |
Optical Flow Estimation Using a Spatial Pyramid Network |
CVPR |
code |
90 |
AMC: Attention guided Multi-modal Correlation Learning for Image Search |
CVPR |
code |
90 |
Deep Video Deblurring for Hand-Held Cameras |
CVPR |
code |
89 |
Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data |
NIPS |
code |
88 |
Causal Effect Inference with Deep Latent-Variable Models |
NIPS |
code |
87 |
GANs for Biological Image Synthesis |
ICCV |
code |
85 |
MMD GAN: Towards Deeper Understanding of Moment Matching Network |
NIPS |
code |
84 |
Representation Learning by Learning to Count |
ICCV |
code |
84 |
Optical Flow in Mostly Rigid Scenes |
CVPR |
code |
83 |
Fast-Slow Recurrent Neural Networks |
NIPS |
code |
82 |
Unsupervised Video Summarization With Adversarial LSTM Networks |
CVPR |
code |
82 |
Constrained Policy Optimization |
ICML |
code |
81 |
A-NICE-MC: Adversarial Training for MCMC |
NIPS |
code |
80 |
Coarse-To-Fine Volumetric Prediction for Single-Image 3D Human Pose |
CVPR |
code |
80 |
End-To-End Instance Segmentation With Recurrent Attention |
CVPR |
code |
78 |
DeLiGAN : Generative Adversarial Networks for Diverse and Limited Data |
CVPR |
code |
78 |
Learning Shape Abstractions by Assembling Volumetric Primitives |
CVPR |
code |
77 |
Local Binary Convolutional Neural Networks |
CVPR |
code |
77 |
Raster-To-Vector: Revisiting Floorplan Transformation |
ICCV |
code |
76 |
Positive-Unlabeled Learning with Non-Negative Risk Estimator |
NIPS |
code |
76 |
Hard-Aware Deeply Cascaded Embedding |
ICCV |
code |
75 |
Deep Image Harmonization |
CVPR |
code |
73 |
Shape Completion Using 3D-Encoder-Predictor CNNs and Shape Synthesis |
CVPR |
code |
73 |
Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade |
CVPR |
code |
73 |
Improved Stereo Matching With Constant Highway Networks and Reflective Confidence Learning |
CVPR |
code |
72 |
Query-Guided Regression Network With Context Policy for Phrase Grounding |
ICCV |
code |
72 |
Top-Down Visual Saliency Guided by Captions |
CVPR |
code |
72 |
Feedback Networks |
CVPR |
code |
72 |
What Actions Are Needed for Understanding Human Actions in Videos? |
ICCV |
code |
71 |
Xception: Deep Learning With Depthwise Separable Convolutions |
CVPR |
code |
71 |
Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning |
CVPR |
code |
71 |
Video Propagation Networks |
CVPR |
code |
70 |
Image-To-Image Translation With Conditional Adversarial Networks |
CVPR |
code |
70 |
Quality Aware Network for Set to Set Recognition |
CVPR |
code |
69 |
Self-Supervised Learning of Visual Features Through Embedding Images Into Text Topic Spaces |
CVPR |
code |
69 |
Deep Subspace Clustering Networks |
NIPS |
code |
68 |
Escape From Cells: Deep Kd-Networks for the Recognition of 3D Point Cloud Models |
ICCV |
code |
68 |
A Distributional Perspective on Reinforcement Learning |
ICML |
code |
68 |
Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks |
CVPR |
code |
67 |
Deep Transfer Learning with Joint Adaptation Networks |
ICML |
code |
67 |
Training Deep Networks without Learning Rates Through Coin Betting |
NIPS |
code |
66 |
Full Resolution Image Compression With Recurrent Neural Networks |
CVPR |
code |
66 |
SurfaceNet: An End-To-End 3D Neural Network for Multiview Stereopsis |
ICCV |
code |
66 |
Doubly Stochastic Variational Inference for Deep Gaussian Processes |
NIPS |
code |
66 |
TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals |
ICCV |
code |
66 |
Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-Identification |
ICCV |
code |
65 |
Synthesizing 3D Shapes via Modeling Multi-View Depth Maps and Silhouettes With Deep Generative Networks |
CVPR |
code |
65 |
Dance Dance Convolution |
ICML |
code |
65 |
Borrowing Treasures From the Wealthy: Deep Transfer Learning Through Selective Joint Fine-Tuning |
CVPR |
code |
64 |
Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes |
ICCV |
code |
64 |
Toward Controlled Generation of Text |
ICML |
code |
63 |
Person Re-Identification in the Wild |
CVPR |
code |
63 |
ALICE: Towards Understanding Adversarial Learning for Joint Distribution Matching |
NIPS |
code |
63 |
Differentiable Learning of Logical Rules for Knowledge Base Reasoning |
NIPS |
code |
62 |
Person Search With Natural Language Description |
CVPR |
code |
61 |
Multi-Channel Weighted Nuclear Norm Minimization for Real Color Image Denoising |
ICCV |
code |
61 |
Playing for Benchmarks |
ICCV |
code |
61 |
Unsupervised Learning by Predicting Noise |
ICML |
code |
60 |
Localizing Moments in Video With Natural Language |
ICCV |
code |
60 |
End-To-End 3D Face Reconstruction With Deep Neural Networks |
CVPR |
code |
60 |
CoupleNet: Coupling Global Structure With Local Parts for Object Detection |
ICCV |
code |
59 |
AdaGAN: Boosting Generative Models |
NIPS |
code |
59 |
Convolutional Gaussian Processes |
NIPS |
code |
57 |
A Deep Regression Architecture With Two-Stage Re-Initialization for High Performance Facial Landmark Detection |
CVPR |
code |
57 |
Modeling Relationships in Referential Expressions With Compositional Modular Networks |
CVPR |
code |
57 |
Curiosity-driven Exploration by Self-supervised Prediction |
ICML |
code |
56 |
Wavelet-SRNet: A Wavelet-Based CNN for Multi-Scale Face Super Resolution |
ICCV |
code |
56 |
The Neural Hawkes Process: A Neurally Self-Modulating Multivariate Point Process |
NIPS |
code |
56 |
Online and Linear-Time Attention by Enforcing Monotonic Alignments |
ICML |
code |
56 |
Neural Expectation Maximization |
NIPS |
code |
56 |
Dense-Captioning Events in Videos |
ICCV |
code |
55 |
Factorized Bilinear Models for Image Recognition |
ICCV |
code |
55 |
Net-Trim: Convex Pruning of Deep Neural Networks with Performance Guarantee |
NIPS |
code |
54 |
On-the-fly Operation Batching in Dynamic Computation Graphs |
NIPS |
code |
54 |
Visual Translation Embedding Network for Visual Relation Detection |
CVPR |
code |
54 |
Learning Blind Motion Deblurring |
ICCV |
code |
54 |
A Disentangled Recognition and Nonlinear Dynamics Model for Unsupervised Learning |
NIPS |
code |
53 |
Towards Diverse and Natural Image Descriptions via a Conditional GAN |
ICCV |
code |
53 |
CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos |
CVPR |
code |
53 |
A Generic Deep Architecture for Single Image Reflection Removal and Image Smoothing |
ICCV |
code |
52 |
Deep IV: A Flexible Approach for Counterfactual Prediction |
ICML |
code |
52 |
Triangle Generative Adversarial Networks |
NIPS |
code |
51 |
EAST: An Efficient and Accurate Scene Text Detector |
CVPR |
code |
51 |
SST: Single-Stream Temporal Action Proposals |
CVPR |
code |
51 |
Predicting Deeper Into the Future of Semantic Segmentation |
ICCV |
code |
51 |
L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space |
CVPR |
code |
51 |
TALL: Temporal Activity Localization via Language Query |
ICCV |
code |
50 |
Hybrid Reward Architecture for Reinforcement Learning |
NIPS |
code |
50 |
Fast Fourier Color Constancy |
CVPR |
code |
49 |
Modulating early visual processing by language |
NIPS |
code |
49 |
Adversarial Examples for Semantic Segmentation and Object Detection |
ICCV |
code |
49 |
Learning Discrete Representations via Information Maximizing Self-Augmented Training |
ICML |
code |
49 |
Efficient Diffusion on Region Manifolds: Recovering Small Objects With Compact CNN Representations |
CVPR |
code |
48 |
Real Time Image Saliency for Black Box Classifiers |
NIPS |
code |
48 |
FC4: Fully Convolutional Color Constancy With Confidence-Weighted Pooling |
CVPR |
code |
47 |
Multiple People Tracking by Lifted Multicut and Person Re-Identification |
CVPR |
code |
47 |
Learned D-AMP: Principled Neural Network based Compressive Image Recovery |
NIPS |
code |
47 |
GP CaKe: Effective brain connectivity with causal kernels |
NIPS |
code |
46 |
Predicting Organic Reaction Outcomes with Weisfeiler-Lehman Network |
NIPS |
code |
46 |
Semantic Video CNNs Through Representation Warping |
ICCV |
code |
46 |
Grammar Variational Autoencoder |
ICML |
code |
46 |
EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis |
ICCV |
code |
46 |
Safe Model-based Reinforcement Learning with Stability Guarantees |
NIPS |
code |
45 |
Deep Spectral Clustering Learning |
ICML |
code |
45 |
Semantic Compositional Networks for Visual Captioning |
CVPR |
code |
45 |
On-Demand Learning for Deep Image Restoration |
ICCV |
code |
45 |
Video Pixel Networks |
ICML |
code |
45 |
Stabilizing Training of Generative Adversarial Networks through Regularization |
NIPS |
code |
45 |
Structured Bayesian Pruning via Log-Normal Multiplicative Noise |
NIPS |
code |
44 |
Deriving Neural Architectures from Sequence and Graph Kernels |
ICML |
code |
44 |
Masked Autoregressive Flow for Density Estimation |
NIPS |
code |
44 |
Unsupervised Adaptation for Deep Stereo |
ICCV |
code |
44 |
Learning Residual Images for Face Attribute Manipulation |
CVPR |
code |
43 |
Learning to Generate Long-term Future via Hierarchical Prediction |
ICML |
code |
43 |
Accurate Optical Flow via Direct Cost Volume Processing |
CVPR |
code |
42 |
Generalized Orderless Pooling Performs Implicit Salient Matching |
ICCV |
code |
42 |
Comparative Evaluation of Hand-Crafted and Learned Local Features |
CVPR |
code |
42 |
SchNet: A continuous-filter convolutional neural network for modeling quantum interactions |
NIPS |
code |
41 |
Temporal Generative Adversarial Nets With Singular Value Clipping |
ICCV |
code |
41 |
Multiplicative Normalizing Flows for Variational Bayesian Neural Networks |
ICML |
code |
41 |
Neural Scene De-Rendering |
CVPR |
code |
40 |
Semantic Image Inpainting With Deep Generative Models |
CVPR |
code |
40 |
A Linear-Time Kernel Goodness-of-Fit Test |
NIPS |
code |
40 |
Least Squares Generative Adversarial Networks |
ICCV |
code |
39 |
Diversified Texture Synthesis With Feed-Forward Networks |
CVPR |
code |
39 |
No Fuss Distance Metric Learning Using Proxies |
ICCV |
code |
38 |
Template Matching With Deformable Diversity Similarity |
CVPR |
code |
38 |
What's in a Question: Using Visual Questions as a Form of Supervision |
CVPR |
code |
38 |
Face Normals "In-The-Wild" Using Fully Convolutional Networks |
CVPR |
code |
38 |
Conditional Image Synthesis with Auxiliary Classifier GANs |
ICML |
code |
37 |
Neural Episodic Control |
ICML |
code |
37 |
3D-PRNN: Generating Shape Primitives With Recurrent Neural Networks |
ICCV |
code |
37 |
Structured Embedding Models for Grouped Data |
NIPS |
code |
36 |
Learning Active Learning from Data |
NIPS |
code |
36 |
Unified Deep Supervised Domain Adaptation and Generalization |
ICCV |
code |
35 |
Transformation-Grounded Image Generation Network for Novel 3D View Synthesis |
CVPR |
code |
35 |
Structured Attentions for Visual Question Answering |
ICCV |
code |
34 |
Geometric Loss Functions for Camera Pose Regression With Deep Learning |
CVPR |
code |
34 |
VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization |
CVPR |
code |
34 |
QMDP-Net: Deep Learning for Planning under Partial Observability |
NIPS |
code |
34 |
Using Ranking-CNN for Age Estimation |
CVPR |
code |
33 |
Hierarchical Boundary-Aware Neural Encoder for Video Captioning |
CVPR |
code |
33 |
Unsupervised Learning of Disentangled Representations from Video |
NIPS |
code |
32 |
Deep Learning on Lie Groups for Skeleton-Based Action Recognition |
CVPR |
code |
32 |
Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection |
CVPR |
code |
32 |
3D Point Cloud Registration for Localization Using a Deep Neural Network Auto-Encoder |
CVPR |
code |
32 |
StyleNet: Generating Attractive Visual Captions With Styles |
CVPR |
code |
32 |
Dynamic Word Embeddings |
ICML |
code |
32 |
Learning to Prune Deep Neural Networks via Layer-wise Optimal Brain Surgeon |
NIPS |
code |
31 |
Continual Learning Through Synaptic Intelligence |
ICML |
code |
31 |
Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes |
CVPR |
code |
31 |
Learning Detection With Diverse Proposals |
CVPR |
code |
31 |
LCNN: Lookup-Based Convolutional Neural Network |
CVPR |
code |
31 |
Towards Accurate Multi-Person Pose Estimation in the Wild |
CVPR |
code |
30 |
Real-Time Neural Style Transfer for Videos |
CVPR |
code |
30 |
Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training |
ICCV |
code |
30 |
Deep Co-Occurrence Feature Learning for Visual Object Recognition |
CVPR |
code |
29 |
Joint distribution optimal transportation for domain adaptation |
NIPS |
code |
29 |
Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields |
CVPR |
code |
29 |
SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Parallelization |
ICML |
code |
29 |
The Statistical Recurrent Unit |
ICML |
code |
29 |
A Unified Approach of Multi-Scale Deep and Hand-Crafted Features for Defocus Estimation |
CVPR |
code |
28 |
Learning Spread-Out Local Feature Descriptors |
ICCV |
code |
28 |
Event-Based Visual Inertial Odometry |
CVPR |
code |
27 |
DropoutNet: Addressing Cold Start in Recommender Systems |
NIPS |
code |
27 |
Phrase Localization and Visual Relationship Detection With Comprehensive Image-Language Cues |
ICCV |
code |
27 |
Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations |
CVPR |
code |
27 |
Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360° Sports Videos |
CVPR |
code |
27 |
Neural Message Passing for Quantum Chemistry |
ICML |
code |
27 |
State-Frequency Memory Recurrent Neural Networks |
ICML |
code |
27 |
DeepCD: Learning Deep Complementary Descriptors for Patch Representations |
ICCV |
code |
26 |
Contrastive Learning for Image Captioning |
NIPS |
code |
26 |
Stochastic Optimization with Variance Reduction for Infinite Datasets with Finite Sum Structure |
NIPS |
code |
26 |
Learning High Dynamic Range From Outdoor Panoramas |
ICCV |
code |
26 |
Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors |
CVPR |
code |
26 |
Learning to Detect Salient Objects With Image-Level Supervision |
CVPR |
code |
26 |
Improved Variational Autoencoders for Text Modeling using Dilated Convolutions |
ICML |
code |
26 |
Interspecies Knowledge Transfer for Facial Keypoint Detection |
CVPR |
code |
25 |
YASS: Yet Another Spike Sorter |
NIPS |
code |
25 |
Open Set Domain Adaptation |
ICCV |
code |
25 |
Domain-Adaptive Deep Network Compression |
ICCV |
code |
24 |
Long Short-Term Memory Kalman Filters: Recurrent Neural Estimators for Pose Regularization |
ICCV |
code |
24 |
Temporal Context Network for Activity Localization in Videos |
ICCV |
code |
24 |
Incremental Learning of Object Detectors Without Catastrophic Forgetting |
ICCV |
code |
24 |
Dense Captioning With Joint Inference and Visual Context |
CVPR |
code |
24 |
Universal Adversarial Perturbations |
CVPR |
code |
24 |
Asymmetric Tri-training for Unsupervised Domain Adaptation |
ICML |
code |
24 |
Reducing Reparameterization Gradient Variance |
NIPS |
code |
24 |
Exploiting Saliency for Object Segmentation From Image Level Labels |
CVPR |
code |
24 |
A Dirichlet Mixture Model of Hawkes Processes for Event Sequence Clustering |
NIPS |
code |
24 |
Shading Annotations in the Wild |
CVPR |
code |
24 |
Straight to Shapes: Real-Time Detection of Encoded Shapes |
CVPR |
code |
23 |
Dual Discriminator Generative Adversarial Nets |
NIPS |
code |
23 |
Zero-Order Reverse Filtering |
ICCV |
code |
23 |
Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net |
NIPS |
code |
23 |
Learning Spherical Convolution for Fast Features from 360° Imagery |
NIPS |
code |
22 |
Learning to Detect Sepsis with a Multitask Gaussian Process RNN Classifier |
ICML |
code |
22 |
Deep Cross-Modal Hashing |
CVPR |
code |
22 |
When Unsupervised Domain Adaptation Meets Tensor Representations |
ICCV |
code |
22 |
Image Super-Resolution Using Dense Skip Connections |
ICCV |
code |
22 |
Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer |
CVPR |
code |
22 |
STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling |
CVPR |
code |
22 |
Learning Continuous Semantic Representations of Symbolic Expressions |
ICML |
code |
22 |
Deep Growing Learning |
ICCV |
code |
21 |
Combined Group and Exclusive Sparsity for Deep Neural Networks |
ICML |
code |
21 |
Hash Embeddings for Efficient Word Representations |
NIPS |
code |
21 |
Accuracy First: Selecting a Differential Privacy Level for Accuracy Constrained ERM |
NIPS |
code |
21 |
Disentangled Representation Learning GAN for Pose-Invariant Face Recognition |
CVPR |
code |
21 |
Learning to Pivot with Adversarial Networks |
NIPS |
code |
21 |
Learning Dynamic Siamese Network for Visual Object Tracking |
ICCV |
code |
21 |
POSEidon: Face-From-Depth for Driver Pose Estimation |
CVPR |
code |
20 |
Deep Metric Learning via Facility Location |
CVPR |
code |
20 |
Automatic Spatially-Aware Fashion Concept Discovery |
ICCV |
code |
20 |
The Numerics of GANs |
NIPS |
code |
20 |
From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur |
CVPR |
code |
20 |
Unpaired Image-To-Image Translation Using Cycle-Consistent Adversarial Networks |
ICCV |
code |
20 |
Zero-Inflated Exponential Family Embeddings |
ICML |
code |
20 |
InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations |
NIPS |
code |
20 |
Weakly-Supervised Learning of Visual Relations |
ICCV |
code |
20 |
Multi-Label Image Recognition by Recurrently Discovering Attentional Regions |
ICCV |
code |
20 |
Scene Parsing With Global Context Embedding |
ICCV |
code |
20 |
Context Selection for Embedding Models |
NIPS |
code |
20 |
Deep Mean-Shift Priors for Image Restoration |
NIPS |
code |
20 |
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition |
CVPR |
code |
20 |
Fully-Adaptive Feature Sharing in Multi-Task Networks With Applications in Person Attribute Classification |
CVPR |
code |
19 |
Learning Compact Geometric Features |
ICCV |
code |
19 |
Structured Generative Adversarial Networks |
NIPS |
code |
19 |
Joint Gap Detection and Inpainting of Line Drawings |
CVPR |
code |
19 |
Chained Multi-Stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection |
ICCV |
code |
19 |
Adversarial Feature Matching for Text Generation |
ICML |
code |
18 |
BIER - Boosting Independent Embeddings Robustly |
ICCV |
code |
18 |
Predictive-Corrective Networks for Action Detection |
CVPR |
code |
18 |
Stochastic Generative Hashing |
ICML |
code |
18 |
A Bayesian Data Augmentation Approach for Learning Deep Models |
NIPS |
code |
18 |
Attentive Semantic Video Generation Using Captions |
ICCV |
code |
18 |
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network |
CVPR |
code |
18 |
Deep Unsupervised Similarity Learning Using Partially Ordered Sets |
CVPR |
code |
17 |
DualNet: Learn Complementary Features for Image Recognition |
ICCV |
code |
17 |
Neural system identification for large populations separating “what” and “where” |
NIPS |
code |
17 |
FALKON: An Optimal Large Scale Kernel Method |
NIPS |
code |
17 |
Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks |
CVPR |
code |
17 |
Deep Learning with Topological Signatures |
NIPS |
code |
17 |
Streaming Sparse Gaussian Process Approximations |
NIPS |
code |
17 |
RPAN: An End-To-End Recurrent Pose-Attention Network for Action Recognition in Videos |
ICCV |
code |
17 |
Awesome Typography: Statistics-Based Text Effects Transfer |
CVPR |
code |
17 |
RoomNet: End-To-End Room Layout Estimation |
ICCV |
code |
17 |
Deep Spatial-Semantic Attention for Fine-Grained Sketch-Based Image Retrieval |
ICCV |
code |
16 |
Deep Supervised Discrete Hashing |
NIPS |
code |
16 |
Few-Shot Learning Through an Information Retrieval Lens |
NIPS |
code |
16 |
Estimating Accuracy from Unlabeled Data: A Probabilistic Logic Approach |
NIPS |
code |
16 |
Learning to Push the Limits of Efficient FFT-Based Image Deconvolution |
ICCV |
code |
16 |
Federated Multi-Task Learning |
NIPS |
code |
16 |
Label Distribution Learning Forests |
NIPS |
code |
16 |
Deep Multitask Architecture for Integrated 2D and 3D Human Sensing |
CVPR |
code |
16 |
Estimating Mutual Information for Discrete-Continuous Mixtures |
NIPS |
code |
16 |
Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes |
CVPR |
code |
16 |
StyleBank: An Explicit Representation for Neural Image Style Transfer |
CVPR |
code |
16 |
Surface Normals in the Wild |
ICCV |
code |
15 |
Automatic Discovery of the Statistical Types of Variables in a Dataset |
ICML |
code |
15 |
Learning Diverse Image Colorization |
CVPR |
code |
15 |
Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems |
ICCV |
code |
15 |
Non-Local Deep Features for Salient Object Detection |
CVPR |
code |
15 |
Structure-Measure: A New Way to Evaluate Foreground Maps |
ICCV |
code |
15 |
Shallow Updates for Deep Reinforcement Learning |
NIPS |
code |
15 |
Wasserstein Generative Adversarial Networks |
ICML |
code |
15 |
Recurrent 3D Pose Sequence Machines |
CVPR |
code |
15 |
Variational Dropout Sparsifies Deep Neural Networks |
ICML |
code |
15 |
Captioning Images With Diverse Objects |
CVPR |
code |
15 |
Off-policy evaluation for slate recommendation |
NIPS |
code |
15 |
Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning |
ICCV |
code |
14 |
Benchmarking Denoising Algorithms With Real Photographs |
CVPR |
code |
14 |
Neural Aggregation Network for Video Face Recognition |
CVPR |
code |
14 |
Learned Contextual Feature Reweighting for Image Geo-Localization |
CVPR |
code |
14 |
Streaming Weak Submodularity: Interpreting Neural Networks on the Fly |
NIPS |
code |
14 |
CVAE-GAN: Fine-Grained Image Generation Through Asymmetric Training |
ICCV |
code |
14 |
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation |
ICCV |
code |
14 |
Spherical convolutions and their application in molecular modelling |
NIPS |
code |
14 |
Multi-Information Source Optimization |
NIPS |
code |
14 |
Convolutional Neural Network Architecture for Geometric Matching |
CVPR |
code |
14 |
Neural Face Editing With Intrinsic Image Disentangling |
CVPR |
code |
14 |
Realistic Dynamic Facial Textures From a Single Image Using GANs |
ICCV |
code |
14 |
Predictive State Recurrent Neural Networks |
NIPS |
code |
13 |
Deep TextSpotter: An End-To-End Trainable Scene Text Localization and Recognition Framework |
ICCV |
code |
13 |
ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events |
NIPS |
code |
13 |
Hunt For The Unique, Stable, Sparse And Fast Feature Learning On Graphs |
NIPS |
code |
13 |
Consensus Convolutional Sparse Coding |
ICCV |
code |
13 |
Weakly Supervised Affordance Detection |
CVPR |
code |
13 |
Joint Learning of Object and Action Detectors |
ICCV |
code |
13 |
Light Field Blind Motion Deblurring |
CVPR |
code |
13 |
Asynchronous Stochastic Gradient Descent with Delay Compensation |
ICML |
code |
13 |
Unrolled Memory Inner-Products: An Abstract GPU Operator for Efficient Vision-Related Computations |
ICCV |
code |
12 |
Maximizing Subset Accuracy with Recurrent Neural Networks in Multi-label Classification |
NIPS |
code |
12 |
Self-Organized Text Detection With Minimal Post-Processing via Border Learning |
ICCV |
code |
12 |
Coordinated Multi-Agent Imitation Learning |
ICML |
code |
12 |
Gradient descent GAN optimization is locally stable |
NIPS |
code |
12 |
Removing Rain From Single Images via a Deep Detail Network |
CVPR |
code |
12 |
Convexified Convolutional Neural Networks |
ICML |
code |
12 |
Multigrid Neural Architectures |
CVPR |
code |
12 |
VegFru: A Domain-Specific Dataset for Fine-Grained Visual Categorization |
ICCV |
code |
12 |
Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin |
NIPS |
code |
12 |
Differential Angular Imaging for Material Recognition |
CVPR |
code |
12 |
A Multilayer-Based Framework for Online Background Subtraction With Freely Moving Cameras |
ICCV |
code |
11 |
Formal Guarantees on the Robustness of a Classifier against Adversarial Manipulation |
NIPS |
code |
11 |
Max-value Entropy Search for Efficient Bayesian Optimization |
ICML |
code |
11 |
Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization |
ICCV |
code |
11 |
Generalized Deep Image to Image Regression |
CVPR |
code |
11 |
Adversarial Image Perturbation for Privacy Protection -- A Game Theory Perspective |
ICCV |
code |
11 |
Predicting Human Activities Using Stochastic Grammar |
ICCV |
code |
11 |
DESIRE: Distant Future Prediction in Dynamic Scenes With Interacting Agents |
CVPR |
code |
11 |
Fisher GAN |
NIPS |
code |
11 |
High-Order Attention Models for Visual Question Answering |
NIPS |
code |
11 |
IM2CAD |
CVPR |
code |
11 |
On Fairness and Calibration |
NIPS |
code |
11 |
DeepPermNet: Visual Permutation Learning |
CVPR |
code |
10 |
f-GANs in an Information Geometric Nutshell |
NIPS |
code |
10 |
Revisiting IM2GPS in the Deep Learning Era |
ICCV |
code |
10 |
Attentional Correlation Filter Network for Adaptive Visual Tracking |
CVPR |
code |
10 |
Learning Cross-Modal Deep Representations for Robust Pedestrian Detection |
CVPR |
code |
10 |
Confident Multiple Choice Learning |
ICML |
code |
10 |
Curriculum Dropout |
ICCV |
code |
9 |
Cognitive Mapping and Planning for Visual Navigation |
CVPR |
code |
9 |
Optimized Pre-Processing for Discrimination Prevention |
NIPS |
code |
9 |
Learning Motion Patterns in Videos |
CVPR |
code |
9 |
Scalable Log Determinants for Gaussian Process Kernel Learning |
NIPS |
code |
9 |
A Hierarchical Approach for Generating Descriptive Image Paragraphs |
CVPR |
code |
9 |
Deep Crisp Boundaries |
CVPR |
code |
9 |
Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Optimization |
NIPS |
code |
9 |
Practical Data-Dependent Metric Compression with Provable Guarantees |
NIPS |
code |
9 |
Do Deep Neural Networks Suffer from Crowding? |
NIPS |
code |
9 |
A Non-Convex Variational Approach to Photometric Stereo Under Inaccurate Lighting |
CVPR |
code |
9 |
End-To-End Learning of Geometry and Context for Deep Stereo Regression |
ICCV |
code |
9 |
From Bayesian Sparsity to Gated Recurrent Nets |
NIPS |
code |
8 |
Regret Minimization in MDPs with Options without Prior Knowledge |
NIPS |
code |
8 |
Following Gaze in Video |
ICCV |
code |
8 |
Model-Powered Conditional Independence Test |
NIPS |
code |
8 |
Cost efficient gradient boosting |
NIPS |
code |
8 |
Reflectance Adaptive Filtering Improves Intrinsic Image Estimation |
CVPR |
code |
8 |
DeepNav: Learning to Navigate Large Cities |
CVPR |
code |
8 |
Look, Listen and Learn |
ICCV |
code |
8 |
Attention-Aware Face Hallucination via Deep Reinforcement Learning |
CVPR |
code |
8 |
Plan, Attend, Generate: Planning for Sequence-to-Sequence Models |
NIPS |
code |
8 |
Introspective Neural Networks for Generative Modeling |
ICCV |
code |
8 |
Affinity Clustering: Hierarchical Clustering at Scale |
NIPS |
code |
8 |
Gaze Embeddings for Zero-Shot Image Classification |
CVPR |
code |
8 |
Input Switched Affine Networks: An RNN Architecture Designed for Interpretability |
ICML |
code |
8 |
Online multiclass boosting |
NIPS |
code |
8 |
Towards a Visual Privacy Advisor: Understanding and Predicting Privacy Risks in Images |
ICCV |
code |
8 |
SubUNets: End-To-End Hand Shape and Continuous Sign Language Recognition |
ICCV |
code |
7 |
Learning Koopman Invariant Subspaces for Dynamic Mode Decomposition |
NIPS |
code |
7 |
Unsupervised Monocular Depth Estimation With Left-Right Consistency |
CVPR |
code |
7 |
Personalized Image Aesthetics |
ICCV |
code |
7 |
Reasoning About Fine-Grained Attribute Phrases Using Reference Games |
ICCV |
code |
7 |
Lost Relatives of the Gumbel Trick |
ICML |
code |
7 |
Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction |
ICCV |
code |
7 |
Centered Weight Normalization in Accelerating Training of Deep Neural Networks |
ICCV |
code |
6 |
Scalable Planning with Tensorflow for Hybrid Nonlinear Domains |
NIPS |
code |
6 |
Convex Global 3D Registration With Lagrangian Duality |
CVPR |
code |
6 |
Building a Regular Decision Boundary With Deep Networks |
CVPR |
code |
6 |
Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification |
CVPR |
code |
6 |
Forecasting Human Dynamics From Static Images |
CVPR |
code |
6 |
AOD-Net: All-In-One Dehazing Network |
ICCV |
code |
6 |
K-Medoids For K-Means Seeding |
NIPS |
code |
6 |
Diverse Image Annotation |
CVPR |
code |
6 |
Practical Hash Functions for Similarity Estimation and Dimensionality Reduction |
NIPS |
code |
6 |
Deep Adaptive Image Clustering |
ICCV |
code |
6 |
Robust Adversarial Reinforcement Learning |
ICML |
code |
6 |
Improving Training of Deep Neural Networks via Singular Value Bounding |
CVPR |
code |
6 |
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems |
NIPS |
code |
6 |
Tensor Belief Propagation |
ICML |
code |
6 |
Sparse convolutional coding for neuronal assembly detection |
NIPS |
code |
6 |
Unsupervised Pixel-Level Domain Adaptation With Generative Adversarial Networks |
CVPR |
code |
6 |
Bayesian inference on random simple graphs with power law degree distributions |
ICML |
code |
6 |
Tensor Biclustering |
NIPS |
code |
6 |
Riemannian approach to batch normalization |
NIPS |
code |
6 |
Unsupervised Learning of Object Landmarks by Factorized Spatial Embeddings |
ICCV |
code |
6 |
Rolling-Shutter-Aware Differential SfM and Image Rectification |
ICCV |
code |
5 |
Active Decision Boundary Annotation With Deep Generative Models |
ICCV |
code |
5 |
Object Co-Skeletonization With Co-Segmentation |
CVPR |
code |
5 |
Discover and Learn New Objects From Documentaries |
CVPR |
code |
5 |
Understanding Black-box Predictions via Influence Functions |
ICML |
code |
5 |
Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach |
CVPR |
code |
5 |
Decoupling "when to update" from "how to update" |
NIPS |
code |
5 |
MarioQA: Answering Questions by Watching Gameplay Videos |
ICCV |
code |
5 |
Differentially private Bayesian learning on distributed data |
NIPS |
code |
5 |
Grad-CAM: Visual Explanations From Deep Networks via Gradient-Based Localization |
ICCV |
code |
5 |
Question Asking as Program Generation |
NIPS |
code |
5 |
Conic Scan-and-Cover algorithms for nonparametric topic modeling |
NIPS |
code |
5 |
Lip Reading Sentences in the Wild |
CVPR |
code |
5 |
ROAM: A Rich Object Appearance Model With Application to Rotoscoping |
CVPR |
code |
5 |
NeuralFDR: Learning Discovery Thresholds from Hypothesis Features |
NIPS |
code |
5 |
Viraliency: Pooling Local Virality |
CVPR |
code |
5 |
Learning Algorithms for Active Learning |
ICML |
code |
5 |
Point to Set Similarity Based Deep Feature Learning for Person Re-Identification |
CVPR |
code |
5 |
Click Here: Human-Localized Keypoints as Guidance for Viewpoint Estimation |
ICCV |
code |
5 |
The World of Fast Moving Objects |
CVPR |
code |
5 |
Cross-Modality Binary Code Learning via Fusion Similarity Hashing |
CVPR |
code |
5 |
Testing and Learning on Distributions with Symmetric Noise Invariance |
NIPS |
code |
5 |
Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference |
NIPS |
code |
5 |
Diving into the shallows: a computational perspective on large-scale shallow learning |
NIPS |
code |
5 |
Rotation Equivariant Vector Field Networks |
ICCV |
code |
5 |
Recursive Sampling for the Nyström Method |
NIPS |
code |
5 |
Learning From Video and Text via Large-Scale Discriminative Clustering |
ICCV |
code |
5 |
Global optimization of Lipschitz functions |
ICML |
code |
5 |
Device Placement Optimization with Reinforcement Learning |
ICML |
code |
4 |
Alternating Direction Graph Matching |
CVPR |
code |
4 |
MEC: Memory-efficient Convolution for Deep Neural Network |
ICML |
code |
4 |
Expert Gate: Lifelong Learning With a Network of Experts |
CVPR |
code |
4 |
A Simple yet Effective Baseline for 3D Human Pose Estimation |
ICCV |
code |
4 |
On Structured Prediction Theory with Calibrated Convex Surrogate Losses |
NIPS |
code |
4 |
Sub-sampled Cubic Regularization for Non-convex Optimization |
ICML |
code |
4 |
Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval |
CVPR |
code |
4 |
Bottleneck Conditional Density Estimation |
ICML |
code |
4 |
Learning Cooperative Visual Dialog Agents With Deep Reinforcement Learning |
ICCV |
code |
4 |
Multi-way Interacting Regression via Factorization Machines |
NIPS |
code |
4 |
Joint Discovery of Object States and Manipulation Actions |
ICCV |
code |
4 |
Predicting Salient Face in Multiple-Face Videos |
CVPR |
code |
4 |
From Red Wine to Red Tomato: Composition With Context |
CVPR |
code |
4 |
Encoder Based Lifelong Learning |
ICCV |
code |
4 |
Deep Recurrent Neural Network-Based Identification of Precursor microRNAs |
NIPS |
code |
4 |
Guarantees for Greedy Maximization of Non-submodular Functions with Applications |
ICML |
code |
4 |
Pose-Aware Person Recognition |
CVPR |
code |
4 |
Zero-Shot Recognition Using Dual Visual-Semantic Mapping Paths |
CVPR |
code |
4 |
Asynchronous Distributed Variational Gaussian Processes for Regression |
ICML |
code |
3 |
Saliency Pattern Detection by Ranking Structured Trees |
ICCV |
code |
3 |
Toward Goal-Driven Neural Network Models for the Rodent Whisker-Trigeminal System |
NIPS |
code |
3 |
Learning Non-Maximum Suppression |
CVPR |
code |
3 |
Deep Latent Dirichlet Allocation with Topic-Layer-Adaptive Stochastic Gradient Riemannian MCMC |
ICML |
code |
3 |
Discriminative Bimodal Networks for Visual Localization and Detection With Natural Language Queries |
CVPR |
code |
3 |
AdaNet: Adaptive Structural Learning of Artificial Neural Networks |
ICML |
code |
3 |
Large Margin Object Tracking With Circulant Feature Maps |
CVPR |
code |
3 |
Compatible Reward Inverse Reinforcement Learning |
NIPS |
code |
3 |
Adversarial Surrogate Losses for Ordinal Regression |
NIPS |
code |
3 |
Non-monotone Continuous DR-submodular Maximization: Structure and Algorithms |
NIPS |
code |
3 |
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning |
NIPS |
code |
3 |
A framework for Multi-A(rmed)/B(andit) Testing with Online FDR Control |
NIPS |
code |
3 |
Counting Everyday Objects in Everyday Scenes |
CVPR |
code |
3 |
Loss Max-Pooling for Semantic Image Segmentation |
CVPR |
code |
3 |
Aesthetic Critiques Generation for Photos |
ICCV |
code |
3 |
Expectation Propagation with Stochastic Kinetic Model in Complex Interaction Systems |
NIPS |
code |
3 |
Near-Optimal Edge Evaluation in Explicit Generalized Binomial Graphs |
NIPS |
code |
3 |