Pinned Repositories
3DOD_thesis
3D Object Detection for Autonomous Driving in PyTorch, trained on the KITTI dataset.
bag-of-words
Python Implementation of Bag of Words for Image Recognition using OpenCV and sklearn
LocalizingMoments
GitHub repository for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"
multimodal_vtt
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
niluthpol.github.io
ProfXkit
Professor X Toolkit (previously known as the Princeton Vision and Robotics Toolkit)
RGB2LIDAR
RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization
TALL
TALL: Temporal Activity Localization via Language Query
visual-semantic-embedding
Implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models"
weak_supervised_video_moment
Weakly Supervised Video Moment Retrieval from Text Queries
niluthpol's Repositories
niluthpol/multimodal_vtt
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
niluthpol/weak_supervised_video_moment
Weakly Supervised Video Moment Retrieval from Text Queries
niluthpol/RGB2LIDAR
RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization
niluthpol/visual-semantic-embedding
Implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models"
niluthpol/3DOD_thesis
3D Object Detection for Autonomous Driving in PyTorch, trained on the KITTI dataset.
niluthpol/bag-of-words
Python Implementation of Bag of Words for Image Recognition using OpenCV and sklearn
niluthpol/LocalizingMoments
GitHub repository for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"
niluthpol/niluthpol.github.io
niluthpol/ProfXkit
Professor X Toolkit (previously known as the Princeton Vision and Robotics Toolkit)
niluthpol/TALL
TALL: Temporal Activity Localization via Language Query
niluthpol/VLN-CE
Vision-and-Language Navigation in Continuous Environments using Habitat
niluthpol/vsepp
PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"