bpiyush
1st year DPhil, VGG, Oxford. Past: MSc in AI from UvA | Research @ Wadhwani AI | B.S. in Mathematics @ IIT Kanpur
University of OxfordOxford
Pinned Repositories
BayesOpt
Scalable Bayesian Optimization : Comparison of various methods
CLIP-grounding
Evaluating CLIP's cross-modal grounding using explainability methods.
dino-local
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
ml-engg-docs
Documentation for useful tools and best practices for real-world ML engineering
rotation-equivariant-lfm
Rotation equivariance meets local feature matching
test-time-training
Replication of Code for paper Test-Time Training with Self-Supervision for Generalization under Distribution Shifts.
TestOfTime
Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time
coreml
Generic framework for ML projects
Re-CGN
SEVERE-BENCHMARK
bpiyush's Repositories
bpiyush/TestOfTime
Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time
bpiyush/SoundOfWater
Code for the paper "The Sound of Water: Inferring Physical Properties from Pouring Liquids".
bpiyush/dino-local
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
bpiyush/FastSAM
Fast Segment Anything
bpiyush/FCN-f0
Fully-Convolutional Network for Pitch Estimation of Speech Signals
bpiyush/new-machine-setup-scripts
Bunch of scripts useful to add when starting on a new machine
bpiyush/NLP-CS671A
Course files for CS671A - Natural Language Processing
bpiyush/sam-pt
SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.
bpiyush/sound-guided-semantic-image-manipulation
Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)
bpiyush/Sound2Scene
Clone of the Sound2Scene repo. Need to train on pouring water images.
bpiyush/ST-LLM
[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"
bpiyush/TempCompass
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou
bpiyush/transparent-liquid-segmentation
We build a novel self-supervised segmentation pipeline to segment transparent liquids (clear water) placed inside transparent containers.
bpiyush/VITATECS
bpiyush/audio_codec_tests
Tests for codec artefacts in stored audio samples.
bpiyush/bpiyush
My personal introductory repository
bpiyush/bpiyush.github.io
A portfolio page
bpiyush/ddsp-pytorch
Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)
bpiyush/digan
Official PyTorch implementation of Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks (ICLR 2022).
bpiyush/InternVideo
Video Foundation Models & Data for Multimodal Understanding
bpiyush/LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
bpiyush/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
bpiyush/PhysParamInference
Clone of the WACV2023 paper. Adaptation on pouring water.
bpiyush/TimeChat
[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
bpiyush/unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
bpiyush/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
bpiyush/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
bpiyush/VideoMAE-ssl
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
bpiyush/ViLMA
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)
bpiyush/VTimeLLM
[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".