Pinned Repositories
craft
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions
.emacs.d
Personal Emacs Configuration
bvpr
[MULA Workshop @ CVPR 2022] Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters
euphemism
Official Implementation of "Detecting Euphemisms with Literal Descriptions and Visual Imagery"
frozen
A PyTorch implementation of Multimodal Few-Shot Learning with Frozen Language Models with OPT.
GAN
Generative Adversarial Networks in Knet
phoneme-convnet
Convolutional Neural Networks for Speech Recognition implementation with Julia/Knet
show-and-tell
Show and Tell: A Neural Image Caption Generator replication with Julia/Knet.jl
tornado-websocket-client-example
Websocket client application example built on top of Tornado.
ViLMA
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)
ilkerkesen's Repositories
ilkerkesen/frozen
A PyTorch implementation of Multimodal Few-Shot Learning with Frozen Language Models with OPT.
ilkerkesen/ViLMA
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)
ilkerkesen/bvpr
[MULA Workshop @ CVPR 2022] Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters
ilkerkesen/euphemism
Official Implementation of "Detecting Euphemisms with Literal Descriptions and Visual Imagery"
ilkerkesen/.emacs.d
Personal Emacs Configuration
ilkerkesen/adapter-transformers
Huggingface Transformers + Adapters = ❤️
ilkerkesen/ilkerkesen
Repository for my bio
ilkerkesen/intphys
ilkerkesen/RAM
Recurrent Models of Visual Attention implementation in Julia/Knet
ilkerkesen/Sloth.jl
Lazy bums' Knet package
ilkerkesen/Ask-Anything
[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
ilkerkesen/Awesome-Referring-Image-Segmentation
:books: A collection of papers about Referring Image Segmentation.
ilkerkesen/caption_metrics
Evaluation Metrics for Image Captioning
ilkerkesen/ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
ilkerkesen/colorfromlanguage
Code base of the paper : Learning to Color from Language
ilkerkesen/colorization
Automatic colorization using deep neural networks. "Colorful Image Colorization." In ECCV, 2016.
ilkerkesen/DeepLabV3Plus-Pytorch
DeepLabv3, DeepLabv3+ and pretrained weights on VOC & Cityscapes
ilkerkesen/dotfiles
Personal Configuration Files
ilkerkesen/DRAW
Knet implementation of DRAW: A Recurrent Neural Network For Image Generation
ilkerkesen/frozen-in-time
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
ilkerkesen/ilkerkesen.github.io
Personal Website
ilkerkesen/MCQ
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
ilkerkesen/mPLUG-2
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
ilkerkesen/pixel
Research code for pixel-based encoders of language (PIXEL)
ilkerkesen/pytorch-deeplab-xception
DeepLab v3+ model in PyTorch. Support different backbones.
ilkerkesen/singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
ilkerkesen/UVR-NMT
Neural Machine Translation with universal Visual Representation (ICLR 2020)
ilkerkesen/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
ilkerkesen/VideoCLIP
VideoCLIP and VLM implementations for custom benchmark (originally it's fairseq).
ilkerkesen/VindLU