marslanm
Ph.D. Student at MBZUAI.
Mohamed Bin Zayed University of Artificial Intelligence, Masdar, Abu Dhabi
marslanm's Stars
fahadshamshad/awesome-transformers-in-medical-imaging
A collection of resources on applications of Transformers in Medical Imaging.
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
muzairkhattak/multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
mmaaz60/EdgeNeXt
[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications".
mmaaz60/mvits_for_class_agnostic_od
[ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".
muzairkhattak/PromptSRC
[ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without Forgetting".
fahadshamshad/Clip2Protect
[CVPR 2023] Official repository of paper titled "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search".
TalalWasim/Video-FocalNets
Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]
muzairkhattak/ProText
[CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".
marslanm/Multimodality-Representation-Learning
This repository provides a comprehensive collection of research papers on multimodal representation learning, all of which are cited and discussed in the accepted survey: https://dl.acm.org/doi/abs/10.1145/3617833
asif-hanif/vafa
[MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation".
akhtarvision/cal-detr
Muhammad-Ibraheem-Siddiqui/PerSense
Implementation of the paper "PerSense: Personalized Instance Segmentation in Dense Images"
BioMedIA-MBZUAI/XReal
msaadsaeed/FOP
Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"
koushiksrivats/robust-concept-erasing
Official implementation of the paper "STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models"
umar1997/propaganda-codeswitched-text
[EMNLP 2023] Official repository of paper titled "Detecting Propaganda Techniques in Code-Switched Social Media Text"
muzairkhattak/ImageRecognition-NVIDIA-Jetson
PyTorch scripts of ResNet50: performance metrics are evaluated on Jetson Nano and on my GTX 1060 powered laptop (Asus GL702VM)
muzairkhattak/facial-mask-detector-MTCNN
Facial mask detector combining a PyTorch-based custom NN with a TensorFlow-based MTCNN face detection algorithm
umar1997/MBZUAI
Masters Courses @ MBZUAI
mengzaiqiao/TVBR
mengzaiqiao/adapter-transformers
Huggingface Transformers + Adapters = ❤️
mengzaiqiao/doc_cls
mengzaiqiao/draw_io
mengzaiqiao/epiai_cls_nondocker
mengzaiqiao/glances
Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.
mengzaiqiao/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
mengzaiqiao/SkyAR
Dynamic sky replacement and harmonization in videos
mengzaiqiao/techdocs
Accord Project Documentation
umar1997/Propaganda_Detection_on_Code_Switched_Data