Muzammal-Naseer

Asst. Professor, KU

Abu Dhabi, UAE

Muzammal-Naseer's Stars

xai-org/grok-1
Grok open release
Language:Python49.4k 562 2098.3k
apple/ml-ferret
Language:Python8.3k 156 0485
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
Language:Python1.2k 14 11999
mbzuai-oryx/MobiLlama
MobiLlama : Small Language Model tailored for edge devices
Language:Python585 13 1442
awaisrauf/Awesome-CV-Foundational-Models
448 19 626
mbzuai-oryx/GeoChat
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
Language:Python408 10 5229
Haiyang-W/GiT
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
Language:Python287 7 1112
muzairkhattak/PromptSRC
[ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without Forgetting".
Language:Python219 5 168
TalalWasim/Vita-CLIP
Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]
Language:Python105 6 1210
fahadshamshad/Clip2Protect
[CVPR 2023] Official repository of paper titled "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search".
Language:Python96 6 1211
jameelhassan/PromptAlign
[NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization
Language:Python93 3 1210
TalalWasim/Video-FocalNets
Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]
Language:Python84 6 716
muzairkhattak/ProText
[CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".
Language:Python82 3 73
techmn/satmae_pp
Official repository for "Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery" (CVPR 2024)
Language:Python82 8 25
hananshafi/llmblueprint
[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"
Language:Jupyter Notebook65 3 52
koushiksrivats/FLIP
Official implementation of the paper "FLIP: Cross-domain Face Anti-spoofing with Language Guidance". (ICCV 2023)
Language:Python62 4 123
uncbiag/SegNext
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts (CVPR 2024)
Language:Python62 4 65
rohit901/cooperative-foundational-models
Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"
Language:Python49 6 74
asif-hanif/vafa
[MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation" accepted in MICCAI 2023 conference.
Language:Python48 2 10
OmkarThawakar/composed-video-retrieval
Composed Video Retrieval
Language:Python42 2 50
mbzuai-oryx/CVRR-Evaluation-Suite
Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs".
Language:Python40 0 02
sheng-eatamath/PromptCAL
Official Implementation of paper: PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery (CVPR'23)
Language:Python39 4 97
kahnchana/clippy
Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)
Language:Jupyter Notebook33 3 15
Muhammad-Huzaifaa/ObjectCompose
[ACCV 2024] ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes 🚀🚀🚀
Language:Jupyter Notebook31 2 00
ShahinaKK/LG_SDG
Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]
Language:Jupyter Notebook28 1 11
sheng-eatamath/S3A
repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)
Language:Jupyter Notebook25 2 02
ShahinaKK/LWI-VMS
Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]
Language:Python22 2 00
Hasindri/HLSS
[MICCAI 2024 🔥] HLSS, the first study to explore hierarchical information inherent in histopathology images and their language descriptions for strong multi-modal representation learning
Language:Python20 4 00
Muzammal-Naseer/DCViT-AT
Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)
Language:Python19 1 12
hananshafi/MedContext
[MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"
Language:Python10 1 01

Muzammal-Naseer

Muzammal-Naseer's Stars

xai-org/grok-1

apple/ml-ferret

mbzuai-oryx/Video-ChatGPT

mbzuai-oryx/MobiLlama

awaisrauf/Awesome-CV-Foundational-Models

mbzuai-oryx/GeoChat

Haiyang-W/GiT

muzairkhattak/PromptSRC

TalalWasim/Vita-CLIP

fahadshamshad/Clip2Protect

jameelhassan/PromptAlign

TalalWasim/Video-FocalNets

muzairkhattak/ProText

techmn/satmae_pp

hananshafi/llmblueprint

koushiksrivats/FLIP

uncbiag/SegNext

rohit901/cooperative-foundational-models

asif-hanif/vafa

OmkarThawakar/composed-video-retrieval

mbzuai-oryx/CVRR-Evaluation-Suite

sheng-eatamath/PromptCAL

kahnchana/clippy

Muhammad-Huzaifaa/ObjectCompose

ShahinaKK/LG_SDG

sheng-eatamath/S3A

ShahinaKK/LWI-VMS

Hasindri/HLSS

Muzammal-Naseer/DCViT-AT

hananshafi/MedContext