muzairkhattak's Stars
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
anthropics/courses
Anthropic's educational courses
JingyunLiang/SwinIR
SwinIR: Image Restoration Using Swin Transformer (official repository)
bowang-lab/MedSAM
Segment Anything in Medical Images
epfml/ML_course
EPFL Machine Learning Course, Fall 2024
StanfordVL/taskonomy
Taskonomy: Disentangling Task Transfer Learning [Best Paper, CVPR 2018]
TencentARC/Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
open-thought/system-2-research
System 2 Reasoning Link Collection
EPFL-VILAB/MultiMAE
MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022
NVlabs/EAGLE
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
cientgu/VQ-Diffusion
mlfoundations/task_vectors
Editing Models with Task Arithmetic
EvolvingLMMs-Lab/LongVA
Long Context Transfer from Language to Vision
haritheja-e/robot-utility-models
Robot Utility Models are trained on a diverse set of environments and objects, and then can be deployed in novel environments with novel objects without any further data or training.
MMStar-Benchmark/MMStar
[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models?"
snap-research/weights2weights
Official Implementation of weights2weights
nv-dvl/segment-anything-lidar
[ECCV 2024] Better Call SAL: Towards Learning to Segment Anything in Lidar
zeyofu/BLINK_Benchmark
[ECCV 2024] This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive" (https://arxiv.org/abs/2404.12390)
liuzhuang13/bias
chs20/RobustVLM
[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
vinid/safety-tuned-llamas
[ICLR 2024] Paper showing the properties of safety tuning and exaggerated safety.
UCSC-VLAA/vllm-safety-benchmark
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
ys-zong/VLGuard
[ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.
ExplainableML/fomo_in_flux
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
zycheiheihei/Transferable-Visual-Prompting
[CVPR 2024 Highlight] Official implementation of "Exploring the Transferability of Visual Prompting for Multimodal Large Language Models"
umer-sheikh/bird-whisperer
[Interspeech 2024] Official code repository for the paper "Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Classification"
koushiksrivats/robust-concept-erasing
Official implementation of the paper "STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models"
renytek13/Soft-Prompt-Generation
[ECCV 2024] Soft Prompt Generation for Domain Generalization
akhtarvision/weather-regional
mbzuai-oryx/BiMediX2
Bio-Medical EXpert LMM with English and Arabic Language Capabilities