muzammilbehzad
Ph.D. | AI Scientist | Researcher | Computer Vision | Affective Computing | Machine Learning | Deep Learning
Silo AI | University of OuluOulu, Finland
Pinned Repositories
3DDFA
The PyTorch improved version of TPAMI 2017 paper: Face Alignment in Full Pose Range: A 3D Total Solution.
Agile-Project-Management
This is the course content about Agile Project Management.
DataloaderMultipleDataset
This repo can be used to efficiently create a PyTorch dataloader for multiple datasets and train network(s) simultaneously on data batches.
FaceLandmarks
This repo is mainly for real-time face and landmark detection. However, It can be equally used for other images and videos in general.
FaceMaskDetection
To further help combat the corona virus (COVID-19), this repository can be used for real-time face mask detection using webcam with an impressive accuracy. For convenience, the source codes are shared both in MATLAB and Python.
MultiviewTransformer
This repo can be used to work with deep learning models that use multi-views from 3D/4D point clouds.
PythonForDataScienceAndAI
This is the final peer-reviewed project for the Python Basics for Data Science Project course offered by IBM on Coursera.
Setup-PyTorch-AppleM1
Instructions on how to install PyTorch on Apple M1-series
muzammilbehzad's Repositories
muzammilbehzad/Students-Projects-ICS474-Big-Data-Analytics-Fall-2024
Repository for student project submissions in ICS474: Big Data Analytics at KFUPM, Fall 2024.
muzammilbehzad/Agile-Project-Management
This is the course content about Agile Project Management.
muzammilbehzad/AdaptCLIPZS
muzammilbehzad/agent-service-toolkit
Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit
muzammilbehzad/AgentLaboratory
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
muzammilbehzad/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
muzammilbehzad/awesome-artificial-intelligence
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.
muzammilbehzad/awesome-llm-apps
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.
muzammilbehzad/awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
muzammilbehzad/awesome-vlm-architectures
Famous Vision Language Models and Their Architectures
muzammilbehzad/best-of-ml-python
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
muzammilbehzad/Capstone-Applying-Project-Management-in-the-Real-World
This is the course content about Capstone: Applying Project Management in the Real World.
muzammilbehzad/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
muzammilbehzad/code2prompt
A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.
muzammilbehzad/computer-vision-course
This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face discord: hf.co/join/discord
muzammilbehzad/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
muzammilbehzad/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
muzammilbehzad/InstructCV
[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"
muzammilbehzad/LightM-UNet
Pytorch implementation of "LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation"
muzammilbehzad/MONAI
AI Toolkit for Healthcare Imaging
muzammilbehzad/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
muzammilbehzad/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
muzammilbehzad/Project-Execution-Running-the-Project
This is the course content about Project Execution: Running the Project.
muzammilbehzad/Project-Planning-Putting-It-All-Together
This is the course content about Project Planning: Putting It All Together.
muzammilbehzad/RAG-Diffusion
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
muzammilbehzad/samurai
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
muzammilbehzad/Visual-Style-Prompting
Official Pytorch implementation of "Visual Style Prompting with Swapping Self-Attention"
muzammilbehzad/Vitron
NeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
muzammilbehzad/VLM_survey
Collection of AWESOME vision-language models for vision tasks
muzammilbehzad/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information