Pinned Repositories
blog
customer_bot
Simple chatbot using Rasa.ai
healthcare_ml
A curated list of ML|NLP resources for healthcare.
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
modnet_docker
Dockerized container for MODNet - a Real-Time Portrait Matting solution
movieRecommender
Simple recommender system using Spotlight library on MovieLens dataset
NLP_Study_Group
NLPNotes
random notes
raspberryPi
nahidalam's Repositories
nahidalam/maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
nahidalam/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
nahidalam/annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
nahidalam/Apollo
Apollo is a family of LMMs designed for video understanding
nahidalam/Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI Strawberry o1.
nahidalam/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
nahidalam/blt
Code for BLT research paper
nahidalam/collage-low-precision
nahidalam/edgevl
Official code for the ECCV 2024 paper "Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities"
nahidalam/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
nahidalam/imp
a family of multimodal small language models
nahidalam/inspect_ai
Inspect: A framework for large language model evaluations
nahidalam/Leffa
Learning Flow Fields in Attention for Controllable Person Image Generation
nahidalam/LLaVA-NeXT
nahidalam/LLaVA-Video-Llama-3
nahidalam/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
nahidalam/LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
nahidalam/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
nahidalam/matmulfreellm
Implementation for MatMul-free LM.
nahidalam/MobileLLM
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
nahidalam/ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
nahidalam/ngpt
Normalized Transformer (nGPT)
nahidalam/ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
nahidalam/PLLaVA
Official repository for the paper PLLaVA
nahidalam/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
nahidalam/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
nahidalam/smol-course
A course on aligning smol models.
nahidalam/smol-vision
Recipes for shrinking, optimizing, and customizing cutting-edge vision models. 💜
nahidalam/SQ-LLaVA
Visual self-questioning for large vision-language assistant.
nahidalam/VTCD