MohsenFayyaz89's Stars
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
guidance-ai/guidance
A guidance language for controlling large language models.
ggerganov/ggml
Tensor library for machine learning
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
amirhossein-kz/Awesome-Diffusion-Models-in-Medical-Imaging
Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)
xlang-ai/OSWorld
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
HuangOwen/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
allenai/mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
nothinglo/Deep-Photo-Enhancer
TensorFlow implementation of the CVPR 2018 spotlight paper, Deep Photo Enhancer: Unpaired Learning for Image Enhancement from Photographs with GANs
mahmoudnafifi/Exposure_Correction
Project page of the paper "Learning Multi-Scale Photo Exposure Correction" (CVPR 2021).
TRI-ML/prismatic-vlms
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
sjmoran/DeepLPF
Code for CVPR 2020 paper "Deep Local Parametric Filters for Image Enhancement"
jzhang38/LongMamba
Some preliminary explorations of Mamba's context scaling.
SqueezeAILab/LLM2LLM
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
google-deepmind/compressed_vision
UCDvision/NOLA
Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"
QUVA-Lab/PIN
Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
FarnoushRJ/MambaLRP
Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models".
TaylorSwiftNet/TaylorSwiftNet
[BMVC 2022] TaylorSwiftNet: Taylor Driven Temporal Modeling for Swift Future Frame Prediction
msaberp/transformer2024
Improved Transformer implementation for machine translation, featuring bug fixes, updated dependencies, enhanced configuration, and code refactoring.
es0m/llama.cpp
Port of Facebook's LLaMA model in C/C++