miznchimaki's Stars
openai/guided-diffusion
openai/improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
w86763777/pytorch-ddpm
Unofficial PyTorch implementation of Denoising Diffusion Probabilistic Models
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Yuliang-Liu/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
neelsjain/NEFTune
Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
thunlp/LLaVA-UHD
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
meta-llama/PurpleLlama
Set of tools to assess and improve LLM security.
meta-llama/codellama
Inference code for CodeLlama models
meta-llama/llama3
The official Meta Llama 3 GitHub site
mlfoundations/open_clip
An open source implementation of CLIP.
optuna/optuna
A hyperparameter optimization framework
open-mmlab/Multimodal-GPT
Multimodal-GPT
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
LightDXY/FT-CLIP
CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
wkentaro/gdown
Google Drive Public File Downloader when Curl/Wget Fails
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
replicate/cog
Containers for machine learning
wandb/wandb
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
triton-lang/triton
Development repository for the Triton language and compiler
redotvideo/mamba-chat
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.