miznchimaki

miznchimaki's Stars

openai/guided-diffusion
Language:Python6.2k819
openai/improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
Language:Python3.2k483
w86763777/pytorch-ddpm
Unofficial PyTorch implementation of Denoising Diffusion Probabilistic Models
Language:Python49862
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language:Python8.2k1k
Yuliang-Liu/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Language:Python1.8k126
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python36.7k4.5k
neelsjain/NEFTune
Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning
Language:Python37819
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
Language:Python6.3k444
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Language:Python2.1k140
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
Language:Python1.9k123
thunlp/LLaVA-UHD
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Language:Python31415
meta-llama/PurpleLlama
Set of tools to assess and improve LLM security.
Language:Python2.6k440
meta-llama/codellama
Inference code for CodeLlama models
Language:Python16k1.9k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python26.7k3k
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python10.1k968
optuna/optuna
A hyperparameter optimization framework
Language:Python10.7k1k
open-mmlab/Multimodal-GPT
Multimodal-GPT
Language:Python1.5k125
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language:Python2.3k165
LightDXY/FT-CLIP
CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet
Language:Python2067
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.3k865
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python5.8k453
wkentaro/gdown
Google Drive Public File Downloader when Curl/Wget Fails
Language:Python4.3k349
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language:TypeScript48.7k7k
amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
Language:Python3.8k310
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language:Python3.6k241
replicate/cog
Containers for machine learning
Language:Python8k558
wandb/wandb
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
Language:Python9k671
triton-lang/triton
Development repository for the Triton language and compiler
Language:C++13.1k1.6k
redotvideo/mamba-chat
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
Language:Python90570
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language:Python10.4k1k