Pinned Repositories
flash-attention
Fast and memory-efficient exact attention
ms-swift
Use PEFT or full-parameter training to fine-tune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
YuzaChongyi's Repositories
YuzaChongyi/google-images-download
Python script to download hundreds of images from Google Images. The code is ready to run.
YuzaChongyi/llama3v
A SOTA vision model built on top of llama3 8B.
YuzaChongyi/MiniCPM
MiniCPM-2B: An end-side LLM that outperforms Llama2-13B.
YuzaChongyi/OmniLMM
Large Multi-modal Models for Strong Performance and Efficient Deployment
YuzaChongyi/swift
Use PEFT or full-parameter training to fine-tune LLMs or MLLMs
YuzaChongyi/VisCPM
A series of Chinese-English bilingual large multimodal models based on the CPM foundation models