Pinned Repositories
Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
llama.cpp
Lenovo private fork of llama.cpp
Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
JianbangZ's Repositories
JianbangZ/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
JianbangZ/llama.cpp
Lenovo private fork of llama.cpp
JianbangZ/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more