JianbangZ

US

Pinned Repositories

Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
Language:Python823 10 6150
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language:Python44.6k 246 6.2k5.5k
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python141k 1.1k 16.9k28.3k
llama.cpp
Lenovo private fork of Llama.cpp
Language:C0 0 00
Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Language:Jupyter Notebook1 0 01
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Language:Python0 0 00
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python38.1k 354 1.9k4.7k
MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Language:Python19k 137 7711.4k
CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Language:Python2.3k 29 182152

JianbangZ's Repositories

JianbangZ/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Language:Jupyter Notebook1 0 01
JianbangZ/llama.cpp
Lenovo private fork of Llama.cpp
Language:C0 0 00
JianbangZ/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Language:Python0 0 00