GoGiants1's Stars
modelscope/ms-swift
Use PEFT or full-parameter training to fine-tune 400+ LLMs and 100+ MLLMs. (LLMs: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLMs: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
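For orientation on the PEFT path, here is a minimal LoRA setup using the Hugging Face peft library that ms-swift builds on; the checkpoint name and target modules are illustrative assumptions, not ms-swift's own API:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Assumed checkpoint, chosen only for illustration.
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")
lora_config = LoraConfig(
    r=8,                                   # rank of the low-rank update matrices
    lora_alpha=16,                         # scaling factor for the update
    target_modules=["q_proj", "v_proj"],   # attention projections to adapt (assumed)
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the LoRA adapters are trainable
```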
TideDra/VL-RLHF
An RLHF infrastructure for vision-language models
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
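A minimal offline-inference sketch with vLLM's Python API; the checkpoint name is an assumption:

```python
from vllm import LLM, SamplingParams

# Load the model and pre-allocate the paged KV cache.
llm = LLM(model="Qwen/Qwen2.5-0.5B-Instruct")  # assumed checkpoint
params = SamplingParams(temperature=0.8, max_tokens=64)

# generate() batches prompts and returns one RequestOutput per prompt.
outputs = llm.generate(["Summarize continuous batching in one sentence."], params)
for out in outputs:
    print(out.outputs[0].text)
```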
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with the one-click evaluation module lmms-eval.
allenai/open-instruct
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
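The reference-free reward in the title is the length-normalized log-likelihood of a response under the policy itself; a sketch of the objective, following the paper's notation:

```latex
% SimPO: Bradley-Terry loss on length-normalized, reference-free rewards,
% with a target reward margin \gamma separating winner y_w from loser y_l.
\mathcal{L}_{\mathrm{SimPO}}(\pi_\theta) =
  -\,\mathbb{E}_{(x,\,y_w,\,y_l)\sim\mathcal{D}}\!\left[
    \log\sigma\!\left(
      \frac{\beta}{|y_w|}\log\pi_\theta(y_w\mid x)
      - \frac{\beta}{|y_l|}\log\pi_\theta(y_l\mid x)
      - \gamma
    \right)
  \right]
```

Unlike DPO, no reference model appears in the loss, and dividing by |y| counters the usual bias toward longer responses.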
ytongbai/LVM
HKUST-LongGroup/CoMM
Official repository for the CoMM dataset
xichenpan/Kosmos-G
Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models
AILab-CVC/SEED-X
Multimodal Models in the Real World
AILab-CVC/SEED
Official implementation of SEED-LLaMA (ICLR 2024).
apple/ml-aim
This repository provides the code and model checkpoints for the AIMv1 and AIMv2 research projects.
lucidrains/LVMAE-pytorch
PyTorch implementation of LVMAE, proposed in the paper "Extending Video Masked Autoencoders to 128 Frames"
siboehm/SGEMM_CUDA
Fast CUDA matrix multiplication from scratch
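For context, SGEMM is the single-precision general matrix multiply from BLAS:

```latex
C \leftarrow \alpha A B + \beta C,
\qquad A \in \mathbb{R}^{M \times K},\;
B \in \mathbb{R}^{K \times N},\;
C \in \mathbb{R}^{M \times N}
```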
yzhaiustc/Optimizing-SGEMM-on-NVIDIA-Turing-GPUs
Optimizing SGEMM kernels on NVIDIA GPUs to close-to-cuBLAS performance.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
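A minimal client sketch against SGLang's OpenAI-compatible endpoint; it assumes a server started with `python -m sglang.launch_server --model-path <model>` on the default port:

```python
from openai import OpenAI

# SGLang serves an OpenAI-compatible API; port 30000 is the default,
# and "default" routes to the single loaded model (assumed setup).
client = OpenAI(base_url="http://localhost:30000/v1", api_key="EMPTY")
resp = client.chat.completions.create(
    model="default",
    messages=[{"role": "user", "content": "What does RadixAttention cache?"}],
)
print(resp.choices[0].message.content)
```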
bytedance/1d-tokenizer
This repo contains the code for the 1D tokenizer and generator
KellerJordan/modded-nanogpt
NanoGPT (124M) in 5 minutes
AIDC-AI/Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
PKU-YuanGroup/LLaVA-CoT
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
ys-zong/VL-ICL
Code for the paper "VL-ICL Bench: The Devil in the Details of Benchmarking Multimodal In-Context Learning"
UW-Madison-Lee-Lab/CoBSAT
Implementation and dataset for the paper "Can MLLMs Perform Text-to-Image In-Context Learning?"
leloykun/mmsg
Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.
RLHF-V/RLHF-V
[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
ostris/ai-toolkit
Various AI scripts, mostly for Stable Diffusion.
baaivision/Emu3
Next-Token Prediction is All You Need