NExT-GPT
An end-to-end MM-LLM that perceive input and generate output in arbitrary combinations (any-to-any) of text, image, video, and audio and beyond.
Pinned Repositories
NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
NExT-GPT.github.io
NExT-GPT: Any-to-Any Multimodal Large Language Model
NExT-GPT's Repositories
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
NExT-GPT/NExT-GPT.github.io
NExT-GPT: Any-to-Any Multimodal Large Language Model