Pinned Repositories
VisionGraph
The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context"
Anim-Director
The codes of Siggraph Asia 2024 paper "Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation"
Circuit-area-optimization-of-MPRM
implementation of GreedyFrog(GFLA) and Circuit area optimization of multi-output MPRM based on improved SFLA
Easy_Neural_Network
A configurable neural network
SNLC
Simple Nested Language Compiler in C++
NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Vitron
NeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
PixWizard
HaoyuanShi's Repositories
HaoyuanShi/SNLC
Simple Nested Language Compiler in C++
HaoyuanShi/Circuit-area-optimization-of-MPRM
implementation of GreedyFrog(GFLA) and Circuit area optimization of multi-output MPRM based on improved SFLA
HaoyuanShi/Easy_Neural_Network
A configurable neural network