Efficient-Large-Model/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
PythonApache-2.0
Watchers
- autogyro
- cjfcsjt
- dnth@zenml-io
- eemailme
- ghchris2021
- globalpaiFrank Ouyang
- katopzAVAREUM
- Kingcuda
- LiWentomngZhejiang University
- Lyken17Cambridge, MA
- meenchen
- RaymondWang0MIT EECS
- rebotnixrebotnix technologies
- ruiyingUniversity of Portsmouth
- shahinsharifi
- simasjVilnius, Lithuania
- sodabeta7Apple
- songhanMIT, NVIDIA
- xiaozhiob