Issues
- 1
What is the conv_mode for VILA-1.5-3B ?
#103 opened by amitbcp - 1
Expected Release Date for VILA^2 Model and Code
#124 opened by SZUHvern - 14
ValueError: The checkpoint you are trying to load has model type `llava_llama` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.
#135 opened by eternal8080 - 1
Context size and examples for LongVILA
#141 opened by yulinzou - 1
KeyError: 'llava_llama'
#138 opened by RajaAIStarter - 1
cannot download dataset
#139 opened by henrycjh - 1
Unable to run Gradio demo: VILA with TinyChat
#146 opened by mitraavi - 5
How to get the stage 2 checkpoint path for 3_sft.sh
#143 opened by Qnancy - 1
- 2
Fine-tuning LongVILA
#140 opened by lyluh - 1
Repetitive Output in LongViLa-LLama3-1024Frames
#149 opened by hb-jw - 20
- 0
- 1
Docker setup gets error
#147 opened by cholland-nv - 0
How to inference with AWQ in linux shell.
#144 opened by GitMonkey0 - 1
- 3
Dataset and Training code for Longvila
#132 opened by JcWang20 - 4
TypeError: LlamaRotaryEmbedding.forward() got an unexpected keyword argument 'seq_len' when running VILA model inference
#126 opened by LanceLeonhart - 0
Long context video module only
#142 opened by MH-Python - 1
VILA-1.5-HD coming soon?
#137 opened by collinmccarthy - 2
- 1
About the inference on video.
#134 opened by trinhvg - 3
create long-video QA samples
#121 opened by peiliu0408 - 8
- 1
how to run VILA1.5-40B-AWQ
#125 opened by chenxinhua - 0
- 0
Where is the server.py script?
#131 opened by zixinglin07 - 2
- 0
- 1
Fine tuning and --evaluation_strategy argument
#122 opened by lyluh - 1
Can VILA do grounding jobs?
#128 opened by PredyDaddy - 0
Plz fix run_vila.py line 65 output variable(s)
#127 opened by ziyaosg - 0
Data preparation for Stage 4 and Stage 5 in LONGVILA
#119 opened by GenjiB - 1
LongVILA - compatibility with other LLMs
#115 opened by orrzohar - 3
COYO-700M Dataset Download Script Error
#107 opened by XuGW-Kevin - 2
- 3
How to convert model to gguf
#93 opened by dand-milestone - 1
No training scripts in scripts/v1_5/paper/
#112 opened by yhyang123 - 3
- 1
Image text retrieval support
#106 opened by lhchau - 1
Support VILA with lmdeploy
#105 opened by cmpute - 1
- 4
Is there any way to increase the context window?
#100 opened by ZackBradshaw - 5
Deployment to SageMaker and/or HuggingFace Inference Endpoints Fails With Error
#94 opened by averypfeiffer - 2
- 6
- 1
- 1
- 2
- 0
Llama2 or Llama3
#102 opened by amitbcp