sshh12/multi_token

Embed arbitrary modalities (images, audio, documents, etc) into large language models.

PythonApache-2.0

Issues

Error when running AudioWhisper inference
#26 opened 2 months ago by setianke
1
GGUF Support?
#27 opened 2 months ago by yukiarimo
0
M1 support?
#23 opened 4 months ago by yukiarimo
2
OpenAI Client for Serving
#25 opened 3 months ago by SulRash
1
Fine tuning LLAVA for object detection
#24 opened 3 months ago by dipikakhullar
1
Cannot compile adapter_model.bin?
#22 opened 4 months ago by kuki2008
6
No module named 'imagebind'
#21 opened 4 months ago by kuki2008
0
Training with no pretrained encoder - just projection from ready embeddings
#20 opened 5 months ago by tehila17-meet
6
How train mixtral MoE ?
#18 opened 5 months ago by tommarques56
3
Multi GPU
#19 opened 5 months ago by tehila17-meet
1
is the training data available?
#17 opened 6 months ago by tanganke
2
Supported Base Models
#16 opened 6 months ago by DhruvSinghiitmandi
1
Summarize video
#15 opened 6 months ago by linchen111
1
pretrain errors
#14 opened 7 months ago by linchen111
4
Adapter weights not found
#12 opened 8 months ago by DeuceOfClubs
9
HFValidationError
#13 opened 7 months ago by linchen111
0
can you share an dataset to me use to train my vision model based on this?
#9 opened 8 months ago by guilh00009
3
Thanks for the great work
#6 opened 8 months ago by codybum
4
Thank you for posting this!
#10 opened 8 months ago by matbee-eth
3
Require ``ModalityArguments`` for new modalities
#11 opened 9 months ago by super-dainiu
3
wait, what is that in my training?
#8 opened 9 months ago by guilh00009
1
theres nothing in my output
#7 opened 9 months ago by guilh00009
7
Finetuning already trained model
#5 opened 10 months ago by Aniketto16
2
Multiple Image QA Model
#4 opened a year ago by tsdocode
3