Issues
- 1
How to train Bunny with both image-text pair data and pure text data together?
#139 opened by isbrycee - 1
Question about deepspeed checkpoint loading
#138 opened by Wintoplay - 2
What is the text encoder for the bunny model?
#131 opened by Why0912 - 2
- 2
evaluation about mmbench
#125 opened by caiyuxuan1120 - 2
about images of pretrain data
#124 opened by nicedoctor - 2
Multi-images in 1 prompt
#120 opened by motcapbovit - 5
About pretrain data
#119 opened by kid369 - 6
Convert Bunny-v1.0-3B to GGUF
#115 opened by q104769424 - 8
Vision_tower is not updated as expected
#130 opened by ChenFicha - 3
About continuous_training
#137 opened by Wintoplay - 1
what is questions_answers_YN?
#136 opened by Tramac - 1
- 0
Faster inference
#134 opened by sahil02235 - 1
Inconsistency in Bunny-695K Dataset with Technical Report and Sample Duplication
#132 opened by daybreaksly - 5
Batch inference
#93 opened by mtsysin - 0
Can this model do object detection job?
#129 opened by PredyDaddy - 0
- 2
obtain the log probabilities of output tokens
#126 opened by Yuxin916 - 5
Continuous Fine-tuning Bunny 1.1 4B
#123 opened by ChenFicha - 3
- 2
Pre_train 和 SFT
#109 opened by mynamelxy - 3
KeyError: 'bunny-phi'
#105 opened by jui0616 - 2
- 2
Model only responds with fine-tuned answers
#97 opened by tamdan17 - 1
Bunny v1.1 Llama 3 8b GGUF support/release?
#122 opened - 0
Inference acceleration, can the trained model use some inference framework? The code comes from the llava architecture. Can it be integrated into inference frameworks such as sglang or lmdeploy similar to llava?
#121 opened by zhangqingwu - 1
- 4
- 2
about the training strategy for Llama-3-8B
#96 opened by Jancsi9981 - 1
Network Error due to High Traffic Error Message
#114 opened by kyuewang17 - 3
- 7
- 2
- 2
- 2
Why do you modify the function `prepare_inputs_for_generation` of the LLM?
#106 opened by linhaojia13 - 1
Smaller qwen2 model?
#104 opened by R3xpook - 2
about finetune train
#90 opened by ZuyongWu - 1
微调数据集制作疑问
#103 opened by chenzhu005774 - 6
about training
#95 opened by Tengfei000 - 5
微调报错
#102 opened by chenzhu005774 - 2
微调模型后启动web显示矩阵维度对不上
#100 opened by htesd - 4
Zero3 error for pretrain
#98 opened by zhww - 0
Support for Qwen2
#94 opened by Gary2018X - 4
convert raw data to training format
#92 opened by acul3 - 2
- 5
question about s2
#89 opened by zezeze97 - 1
How to Evaluate a Fine-Tuned Model
#88 opened by HuBocheng - 1
please use torch.amp instead of apex directly.
#86 opened by dragen1860 - 1
tokenization mismatch
#87 opened by Wondersui