Ucas-HaoranWei/Vary-toy
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
Python
Issues
- 0
Vary-tiny 是否支持中文?
#41 opened by Davidwhw - 0
请问下,相对于诸如Florence2这类的0.3/0.7B的模型,Vary-tony的性能如何呢?
#40 opened by gjd2017 - 0
How do I download flash-attn? I have followed the downloading steps completely.
#39 opened by ranck626 - 0
where the requirements
#38 opened by Moneorker - 1
一张卡train不起来
#34 opened by fanshuaiyao - 0
ValueError: Trying to set a tensor of shape torch.Size([257, 1024]) in "weight" (which has shape torch.Size([577, 1024])), this look incorrect.
#37 opened by willpat1213 - 1
难以控制生成语言种类
#36 opened by TekhneC - 1
训练loss降为0
#32 opened by afreestudy - 3
- 2
训练数据中的<lb>
#33 opened by fanshuaiyao - 9
请问训练大概需要什么性能的GPU
#9 opened by xaswq - 3
生成的内容出现问题
#15 opened by Nikol-coder - 6
- 9
修改加载 CLIP-VIT-L 模型路径的问题
#7 opened by hotwa - 20
- 0
训练json的格式
#31 opened by afreestudy - 0
请问修改哪里能在训练模型时,接入opt模型
#29 opened by LimbCC - 1
- 2
- 1
我在进行第一阶段的训练(视觉词汇表)后,测试的时候opt输出错误的坐标位置,无法检测对象
#26 opened by black1948 - 1
训练参数 --model_name_or_path
#17 opened by sixgod-666 - 1
有什么办法把llm部分切换到hf上的qwen2吗?
#25 opened by shifan3 - 0
RuntimeError: Input type (c10::Half) and bias type (float) should be the same
#24 opened by Gary-code - 3
路径如何修改
#14 opened by lht1605766283 - 3
请问new vision vocabulary weights是否指的是sam部分的权重?
#22 opened by whalefa1I - 1
Error: Downloading models from huggingface
#20 opened by chenweilong915 - 1
train errorKeyError: 'data_name1'
#21 opened by bsbrother - 2
麻烦问一下,qwen 1.8B用的是chat版本的还是非chat版本的?
#19 opened by duchenzhuang - 1
About deployment?
#18 opened by CVHub520 - 1
训练的问题
#13 opened by duchenzhuang - 0
exits with return code = -9 after I delete 'device_map="cuda"', OOM will occur if I keep 'device_map="cuda"'
#16 opened by zodiac50 - 12
- 1
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:5 and cuda:6!
#12 opened by readyFly - 1
ValueError: Trying to set a tensor of shape torch.Size([1024, 1024]) in "weight" (which has shape torch.Size([2048, 1024])), this look incorrect.
#11 opened by zodiac50 - 1
有没有开源文档渲染数据代码的计划?
#6 opened by yazheng0307 - 8
CUDA out of memory
#5 opened by sixgod-666 - 7
可以更一下 requirements.txt 么
#1 opened by tpoisonooo - 1
支持 4/8 bit 量化运行
#3 opened by yazheng0307 - 3