xverse-ai/XVERSE-65B
XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.
PythonApache-2.0
Issues
- 2
请问 chat 模型什么时候可以放出来呢?
#4 opened by Xu-Chen - 1
请问使用flash-attn 2进行训练的代码能否公开
#3 opened by linqinhong - 1
请问预训练数据集会开源部分吗?
#7 opened by echo-valor - 0
请问大概需要多少gpu内存才能运行
#8 opened by 1014670860 - 2
请问对于gaokao-bench的评估,你们的prompt是怎么构造的?
#5 opened by liu904-61 - 0
- 1
模型推理有什么可用的加速策略嘛
#1 opened by cdxzyc - 2
关于中文tokenizer编码问题
#2 opened by onlyfish79