Issues
- 3
What flash-attn version does Telechat require?
#64 opened by yishengduwu - 0
7B model gives irrelevant answers to many questions
#72 opened by Sun-Xiaohui - 3
Question about Xingchen (星辰) support for AutoModelForSequenceClassification tasks
#71 opened by tcoln - 34
- 0
Model keeps repeating its output after fine-tuning with the official method
#70 opened by wangxuanhao - 0
Is a fine-tuning dataset provided?
#69 opened by atpdxy - 2
It seems to call CUDA by default; can it not run in CPU mode?
#19 opened by nobodybut - 3
300I Pro deployment issue
#63 opened by kasldsaknf - 1
Are there any plans to support domestic GPUs, such as the Huawei Ascend series, Hygon, and Cambricon?
#66 opened by b1gme - 0
Error during LoRA fine-tuning of the 7B model
#65 opened by lichengyang666 - 3
Is Agent supported?
#58 opened by zhanpengjie - 0
The image is unavailable again; could you please update it once more?
#60 opened by FoRever1010 - 1
Can vllm inference or an OpenAI-compatible API service be supported?
#49 opened by Moon502 - 4
Problem with the results of LoRA fine-tuning on the 1B model
#54 opened by lichengyang666 - 1
- 1
Question about the global_step saved after fine-tuning the model
#50 opened by Ricardo-Ping - 1
RuntimeError: FlashAttention only supports Ampere GPUs or newer. How can this be resolved? I have been stuck for several days, and changing the code did not help.
#52 opened by SongAoxiang - 9
AssertionError: Please install FlashAttention first, e.g., with pip install flash-attn
#12 opened by zhanghui-china - 1
- 3
UnboundLocalError: cannot access local variable 'dim' where it is not associated with a value
#47 opened by Cathelloya - 0
The image is no longer available; please update it
#53 opened by BubblyFace - 0
ImportError: /home/.cache/torch_extensions/py38_cu118/fused_adam/fused_adam.so: undefined symbol: _ZN3c106detail23torchInternalAssertFailEPKcS2_jS2_RKSs
#48 opened by Jerry-CLAY - 0
How to enable quantized LoRA fine-tuning
#46 opened by coding-cadenza - 1
- 0
load error
#44 opened by pzwstudy - 0
- 8
Inference error after LoRA fine-tuning with LLaMA-Factory
#22 opened by JasonCZH4 - 0
- 1
Could you provide the pretraining code?
#32 opened by uloveqian2021 - 0
How much GPU memory is needed to train the 7B model?
#40 opened by 1361095044 - 3
Added a Dockerfile for the runtime environment
#37 opened by gptq - 0
requirements.txt is missing sentencepiece
#38 opened by gptq - 0
Could the model be uploaded to the ollama model library?
#36 opened by hlj - 0
- 1
When will the 12B model be released?
#24 opened by zhanpengjie - 1
- 2
12B model: ValueError: Tokenizer class TelechatTokenizer does not exist or is not currently imported.
#31 opened by wuxiulike - 2
Problem with the output of the base-model direct continuation demo in telechat_infer_demo.py
#28 opened by hzhaoy - 0
Can an A10 GPU support training?
#29 opened by Tomcat-kin - 1
Support English documents for README
#25 opened by StefanXiepj - 1
- 2
"TeleChat模型社区许可协议.pdf" can't open or view
#13 opened by ruanjianhui - 0
- 0
Will the English corpus be open-sourced?
#23 opened by YixinSong-e - 3
Maximum training length
#14 opened by icemoon-creative - 3
The image download link appears to be incorrect
#20 opened by 768761418 - 1
- 0
About open-sourcing the data
#18 opened by ddddddreamcastle - 1
- 0
Question about the model architecture
#16 opened by CSlearnerZM