Issues
What does "loss scale" in the training log refer to?
#221 opened by xliu99 - 0
Error with the GLM-130B model checkpoint
#219 opened by sunpian1 - 0
Download stalls halfway and cannot be resumed
#218 opened by HaHaLiang666 - 0
Help wanted: I want to deploy this model locally. How do I deploy it on Windows?
#217 opened by kangkangkangkkkk - 0
Has anyone run the model successfully in the TensorRT-LLM Docker environment? Help needed...
#216 opened by dahaobenhao - 1
Clarification Request on GLM-130B Model Architecture and Licensing for Commercial Use
#215 opened by JayLiangs - 0
FasterTransformer inference on 8 GPUs fails with RuntimeError: [FT][ERROR] Assertion fail: /home/young.ruan/FasterTransformer/src/fastertransformer/th_op/glm/GlmOp.h:539
#213 opened by rGitcy - 0
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0 in `answers, answers_with_style, blanks = fill_blanks(raw_text, model, tokenizer, strategy)`
#212 opened by rGitcy - 1
Will there be a GLM2-130B?
#209 opened by yhyu13 - 1
Where is the course link?
#210 opened by Stonesusu - 1
Where is Embedding Layer Gradient Shrink implemented?
#191 opened by jiezhangGt - 6
The model performs very poorly. What is the cause?
#186 opened by rchanggogogo - 7
Can FasterTransformer support Glm6B?
#208 opened by sym19991125 - 5
The model download links received in the application email have all expired
#207 opened by bixyz - 1
Cannot submit an application on the model application page
#205 opened by VSRacer - 4
Are there plans to open-source a chat version based on 130B?
#206 opened by ricosr - 1
How to adapt FasterTransformer to my own model
#182 opened by ming-shy - 0
Can GLM also output the sources of the content it cites when generating output?
#204 opened by mike-2020 - 1
Inference on 6 GPUs
#194 opened by wangheqi987 - 0
How to set up a model-parallel cluster
#203 opened by ChenBinfighting1 - 0
Per-token latency varies in spikes
#201 opened by wangheqi987 - 0
Question about the FT inference benchmark data
#200 opened by frankxyy - 0
Training objective
#199 opened by shuangshuangguo - 0
Question about the figure in docs/quantization.md
#198 opened by M3Dade - 1
INT4 model inference error on 4×4090 GPUs
#174 opened by sukibean163 - 1
[Question] Does the GLM-130B model have a vocab file?
#195 opened by starkhu - 0
Question about GLM-130B model architecture hyperparameters
#196 opened by peiyingxin - 0
Does FasterTransformer support bf16 inference?
#193 opened by benyang0506 - 0
How to fine-tune GLM-130B with LoRA
#190 opened by ShaunHeNJU - 0
Is there a tutorial for deploying GLM-130B on DCU?
#189 opened by guoxiaoyue111111 - 0
NVLink communication
#188 opened by wangheqi987 - 2
Are there still many problems to solve between ChatGLM and this open-source GLM-130B model?
#178 opened by applepieiris - 1
aria2 reports errors with http_proxy and https_proxy
#187 opened by Timaos123 - 1
ChatGLM-130B does not seem to be open-sourced yet, right? Only 6B is; the 130B model is not a chat model
#183 opened by guotong1988 - 3
Error when loading the INT4 model
#185 opened by wudajun7509 - 1
[HELP] Can anyone share an already-quantized INT4 version of the model?
#179 opened by rchanggogogo - 15
Question for the authors: why does the model size not change after quantizing to INT4 or INT8? It is 240G in both cases
#172 opened by GXKIM - 2
Model download link within China
#176 opened by wangheqi987 - 0
A question about bf16 in the paper
#180 opened by Saggressive - 0
Question: what does token mean here?
#175 opened by jiangying000 - 0
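The CAPTCHA for partnership inquiries on the Tianqi official site https://tianqi.aminer.cn/ does not load. How can I get in touch about commercial use?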
#171 opened by sjtuzhaoxh - 3
Why is there no Chinese documentation?
#170 opened by fsy1215