firechecking/CleanTransformer
an implementation of transformer, bert, gpt, and diffusion models for learning purposes
PythonMIT
Issues
- 1
Interested in the Generate part
#6 opened by carlosFir - 1
这里是不是写错了?AttentionLayer 里没有bias变量吧?
#5 opened by compass-star - 0
用never_split构建前缀树识别输入文本text中的不可分割字符
#4 opened by compass-star