lucidrains/PaLM-pytorch
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
PythonMIT
Issues
- 1
code bug: running python train.py
#18 opened by yanzia12138 - 2
Hardware for 100b training?
#13 opened by zxcvqwerasdf - 1
Relu Option for Speed Up
#17 opened by CerebralSeed - 1
How can I train the model on QA task?
#4 opened by mellahysf - 1
Compare loss on enwik8
#16 opened by Bachstelze - 1
Encoder-Decoder-Model
#15 opened by Bachstelze - 2
model weights
#14 opened by Bachstelze - 1
Unneeded `max` substraction in softmax?
#9 opened by hypnopump - 2
- 2
- 1
How to extract PaLM embeddings for code?
#1 opened by kb-open