Issues
- 43
- 0
lm_eval missing
#264 opened by falv706 - 0
- 15
AttributeError: module 'jax.random' has no attribute 'KeyArray' while fine tuning.
#221 opened by samyakai - 0
6b.eleuther.ai mystic model is down for GPT-J-6B.
#263 opened by Gertie01 - 0
Web demo must be fixed.
#261 opened by Gertie01 - 11
- 4
6b.eleuther.ai mystic model is down for GPT-J-6B
#226 opened by orionnelson - 0
- 0
About rope embedding
#260 opened by eyuansu62 - 0
Framework
#259 opened by T3fo0ls7766 - 0
Finetuning Hardware Recomendations
#258 opened by greyweb - 1
How to stop model generating
#228 opened by jingrongchen - 2
- 2
The PILE dataset is full of racist content and thus GPT-J produces racist thinking.
#240 opened by azeemh - 1
How to infer with GPT-J on TPU_driver0.2 or nightly?
#256 opened by mosmos6 - 7
tpu_driver0.1 is not initialized on colab (cannot infer with GPT-J on Colab) [Again]
#252 opened by mosmos6 - 2
Discrepancy between results reported in this repo and in the NeoX paper
#257 opened by william-cerebras - 6
Resolving dependency issues
#246 opened by rinapch - 2
Can we please get a quickstart guide?
#243 opened by tswallen - 2
Which version of Python does this work with?
#253 opened by chrisbward - 0
Quantization for training / finetuning
#254 opened by torphix - 1
- 5
Could not find a version that satisfies the requirement ray[default]==1.4.1
#245 opened by Maxim-Mazurok - 2
- 2
- 4
training stuck at validation step 1
#218 opened by Selimonder - 0
TPU not found on VM (jax version 0.2.16)
#242 opened by Eichhof - 0
Project dependencies may have API risk issues
#239 opened by PyDeps - 4
- 1
Dead link to weights?
#238 opened by samacqua - 2
TPU Instance Creation
#237 opened by zzj0402 - 3
- 0
- 2
- 9
- 1
- 2
- 1
Typo in 'to_hf_weights.py '
#231 opened by AmoArt - 0
[Feature Request] Multilingual assistance.
#229 opened by phly95 - 2
- 0
Finetuning GPT Neo 20B Using TPU V3-8s
#227 opened by nikhilanayak - 3
`TypeError: Cannot subclass <class 'typing._SpecialForm'>` in `slim_model.py `
#212 opened by danyaljj - 3
GPT-J inference on TPU
#219 opened by airesearch38 - 0
CausalTransformerV2 or CausalTransformer?
#220 opened by leejason - 0
Can "slim_model.py" work with "d_model" as 768?
#217 opened by leejason - 2
- 1
Error while `to_hf_weights.py`: `ValueError: cannot reshape array of size 25804800 into shape (1,4096,50400)`
#214 opened by danyaljj - 1
`OSError: libmkl_intel_lp64.so.1: cannot open shared object file` when using `to_hf_weights.py`
#215 opened by danyaljj - 1