yandex/YaLM-100B

Pretrained language model with 100B parameters

Python · Apache-2.0

Issues

How to use it with LangChain?
#30 opened a year ago by tonymacx86PRO
2
gguf / mlx format?
#34 opened 5 months ago by alexander-potemkin
0
Why are ssh-agent and the openssh-client package used in Docker?
#33 opened a year ago by spanarek
0
Timeout on 8 x RTX A6000
#31 opened a year ago by andrewerf
2
Request to Open "Russian Pile" Dataset for Public Access
#29 opened a year ago by mgrankin
0
Provide pruned version for weaker hardware
#27 opened a year ago by CommanderTvis
2
Citation bibtex?
#28 opened a year ago by Guitaricet
2
CUDA out of memory
#22 opened 2 years ago by Aspector1
6
Has anyone deployed it on 10x 3090, or any similar configuration?
#26 opened 2 years ago by AlexanderKozhevin
1
Are there any plans for a cloud service?
#25 opened 2 years ago by AlexanderKozhevin
1
PCI x1 or PCI x16 for GPU
#24 opened 2 years ago by hostingmydata
0
NCCL error
#23 opened 2 years ago by ZiqianXie
1
Would it be possible to run the model on a single A100 (40GB) or 2x V100 (32GB)?
#20 opened 2 years ago by alkavan
2
ZeRO 3 NVMe Offload?
#7 opened 2 years ago by Vbansal21
9
No mention of `bfloat16` in source, and yet weights are `bfloat16`
#21 opened 2 years ago by lostmsu
0
Could you share the md5 value for those checkpoints?
#18 opened 2 years ago by OleNet
2
Can it be launched on a typical VPS? For example, 6 CPU cores and 16 GB RAM (commodity chips)
#19 opened 2 years ago by CombainerA19
2
Online examples
#3 opened 2 years ago by Slauta
9
AWS
#15 opened 2 years ago by githubuser100007
1
Run on networked nodes
#14 opened 2 years ago by ExtraE113
0
Dataset information
#5 opened 2 years ago by finetunej
2
[NL] token
#8 opened 2 years ago by TatianaShavrina
2
How did you use the LAMB optimizer with ZeRO CPU offload?
#13 opened 2 years ago by ghosthamlet
2
Possible to run on 8 x 24GB 3090?
#9 opened 2 years ago by hobodrifterdavid
3
Hello
#11 opened 2 years ago by DedP0tap
2
Model dataset irregularity
#10 opened 2 years ago by joshlk
0
YaLm
#4 opened 2 years ago by Sigma2398
1
Evaluation benchmarks (lm-eval-harness)
#2 opened 2 years ago by justheuristic
0