CoEich's Stars
microsoft/mup
maximal update parametrization (µP)
Aleph-Alpha/magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
Aleph-Alpha/scaling
Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for training large language models.
PeterOvermann/TriadicMemory
Cognitive Computing with Associative Memory