Gryphe/BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
PythonApache-2.0
Stargazers
- bdashore3New York College of Osteopathic Medicine
- bmilde
- ctlllll@Princeton
- curtisgray
- cyber-physSanative AI
- DocShotgun
- dongxiaolong
- e-p-armstrong
- flotosPrisme.ai
- fly51flyPRIS
- gdacciaroItaly
- haoyuzhao123Princeton University
- JeffCarpenterCanada
- jsalixSouthern Oregon
- khoapip
- kotykd
- lrq3000GIGA-Consciousness - Coma Science Group - University & Hospital of Liège
- madook1
- mindragesToronto
- moerehman
- msaroufim@PyTorch
- nmandic78Zagreb, Croatia
- Pent
- prateeky2806
- RossBencinaMelbourne
- SandalotsVolcanak
- seongminpActionPower
- shamy1997Harbin
- soma2000-lang@unifyai
- the-crypt-keeper
- tolecy
- u-brixton
- utensil
- VicisVic
- voladeltaThe Matrix
- zzlgreat