Gryphe/BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
Python · Apache-2.0
Issues
- 0
  Different Parameter Sizes
  #4 opened by codelauncher444
- 0
  [Q] Chat Template
  #2 opened by NightMachinery
- 1
  Float32 necessary?
  #1 opened by 0xymoro