graphcore-research/out-of-the-box-fp8-training
Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.
Jupyter NotebookMIT
Stargazers
- justinchuby
- zauberresonatorChicago
- vvvm23United Kingdom
- balancapLondon
- s-maddrellmanderBristol, UK
- progerSupercomputer City
- thecharlieblakeLondon
- websitegardener
- bilelomrani1Paris, France
- accesine
- kyegomezPalo Alto
- FarisHijaziRiyadh, Saudi Arabia
- carmoccaSpain
- bobbercheng
- dumpmemory
- isamu-isozakiPhiladelphia, PA, USA
- ethansmith2000florida
- spencerfreiBerkeley, CA
- jmargetaTrencianska Tepla, Slovakia
- fly51flyBeiJing
- Sandalots
- Banguiskode
- Thalituto
- simonJJJ
- Siris-LimxBeijing, China
- vahbuna
- xrsrkeEarth
- Pent
- tantaraPalo Alto, CA
- sheng-qin
- C-TCZurich, Switzerland
- abcp4