Pattern-Exploiting Training in Julia and Knet. A replication of "It’s Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"
Primary LanguageJulia
No one’s star this repository yet.