LabelAugmentedModels TODO loss with embeddings regularization save memory like state_dict add multi-gpu support add 16 bit precision support add ptml sampler