arthurdouillard/deepcourse

Transformer colab

Closed this issue · 0 comments

  • uniformize C and D
  • add patch embed after class token in answer to Vit
  • typo in solution openning to multi-head
  • forgot square root in self-attention