28.07.2024 |
Scaling to Very Very Large Corpora for Natural Language Disambiguation |
ACL |
Jafar Isbarov |
03.08.2024 |
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models |
arXiv |
Kavsar Huseynova |
10.08.2024 |
Direct Preference Optimization: Your Language Model is Secretly a Reward Model |
NeurIPS |
Mirakram Aghalarov |
17.08.2024 |
Is Cosine-Similarity of Embeddings Really About Similarity? |
arXiv |
Eyvaz Najafli |
24.08.2024 |
Vision Transformers Need Registers |
ICLR |
Sanur Jujuyeva |
31.08.2024 |
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization |
ICCV |
Lala Ibadullayeva |
7.09.2024 |
Understanding deep learning requires rethinking generalization |
ICLR |
Jafar Isbarov |