Title | Publication | Review | Source |
---|---|---|---|
Mamba: Linear-Time Sequence Modeling with Selective State Spaces | ICLR |
Blog | Paper |
Improving fine-grained understanding in image-text pre-training | CV |
Blog | Paper |
DoLA: DECODING BY CONTRASTING LAYERS IMPROVES FACTUALITY IN LARGE LANGUAGE MODELS | ICLR |
Blog | Paper |
Title | Publication | Review | Source |
---|---|---|---|
Do PLMs Know and Understand Ontological Knowledge? | ACL |
Blog | Paper |
FRUIT: Faithfully Reflecting Updated Information in Text | NAACL |
Blog | Paper |
CogAgent: A Visual Language Model for GUI Agents | - | Blog | Paper |
Generative Agents: Interactive Simulacra of Human Behavior | UIST |
Blog | Paper |
UniMath: A Foundational and Multimodal Mathematical Reasoner | EMNLP |
Blog | Paper |
Active Retrieval Augmented Generation | EMNLP |
Blog | Paper |
PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents | EMNLP |
Blog | Paper |
Extractive Summarization via ChatGPT for Faithful Summary Generation | EMNLP |
Blog | Paper |
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment | EMNLP |
Blog | Paper |
Large language models are human-level prompt engineers | ICLR |
Blog | Paper |
Improving the Domain Adaptation of Retrieval Augmented Generation(RAG) Models for Open Domain Question Answering | TACL |
Blog | Paper |
Mistral 7B | preprint |
Blog | Paper |
Title | Publication | Review | Source |
---|---|---|---|
Token Merging: Your ViT But Faster | ICLR |
Blog | Paper |
Visual Programming: Compositional visual reasoning without training | CVPR |
Blog | Paper |
AnyText: Multilingual Visual Text Generation And Editing | CVPR |
Blog | Paper |
What the DAAM: Interpreting Stable Diffusion Using Cross Attention | ACL |
Blog | Paper |
Erasing Concepts from Diffusion Models | ICCV |
Blog | Paper |
Zero-shot Referring Image Segmentation with Global-Local Context Features | CVPR |
Blog | Paper |
Title | Publication | Review | Source |
---|---|---|---|
Learning Fair Graph Representations via Automated Data Augmentations | ICLR |
Blog | Paper |
Going Beyond Local; Global-Graph-Enhanced Personalized News Recommendation | RecSys |
Blog | Paper |
Cracking the Code of Negative Transfer: A Cooperative Game Theoretic Approach for Cross-Domain Sequential Recommendation | CIKM |
Blog | Paper |
Goal-Oriented Multi-Modal Interactive Recommendation with Verbal and Non-Verbal Relevance Feedback | RecSys |
Blog | Paper |
Of Spiky SVDs and Music Recommendation | RecSys |
Blog | Paper |
Denoising Self-Attentive Sequential Recommendation | RecSys |
Blog | Paper |
Interpretable User Retention Modeling in Recommendation | RecSys |
Blog | Paper |
Diffusion Recommender Model | SIGIR |
Blog | Paper |
GPT4Rec: A Generative Framework for Personalized Recommendation and User Interests Interpretation | - | Blog | Paper |
Collaborative filtering algorithms are prone to mainstream-taste bias | RecSys |
Blog | Paper |
Title | Publication | Review | Source |
---|---|---|---|
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models | - | Blog | Paper |
Osprey: Pixel Understanding with Visual Instruction Tuning | - | Blog | Paper |
TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation | ICCV |
- | Paper |
Unifying Vision, Text, and Layout for Universal Document Processing | CVPR |
Blog | Paper |
Title | Publication | Review | Source |
---|---|---|---|
Robust Speech Recognition via Large-Scale Weak Supervision | ICML |
Blog | Paper |
Title | Publication | Review | Source |
---|---|---|---|
A Time Series is Worth 64 Words: Long-term Forecasting with Transformers | ICLR |
Blog | Paper |
Are Emergent Abilities of Large Language Models a Mirage? | NIPS |
Blog | Paper |
Two-way Multi-Label Loss | CVPR |
Blog | Paper |
Title | Field | Publication | Review | Source |
---|---|---|---|---|
Modeling Spatio-temporal Neighbourhood for Personalized Point-of-interest Recommendation | RecSys |
IJCAI |
Blog | Paper |
Do Prompt-Based Models Really Understand the Meaning of Their Prompts? | NLP |
NAACL |
Blog | Paper |
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning | NLP |
NIPS |
Blog | Paper |
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation | Multi-Modal |
CVPR |
Blog | Paper |
Rethinking Personalized Ranking at Pinterest: An End-to-End Approach | RecSys |
RecSys |
Blog | Paper |