/nlp-arxiv-daily

Automatically Update NLP Papers Daily using Github Actions (ref: https://github.com/Vincentqyw/cv-arxiv-daily)

Primary LanguagePython

Updated on 2024.05.24

Table of Contents
  1. NLP
  2. Legal NLP
  3. Sequence Annotation
  4. Named Entity Recognition
  5. Text Classification
  6. Sentiment Analysis
  7. Question Answering
  8. Information Extraction
  9. Recommendation System
  10. Knowledge Graph
  11. GNN

NLP

Publish Date Title Authors PDF Code
2024-05-21 Code-mixed Sentiment and Hate-speech Prediction Anjali Yadav et.al. 2405.12929v1 null
2024-05-21 SmartFlow: Robotic Process Automation using LLMs Arushi Jain et.al. 2405.12842v1 null
2024-05-21 Large Language Models Meet NLP: A Survey Libo Qin et.al. 2405.12819v1 null
2024-05-21 Transformer in Touch: A Survey Jing Gao et.al. 2405.12779v1 null
2024-05-21 SYMPLEX: Controllable Symbolic Music Generation using Simplex Diffusion with Vocabulary Priors Nicolas Jonason et.al. 2405.12666v1 null
2024-05-21 Exploration of Masked and Causal Language Modelling for Text Generation Nicolo Micheletti et.al. 2405.12630v1 null
2024-05-21 Mamba in Speech: Towards an Alternative to Self-Attention Xiangyu Zhang et.al. 2405.12609v1 null
2024-05-21 Is Dataset Quality Still a Concern in Diagnosis Using Large Foundation Model? Ziqin Lin et.al. 2405.12584v1 null
2024-05-21 Phishing Email Detection Using Inputs From Artificial Intelligence Mithün Paul et.al. 2405.12494v1 null
2024-05-21 Resolving Word Vagueness with Scenario-guided Adapter for Natural Language Inference Yonghao Liu et.al. 2405.12434v1 null
2024-05-20 Developers' Perceptions on the Impact of ChatGPT in Software Development: A Survey Thiago S. Vaillant et.al. 2405.12195v1 null
2024-05-20 Unveiling factors influencing judgment variation in Sentiment Analysis with Natural Language Processing and Statistics Olga Kellert et.al. 2405.12055v1 null
2024-05-20 Continuous Sign Language Recognition with Adapted Conformer via Unsupervised Pretraining Neena Aloysius et.al. 2405.12018v1 null
2024-05-20 Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification Weilian Zhou et.al. 2405.12003v1 link
2024-05-20 A review on the use of large language models as virtual tutors Silvia García-Méndez et.al. 2405.11983v1 null
2024-05-20 Biomedical Entity Linking for Dutch: Fine-tuning a Self-alignment BERT Model on an Automatically Generated Wikipedia Corpus Fons Hartendorp et.al. 2405.11941v1 link
2024-05-20 Beyond MLE: Investigating SEARNN for Low-Resourced Neural Machine Translation Chris Emezue et.al. 2405.11819v1 null
2024-05-20 FedCAda: Adaptive Client-Side Optimization for Accelerated and Stable Federated Learning Liuzhi Zhou et.al. 2405.11811v1 null
2024-05-20 Inverse Design of Metal-Organic Frameworks Using Quantum Natural Language Processing Shinyoung Kang et.al. 2405.11783v1 null
2024-05-20 Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques Siva Rajesh Kasa et.al. 2405.11775v1 null
2024-05-17 A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers Kaiyu Huang et.al. 2405.10936v1 link
2024-05-17 High-dimensional multiple imputation (HDMI) for partially observed confounders including natural language processing-derived auxiliary covariates Janick Weberpals et.al. 2405.10925v1 null
2024-05-17 Prioritising GitHub Priority Labels James Caddy et.al. 2405.10891v1 null
2024-05-17 Natural Language Processing for Requirements Traceability Jin L. C. Guo et.al. 2405.10845v1 null
2024-05-17 INDUS: Effective and Efficient Language Models for Scientific Applications Bishwaranjan Bhattacharjee et.al. 2405.10725v1 null
2024-05-17 Empowering Prior to Court Legal Analysis: A Transparent and Accessible Dataset for Defensive Statement Classification and Interpretation Yannis Spyridis et.al. 2405.10702v1 null
2024-05-17 Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges Xiaoming Shi et.al. 2405.10630v1 null
2024-05-17 Dynamic data sampler for cross-language transfer learning in large language models Yudong Li et.al. 2405.10626v1 link
2024-05-17 Feature-based Low-Rank Compression of Large Language Models via Bayesian Optimization Yixin Ji et.al. 2405.10616v1 link
2024-05-17 Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset Jie Zhu et.al. 2405.10542v1 link
2024-05-16 Mitigating Text Toxicity with Counterfactual Generation Milan Bhan et.al. 2405.09948v1 null
2024-05-16 On the relevance of pre-neural approaches in natural language processing pedagogy Aditya Joshi et.al. 2405.09854v1 null
2024-05-16 Optimization Techniques for Sentiment Analysis Based on LLM (GPT-3) Tong Zhan et.al. 2405.09770v1 null
2024-05-15 SCI 3.0: A Web-based Schema Curation Interface for Graphical Event Representations Reece Suchocki et.al. 2405.09733v1 null
2024-05-15 Enhancing Maritime Trajectory Forecasting via H3 Index and Causal Language Modelling (CLM) Nicolas Drapier et.al. 2405.09596v1 null
2024-05-15 Facilitating Opinion Diversity through Hybrid NLP Approaches Michiel van der Meer et.al. 2405.09439v1 null
2024-05-15 Comparing the Efficacy of GPT-4 and Chat-GPT in Mental Health Care: A Blind Assessment of Large Language Models for Psychological Support Birger Moell et.al. 2405.09300v1 null
2024-05-15 Positional Knowledge is All You Need: Position-induced Transformer (PiT) for Operator Learning Junfeng Chen et.al. 2405.09285v1 null
2024-05-15 Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy Feng Wang et.al. 2405.09014v1 link
2024-05-14 Challenges and Opportunities in Text Generation Explainability Kenza Amara et.al. 2405.08468v1 null
2024-05-13 A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection Matthew Korban et.al. 2405.08204v1 null
2024-05-13 Benchmarking Retrieval-Augmented Large Language Models in Biomedical NLP: Application, Robustness, and Self-Awareness Mingchen Li et.al. 2405.08151v1 null
2024-05-14 PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition Ziyang Zhang et.al. 2405.07932v2 link
2024-05-13 A Comprehensive Analysis of Static Word Embeddings for Turkish Karahan Sarıtaş et.al. 2405.07778v1 link
2024-05-13 Challenges and Opportunities of NLP for HR Applications: A Discussion Paper Jochen L. Leidner et.al. 2405.07766v1 null
2024-05-13 Constructing a BPE Tokenization DFA Martin Berglund et.al. 2405.07671v1 null
2024-05-13 Backdoor Removal for Generative Large Language Models Haoran Li et.al. 2405.07667v1 null
2024-05-13 AIris: An AI-powered Wearable Assistive Device for the Visually Impaired Dionysia Danai Brilli et.al. 2405.07606v1 null
2024-05-13 Evaluation of Retrieval-Augmented Generation: A Survey Hao Yu et.al. 2405.07437v1 link
2024-05-11 Advanced Natural-based interaction for the ITAlian language: LLaMAntino-3-ANITA Marco Polignano et.al. 2405.07101v1 null
2024-05-11 TacoERE: Cluster-aware Compression for Event Relation Extraction Yong Guan et.al. 2405.06890v1 null
2024-05-10 PLeak: Prompt Leaking Attacks against Large Language Model Applications Bo Hui et.al. 2405.06823v1 link
2024-05-10 Explaining Text Similarity in Transformer Models Alexandros Vasileiou et.al. 2405.06604v1 link
2024-05-10 What Can Natural Language Processing Do for Peer Review? Ilia Kuznetsov et.al. 2405.06563v1 link
2024-05-10 Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification Yaoqin Ye et.al. 2405.06468v1 null
2024-05-10 LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play Li-Chun Lu et.al. 2405.06373v1 null
2024-05-10 A NLP Approach to "Review Bombing" in Metacritic PC Videogames User Ratings Javier Coronado-Blázquez et.al. 2405.06306v1 null
2024-05-09 Creating Geospatial Trajectories from Human Trafficking Text Corpora Saydeh N. Karabatis et.al. 2405.06130v1 null
2024-05-09 Narrative to Trajectory (N2T+): Extracting Routes of Life or Death from Human Trafficking Text Corpora Saydeh N. Karabatis et.al. 2405.06129v1 null
2024-05-09 Collaborative Design for Job-Seekers with Autism: A Conceptual Framework for Future Research Sungsoo Ray Hong et.al. 2405.06078v1 null
2024-05-09 Natural Language Processing RELIES on Linguistics Juri Opitz et.al. 2405.05966v1 null
2024-05-09 Revitalising Stagecraft: NLP-Driven Sentiment Analysis for Traditional Theater Revival Saikat Samanta et.al. 2405.05813v1 null
2024-05-09 Enhancing Suicide Risk Detection on Social Media through Semi-Supervised Deep Label Smoothing Matthew Squires et.al. 2405.05795v1 null
2024-05-09 Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language Ronny Paul et.al. 2405.05777v1 null
2024-05-09 Computational lexical analysis of Flamenco genres Pablo Rosillo-Rodes et.al. 2405.05723v1 null
2024-05-09 Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM Xikang Yang et.al. 2405.05610v1 link
2024-05-09 A Survey on Backbones for Deep Video Action Recognition Zixuan Tang et.al. 2405.05584v1 null
2024-05-08 Enhancing Holonic Architecture with Natural Language Processing for System of Systems Muhammad Ashfaq et.al. 2405.05365v1 null
2024-05-08 CARE-SD: Classifier-based analysis for recognizing and eliminating stigmatizing and doubt marker labels in electronic health records: model development and validation Drew Walker et.al. 2405.05204v1 null
2024-05-08 An Artificial Intelligence Approach for Interpreting Creative Combinational Designs Liuqing Chen et.al. 2405.04985v1 null
2024-05-08 Improving Long Text Understanding with Knowledge Distilled from Summarization Model Yan Liu et.al. 2405.04955v1 null
2024-05-08 Fine-tuning Pre-trained Named Entity Recognition Models For Indian Languages Sankalp Bahad et.al. 2405.04829v1 null
2024-05-08 Zero-shot LLM-guided Counterfactual Generation for Text Amrita Bhattacharjee et.al. 2405.04793v1 null
2024-05-08 CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization Zheyan Qu et.al. 2405.04781v1 null
2024-05-07 Bridging the Bosphorus: Advancing Turkish Large Language Models through Strategies for Low-Resource Language Adaptation and Benchmarking Emre Can Acikgoz et.al. 2405.04685v1 null
2024-05-07 Vision Mamba: A Comprehensive Survey and Taxonomy Xiao Liu et.al. 2405.04404v1 link
2024-05-07 Revisiting character-level adversarial attacks Elias Abad Rocamora et.al. 2405.04346v1 link
2024-05-07 NOVA: NoC-based Vector Unit for Mapping Attention Layers on a CNN Accelerator Mohit Upadhyay et.al. 2405.04206v1 null
2024-05-07 LingML: Linguistic-Informed Machine Learning for Enhanced Fake News Detection Jasraj Singh et.al. 2405.04165v1 null
2024-05-07 Sign2GPT: Leveraging Large Language Models for Gloss-Free Sign Language Translation Ryan Wong et.al. 2405.04164v1 null
2024-05-07 GPT-Enabled Cybersecurity Training: A Tailored Approach for Effective Awareness Nabil Al-Dhamari et.al. 2405.04138v1 null
2024-05-07 Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning Karim Galliamov et.al. 2405.04126v1 link
2024-05-07 Evaluating Text Summaries Generated by Large Language Models Using OpenAI's GPT Hassan Shakil et.al. 2405.04053v1 null
2024-05-07 Sketch Then Generate: Providing Incremental User Feedback and Guiding LLM Code Generation through Language-Oriented Code Sketches Chen Zhu-Tian et.al. 2405.03998v1 null
2024-05-07 A Roadmap for Multilingual, Multimodal Domain Independent Deception Detection Dainis Boumber et.al. 2405.03920v1 null
2024-05-06 Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment Abhinav Agarwalla et.al. 2405.03594v1 null
2024-05-06 Gaussian Stochastic Weight Averaging for Bayesian Low-Rank Adaptation of Large Language Models Emre Onal et.al. 2405.03425v1 null
2024-05-06 Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond Jiuxiang Gu et.al. 2405.03251v1 null
2024-05-06 Vietnamese AI Generated Text Detection Quang-Dan Tran et.al. 2405.03206v1 null
2024-05-06 CRAFT: Extracting and Tuning Cultural Instructions from the Wild Bin Wang et.al. 2405.03138v1 link
2024-05-06 WDMoE: Wireless Distributed Large Language Models with Mixture of Experts Nan Xue et.al. 2405.03131v1 null
2024-05-05 Unraveling the Dominance of Large Language Models Over Transformer Models for Bangla Natural Language Inference: A Comprehensive Study Fatema Tuj Johora Faria et.al. 2405.02937v1 link
2024-05-05 Exploring the Improvement of Evolutionary Computation via Large Language Models Jinyu Cai et.al. 2405.02876v1 null
2024-05-05 HuixiangDou-CR: Coreference Resolution in Group Chats Huanjun Kong et.al. 2405.02817v1 link
2024-05-05 Structural Balance in Real-World Social Networks: Incorporating Direction and Transitivity in Measuring Partial Balance Rezvaneh Rezapour et.al. 2405.02798v1 null
2024-05-03 Impact of emoji exclusion on the performance of Arabic sarcasm detection models Ghalyah H. Aleryani et.al. 2405.02195v1 null
2024-05-03 Single and Multi-Hop Question-Answering Datasets for Reticular Chemistry with GPT-4-Turbo Nakul Rampal et.al. 2405.02128v1 null
2024-05-03 Comparative Analysis of Retrieval Systems in the Real World Dmytro Mozolevskyi et.al. 2405.02048v1 null
2024-05-03 The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification Minh Duc Bui et.al. 2405.02010v1 null
2024-05-03 Conformal Prediction for Natural Language Processing: A Survey Margarida M. Campos et.al. 2405.01976v1 null
2024-05-03 Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders and Identifying Distinct Features Chuanbo Hu et.al. 2405.01799v1 null
2024-05-02 Question Suggestion for Conversational Shopping Assistants Using Product Metadata Nikhita Vedula et.al. 2405.01738v1 null
2024-05-02 Automatically Extracting Numerical Results from Randomized Controlled Trials with Large Language Models Hye Sun Yun et.al. 2405.01686v1 link
2024-05-02 Leveraging Prompt-Learning for Structured Information Extraction from Crohn's Disease Radiology Reports in a Low-Resource Language Liam Hazan et.al. 2405.01682v1 null
2024-05-02 1-Diffractor: Efficient and Utility-Preserving Text Obfuscation Leveraging Word-Level Metric Differential Privacy Stephen Meisenbacher et.al. 2405.01678v1 link
2024-05-02 Analyzing the Role of Semantic Representations in the Era of Large Language Models Zhijing Jin et.al. 2405.01502v1 link
2024-05-02 "In-Context Learning" or: How I learned to stop worrying and love "Applied Information Retrieval" Andrew Parry et.al. 2405.01116v1 null
2024-05-01 A Legal Framework for Natural Language Processing Model Training in Portugal Rúben Almeida et.al. 2405.00536v1 null
2024-05-01 DAM: A Universal Dual Attention Mechanism for Multimodal Timeseries Cryptocurrency Trend Forecasting Yihang Fu et.al. 2405.00522v1 link
2024-05-01 Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning Lucas-Andreï Thil et.al. 2405.00516v1 null
2024-05-01 Thread review sentimental analysis with tkinter GUI & tableau dashboard Robin Donal et.al. 2405.00377v1 link
2024-05-01 AdaMoLE: Fine-Tuning Large Language Models with Adaptive Mixture of Low-Rank Adaptation Experts Zefang Liu et.al. 2405.00361v1 link
2024-05-01 A Survey on Deep Active Learning: Recent Advances and New Frontiers Dongyuan Li et.al. 2405.00334v1 null
2024-05-01 Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition Dongyuan Li et.al. 2405.00307v1 link
2024-05-01 ASAM: Boosting Segment Anything Model with Adversarial Tuning Bo Li et.al. 2405.00256v1 link
2024-04-30 RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing Yucheng Hu et.al. 2404.19543v1 link
2024-04-30 DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets Xiaoyu Huang et.al. 2404.19264v1 null
2024-04-30 Mix of Experts Language Model for Named Entity Recognition Xinwei Chen et.al. 2404.19192v1 null
2024-04-30 Revenge of the Fallen? Recurrent Models Match Transformers at Predicting Human Language Comprehension Metrics James A. Michaelov et.al. 2404.19178v1 null
2024-04-29 A Framework for Real-time Safeguarding the Text Generation of Large Language Ximing Dong et.al. 2404.19048v1 null
2024-04-29 Unsupervised Binary Code Translation with Application to Code Similarity Detection and Vulnerability Discovery Iftakhar Ahmad et.al. 2404.19025v1 link
2024-04-29 Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism Lei Kang et.al. 2404.19024v1 link
2024-04-29 Computational Job Market Analysis with Natural Language Processing Mike Zhang et.al. 2404.18977v1 link
2024-04-29 Overcoming Knowledge Barriers: Online Imitation Learning from Observation with Pretrained World Models Xingyuan Zhang et.al. 2404.18896v1 link
2024-04-29 Towards A Structured Overview of Use Cases for Natural Language Processing in the Legal Domain: A German Perspective Juraj Vladika et.al. 2404.18759v1 null
2024-04-29 Reinforcement Learning Problem Solving with Large Language Models Sina Gholamian et.al. 2404.18638v1 null
2024-04-29 From ChatGPT, DALL-E 3 to Sora: How has Generative AI Changed Digital Humanities Research and Services? Jiangfeng Liu et.al. 2404.18518v1 null
2024-04-29 Quantitative Tools for Time Series Analysis in Natural Language Processing: A Practitioners Guide W. Benedikt Schmal et.al. 2404.18499v1 link
2024-04-28 Mapping 'when'-clauses in Latin American and Caribbean languages: an experiment in subtoken-based typology Nilo Pedrazzini et.al. 2404.18257v1 null
2024-04-28 PatentGPT: A Large Language Model for Intellectual Property Zilong Bai et.al. 2404.18255v1 null
2024-04-28 4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBs Minjie Wang et.al. 2404.18209v1 link
2024-04-28 Exploring the Robustness of In-Context Learning with Noisy Labels Chen Cheng et.al. 2404.18191v1 link
2024-04-28 Application and practice of AI technology in quantitative investment Shuochen Bi et.al. 2404.18184v1 null
2024-04-26 Transformer For Low-frequency Extrapolating of Seismic Data Zheng Cong et.al. 2404.17437v1 null
2024-04-26 Evaluation of Geographical Distortions in Language Models: A Crucial Step Towards Equitable Representations Rémy Decoupes et.al. 2404.17401v1 null
2024-04-26 M3BAT: Unsupervised Domain Adaptation for Multimodal Mobile Sensing with Multi-Branch Adversarial Training Lakmal Meegahapola et.al. 2404.17391v1 null
2024-04-26 Can a Multichoice Dataset be Repurposed for Extractive Question Answering? Teresa Lynn et.al. 2404.17342v1 null
2024-04-26 Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM Xuan Zhang et.al. 2404.17283v1 link
2024-04-26 Prompting Towards Alleviating Code-Switched Data Scarcity in Under-Resourced Languages with GPT as a Pivot Michelle Terblanche et.al. 2404.17216v1 null
2024-04-26 Quantifying Memorization of Domain-Specific Pre-trained Language Models using Japanese Newspaper and Paywalls Shotaro Ishihara et.al. 2404.17143v1 null
2024-04-26 Process Mining Embeddings: Learning Vector Representations for Petri Nets Juan G. Colonna et.al. 2404.17129v1 link
2024-04-26 Text Sentiment Analysis and Classification Based on Bidirectional Gated Recurrent Units (GRUs) Model Wei Xu et.al. 2404.17123v1 null
2024-04-26 2M-NER: Contrastive Learning for Multilingual and Multimodal NER with Language and Modal Fusion Dongsheng Wang et.al. 2404.17122v1 null
2024-04-25 EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning Hongxia Xie et.al. 2404.16670v1 link
2024-04-25 ProbGate at EHRSQL 2024: Enhancing SQL Query Generation Accuracy through Probabilistic Threshold Filtering and Error Handling Sangryul Kim et.al. 2404.16659v1 link
2024-04-25 Análise de ambiguidade linguística em modelos de linguagem de grande escala (LLMs) Lavínia de Carvalho Moraes et.al. 2404.16653v1 null
2024-04-25 U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF Xingchen Song et.al. 2404.16407v1 null
2024-04-25 LLM-Based Section Identifiers Excel on Open Source but Stumble in Real World Applications Saranya Krishnamoorthy et.al. 2404.16294v1 link
2024-04-24 Towards Efficient Patient Recruitment for Clinical Trials: Application of a Prompt-Based Learning Model Mojdeh Rahmanian et.al. 2404.16198v1 null
2024-04-24 Chat2Scenario: Scenario Extraction From Dataset Through Utilization of Large Language Model Yongqi Zhao et.al. 2404.16147v1 link
2024-04-24 Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges Badri Narayana Patro et.al. 2404.16112v1 link
2024-04-24 Semantic Routing for Enhanced Performance of LLM-Assisted Intent-Based 5G Core Network Management and Orchestration Dimitrios Michael Manias et.al. 2404.15869v1 null
2024-04-24 Porting Large Language Models to Mobile Devices for Question Answering Hannes Fassold et.al. 2404.15851v1 null
2024-04-24 Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations? Hossein Salami et.al. 2404.15578v1 null
2024-04-23 Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information Chihiro Taguchi et.al. 2404.15501v1 link
2024-04-23 IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents Jean-Philippe Corbeil et.al. 2404.15488v1 link
2024-04-23 Feature Distribution Shift Mitigation with Contrastive Pretraining for Intrusion Detection Weixing Wang et.al. 2404.15382v1 null
2024-04-22 MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA based Mixture of Experts Dengchun Li et.al. 2404.15159v1 link
2024-04-23 Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-case Muhammad Asif Auyb et.al. 2404.14977v1 null
2024-04-23 Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models Yang Tan et.al. 2404.14850v1 link
2024-04-23 Modeling the Sacred: Considerations when Using Considerations when Using Religious Texts in Natural Language Processing Ben Hutchinson et.al. 2404.14740v1 null
2024-04-23 Learning Word Embedding with Better Distance Weighting and Window Size Scheduling Chaohao Yang et.al. 2404.14631v1 null
2024-04-22 Automated Long Answer Grading with RiceChem Dataset Shashank Sonkar et.al. 2404.14316v1 link
2024-04-22 Marking: Visual Grading with Highlighting Errors and Annotating Missing Bits Shashank Sonkar et.al. 2404.14301v1 null
2024-04-22 EnzChemRED, a rich enzyme chemistry relation extraction dataset Po-Ting Lai et.al. 2404.14209v1 null
2024-04-22 Protecting Your LLMs with Information Bottleneck Zichuan Liu et.al. 2404.13968v1 link
2024-04-22 MARIO Eval: Evaluate Your Math LLM with your Math LLM--A mathematical dataset evaluation toolkit Boning Zhang et.al. 2404.13925v1 link
2024-04-21 Mixture of LoRA Experts Xun Wu et.al. 2404.13628v1 link
2024-04-21 Parameter Efficient Fine Tuning: A Comprehensive Analysis Across Applications Charith Chandra Sai Balne et.al. 2404.13506v1 null
2024-04-20 Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing Yuang Liu et.al. 2404.13434v1 null
2024-04-20 Retrieval-Augmented Generation-based Relation Extraction Sefika Efeoglu et.al. 2404.13397v1 link
2024-04-20 MahaSQuAD: Bridging Linguistic Divides in Marathi Question-Answering Ruturaj Ghatage et.al. 2404.13364v1 link
2024-04-19 FinLangNet: A Novel Deep Learning Framework for Credit Risk Prediction Using Linguistic Analogy in Financial Data Yu Lei et.al. 2404.13004v1 link
2024-04-19 LiMe: a Latin Corpus of Late Medieval Criminal Sentences Alessandra Bassani et.al. 2404.12829v1 null
2024-04-19 Large Language Model Supply Chain: A Research Agenda Shenao Wang et.al. 2404.12736v1 null
2024-04-19 Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation Lasal Jayawardena et.al. 2404.12596v1 null
2024-04-18 GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction Urchade Zaratiana et.al. 2404.12491v1 link
2024-04-18 NLP-enabled trajectory map-matching in urban road networks using transformer sequence-to-sequence model Sevin Mohammadi et.al. 2404.12460v1 null
2024-04-18 RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation Chao Jin et.al. 2404.12457v1 null
2024-04-18 Point-In-Context: Understanding Point Cloud via In-Context Learning Mengyuan Liu et.al. 2404.12352v1 link
2024-04-18 Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting Nicholas Harris et.al. 2404.12283v1 null
2024-04-18 EuSQuAD: Automatically Translated and Aligned SQuAD2.0 for Basque Aitor García-Pablos et.al. 2404.12177v1 link
2024-04-18 Stance Detection on Social Media with Fine-Tuned Large Language Models İlker Gül et.al. 2404.12171v1 null
2024-04-18 Enhance Robustness of Language Models Against Variation Attack through Graph Integration Zi Xiong et.al. 2404.12014v1 null
2024-04-18 ParaFusion: A Large-Scale LLM-Driven English Paraphrase Dataset Infused with High-Quality Lexical and Syntactic Diversity Lasal Jayawardena et.al. 2404.12010v1 null
2024-04-18 EVIT: Event-Oriented Instruction Tuning for Event Reasoning Zhengwei Tao et.al. 2404.11978v1 null
2024-04-18 Sharing Parameter by Conjugation for Knowledge Graph Embeddings in Complex Space Xincan Feng et.al. 2404.11809v1 link
2024-04-17 REQUAL-LM: Reliability and Equity through Aggregation in Large Language Models Sana Ebrahimi et.al. 2404.11782v1 null
2024-04-17 Pretraining Billion-scale Geospatial Foundational Models on Frontier Aristeidis Tsaris et.al. 2404.11706v1 null
2024-04-17 Related Work and Citation Text Generation: A Survey Xiangci Li et.al. 2404.11588v1 null
2024-04-17 Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis Soyoung Yang et.al. 2404.11539v1 null
2024-04-17 GenFighter: A Generative and Evolutive Textual Attack Removal Md Athikul Islam et.al. 2404.11538v1 null
2024-04-17 Research on emotionally intelligent dialogue generation based on automatic dialogue system Jin Wang et.al. 2404.11447v1 null
2024-04-17 Low-Cost Language Models: Survey and Performance Evaluation on Python Code Generation Jessica López Espejel et.al. 2404.11160v1 null
2024-04-17 Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions Leena Mathur et.al. 2404.11023v1 null
2024-04-16 Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training Pavel Denisov et.al. 2404.10922v1 link
2024-04-16 A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents Wiam Adnan et.al. 2404.10848v1 null
2024-04-16 A Sentiment Analysis of Medical Text Based on Deep Learning Yinan Chen et.al. 2404.10503v1 null
2024-04-16 Towards Complex Ontology Alignment using Large Language Models Reihaneh Amini et.al. 2404.10329v1 null
2024-04-16 Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs Woomin Song et.al. 2404.10308v1 link
2024-04-16 Future Language Modeling from Temporal Document History Changmao Li et.al. 2404.10297v1 link
2024-04-15 LegalPro-BERT: Classification of Legal Provisions by fine-tuning BERT Large Language Model Amit Tewari et.al. 2404.10097v1 link
2024-04-15 Detecting AI Generated Text Based on NLP and Machine Learning Approaches Nuzhat Prova et.al. 2404.10032v1 null
2024-04-15 How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model Hanxue Gu et.al. 2404.09957v1 link
2024-04-15 AI-Driven Statutory Reasoning via Software Engineering Methods Rohan Padhye et.al. 2404.09868v1 null
2024-04-15 Reimagining Self-Adaptation in the Age of Large Language Models Raghav Donakanti et.al. 2404.09866v1 null
2024-04-15 KG-CTG: Citation Generation through Knowledge Graph-guided Large Language Models Avinash Anand et.al. 2404.09763v1 null
2024-04-15 Resilience of Large Language Models for Noisy Instructions Bin Wang et.al. 2404.09754v1 null
2024-04-15 State Space Model for New-Generation Network Alternative to Transformers: A Survey Xiao Wang et.al. 2404.09516v1 link
2024-04-15 Automatic Knowledge Graph Construction for Judicial Cases Jie Zhou et.al. 2404.09416v1 null
2024-04-15 A Large-Scale Evaluation of Speech Foundation Models Shu-wen Yang et.al. 2404.09385v1 link
2024-04-14 Hierarchical Attention Models for Multi-Relational Graphs Roshni G. Iyer et.al. 2404.09365v1 link
2024-04-14 Counteracting Concept Drift by Learning with Future Malware Predictions Branislav Bosansky et.al. 2404.09352v1 null
2024-04-14 A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion Zihan Cao et.al. 2404.09293v1 null
2024-04-14 Unveiling LLM Evaluation Focused on Metrics: Challenges and Solutions Taojun Hu et.al. 2404.09135v1 null
2024-04-13 Multilingual Evaluation of Semantic Textual Relatedness Sharvi Endait et.al. 2404.09047v1 null
2024-04-13 WikiSplit++: Easy Data Refinement for Split and Rephrase Hayato Tsukagoshi et.al. 2404.09002v1 link
2024-04-13 Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives Yidan Liu et.al. 2404.08926v1 null
2024-04-10 An inclusive review on deep learning techniques and their scope in handwriting recognition Sukhdeep Singh et.al. 2404.08011v1 null
2024-04-11 AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports Lukas Lange et.al. 2404.07765v1 link
2024-04-11 ODA: Observation-Driven Agent for integrating LLMs and Knowledge Graphs Lei Sun et.al. 2404.07677v1 link
2024-04-11 CAT: Contrastive Adapter Training for Personalized Image Generation Jae Wan Park et.al. 2404.07554v1 link
2024-04-11 Behavior Trees Enable Structured Programming of Language Model Agents Richard Kelley et.al. 2404.07439v1 link
2024-04-11 Towards Robustness of Text-to-Visualization Translation against Lexical and Phrasal Variability Jinwei Lu et.al. 2404.07135v2 null
2024-04-10 DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space Jianxiang Xiang et.al. 2404.06760v1 null
2024-04-12 Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness Xincan Feng et.al. 2404.06714v2 null
2024-04-10 CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers Longwei Zou et.al. 2404.06709v1 null
2024-04-09 Perplexed: Understanding When Large Language Models are Confused Nathan Cooper et.al. 2404.06634v1 null
2024-04-09 ClinLinker: Medical Entity Linking of Clinical Concept Mentions in Spanish Fernando Gallego et.al. 2404.06367v1 null
2024-04-09 Finding fake reviews in e-commerce platforms by using hybrid algorithms Mathivanan Periasamy et.al. 2404.06339v1 null
2024-04-09 Exploring the True Potential: Evaluating the Black-box Optimization Capability of Large Language Models Beichen Huang et.al. 2404.06290v1 null
2024-04-09 VI-OOD: A Unified Representation Learning Framework for Textual Out-of-distribution Detection Li-Ming Zhan et.al. 2404.06217v1 link
2024-04-09 Protection of Guizhou Miao Batik Culture Based on Knowledge Graph and Deep Learning Huafeng Quan et.al. 2404.06168v1 null
2024-04-09 Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports Tianyu Cao et.al. 2404.06162v1 null
2024-04-09 Mansformer: Efficient Transformer of Mixed Attention for Image Deblurring and Beyond Pin-Hung Kuo et.al. 2404.06135v1 null
2024-04-09 FLEX: FLEXible Federated Learning Framework Francisco Herrera et.al. 2404.06127v1 link
2024-04-09 All in One: An Empirical Study of GPT for Few-Shot Aspect-Based Sentiment Anlaysis Baoxing Jiang et.al. 2404.06063v1 null
2024-04-09 Privacy Preserving Prompt Engineering: A Survey Kennedy Edemacu et.al. 2404.06001v1 null
2024-04-08 A Large-Scale Exploration of $μ$ -Transfer Lucas Lingle et.al. 2404.05728v1 link
2024-04-08 Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding Ahmad Idrissi-Yaghir et.al. 2404.05694v1 null
2024-04-08 Causality Extraction from Nuclear Licensee Event Reports Using a Hybrid Framework Sohag Rahman et.al. 2404.05656v1 null
2024-04-08 LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking Faren Yan et.al. 2404.05624v1 null
2024-04-08 3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering Qingyuan Zhou et.al. 2404.05522v1 null
2024-04-08 Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations Yiming Li et.al. 2404.05415v1 null
2024-04-08 NLP Progress in Indigenous Latin American Languages Atnafu Lambebo Tonja et.al. 2404.05365v1 null
2024-04-08 Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods Roopkatha Dey et.al. 2404.05159v1 null
2024-04-08 EcoVerse: An Annotated Twitter Dataset for Eco-Relevance Classification, Environmental Impact Analysis, and Stance Detection Francesca Grasso et.al. 2404.05133v1 link
2024-04-07 Adapting LLMs for Efficient Context Processing through Soft Prompt Compression Cangqing Wang et.al. 2404.04997v1 null
2024-04-05 player2vec: A Language Modeling Approach to Understand Player Behavior in Games Tianze Wang et.al. 2404.04234v1 null
2024-04-05 Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving Gulsum Yigit et.al. 2404.03938v1 null
2024-04-05 Simple Techniques for Enhancing Sentence Embeddings in Generative Language Models Bowen Zhang et.al. 2404.03921v1 link
2024-04-05 A Bi-consolidating Model for Joint Relational Triple Extraction Xiaocheng Luo et.al. 2404.03881v1 null
2024-04-04 Understanding Language Modeling Paradigm Adaptations in Recommender Systems: Lessons Learned and Open Challenges Lemei Zhang et.al. 2404.03788v1 link
2024-04-04 Dendrites endow artificial neural networks with accurate, robust and parameter-efficient learning Spyridon Chavlis et.al. 2404.03708v1 null
2024-04-04 Knowledge Graph Representation for Political Information Sources Tinatin Osmonova et.al. 2404.03437v1 null
2024-04-04 ChangeMamba: Remote Sensing Change Detection with Spatio-Temporal State Space Model Hongruixuan Chen et.al. 2404.03425v1 link
2024-04-04 Towards Pareto Optimal Throughput in Small Language Model Serving Pol G. Recasens et.al. 2404.03353v1 null
2024-04-04 A Comparative Analysis of Word-Level Metric Differential Privacy: Benchmarking The Privacy-Utility Trade-off Stephen Meisenbacher et.al. 2404.03324v1 link
2024-04-04 The Death of Feature Engineering? BERT with Linguistic Features on SQuAD 2.0 Jiawei Li et.al. 2404.03184v1 null
2024-04-03 Construction of Functional Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model Yanpeng Ye et.al. 2404.03080v1 null
2024-04-03 GPT-DETOX: An In-Context Learning-Based Paraphraser for Text Detoxification Ali Pesaranghader et.al. 2404.03052v1 null
2024-04-03 Automatic Prompt Selection for Large Language Models Viet-Tung Do et.al. 2404.02717v1 null
2024-04-03 Adversarial Attacks and Dimensionality in Text Classifiers Nandish Chattopadhyay et.al. 2404.02660v1 null
2024-04-03 Learn to Disguise: Avoid Refusal Responses in LLM's Defense via a Multi-agent Attacker-Disguiser Game Qianqiao Xu et.al. 2404.02532v1 null
2024-04-03 On the Efficiency and Robustness of Vibration-based Foundation Models for IoT Sensing: A Case Study Tomoyoshi Kimura et.al. 2404.02461v1 null
2024-04-03 Task Agnostic Architecture for Algorithm Induction via Implicit Composition Sahil J. Sindhi et.al. 2404.02450v1 null
2024-04-03 The Promises and Pitfalls of Using Language Models to Measure Instruction Quality in Education Paiheng Xu et.al. 2404.02444v1 null
2024-04-03 CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models Zaid Sheikh et.al. 2404.02408v1 link
2024-04-02 Corpus Considerations for Annotator Modeling and Scaling Olufunke O. Sarumi et.al. 2404.02340v1 link
2024-04-02 Comparative Study of Domain Driven Terms Extraction Using Large Language Models Sandeep Chataut et.al. 2404.02330v1 null
2024-04-02 Using Interpretation Methods for Model Enhancement Zhuo Chen et.al. 2404.02068v1 link
2024-04-02 BERTopic-Driven Stock Market Predictions: Unraveling Sentiment Insights Enmin Zhu et.al. 2404.02053v1 null
2024-04-02 Kallaama: A Transcribed Speech Dataset about Agriculture in the Three Most Widely Spoken Languages in Senegal Elodie Gauthier et.al. 2404.01991v1 link
2024-04-02 Team UTSA-NLP at SemEval 2024 Task 5: Prompt Ensembling for Argument Reasoning in Civil Procedures with GPT4 Dan Schumacher et.al. 2404.01961v1 link
2024-04-02 Classifying Graphemes in English Words Through the Application of a Fuzzy Inference System Samuel Rose et.al. 2404.01953v1 null
2024-04-02 Sentiment Analysis of Citations in Scientific Articles Using ChatGPT: Identifying Potential Biases and Conflicts of Interest Walid Hariri et.al. 2404.01800v1 null
2024-04-02 Can Humans Identify Domains? Maria Barrett et.al. 2404.01785v1 link
2024-04-02 M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets Gaurish Thakkar et.al. 2404.01753v1 null
2024-04-02 CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models Xuechen Liang et.al. 2404.01663v1 link
2024-04-02 mChartQA: A universal benchmark for multimodal Chart Question Answer based on Vision-Language Alignment and Reasoning Jingxuan Wei et.al. 2404.01548v1 null
2024-03-29 LayerNorm: A key component in parameter-efficient fine-tuning Taha ValizadehAslani et.al. 2403.20284v1 null
2024-03-29 ChatGPT v.s. Media Bias: A Comparative Study of GPT-3.5 and Fine-tuned Language Models Zehao Wen et.al. 2403.20158v1 null
2024-03-29 NLP for Counterspeech against Hate: A Survey and How-To Guide Helena Bonaldi et.al. 2403.20103v1 null
2024-03-29 Cross-Lingual Transfer Robustness to Lower-Resource Languages on Adversarial Datasets Shadi Manafi et.al. 2403.20056v1 link
2024-03-29 Colorful Cutout: Enhancing Image Data Augmentation with Curriculum Learning Juhwan Choi et.al. 2403.20012v1 null
2024-03-29 MANGO: A Benchmark for Evaluating Mapping and Navigation Abilities of Large Language Models Peng Ding et.al. 2403.19913v1 link
2024-03-28 Natural Language, AI, and Quantum Computing in 2024: Research Ingredients and Directions in QNLP Dominic Widdows et.al. 2403.19758v1 null
2024-03-28 Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data Shan Chen et.al. 2403.19511v1 link
2024-03-29 Uncovering Misattributed Suicide Causes through Annotation Inconsistency Detection in Death Investigation Notes Song Wang et.al. 2403.19432v2 link
2024-03-28 EthioMT: Parallel Corpus for Low-resource Ethiopian Languages Atnafu Lambebo Tonja et.al. 2403.19365v1 null
2024-03-28 A diverse Multilingual News Headlines Dataset from around the World Felix Leeb et.al. 2403.19352v1 link
2024-03-27 Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data Yuting Guo et.al. 2403.19031v1 null
2024-03-27 Resource Allocation in Large Language Model Integrated 6G Vehicular Networks Chang Liu et.al. 2403.19016v1 null
2024-03-27 A Survey on Large Language Models from Concept to Implementation Chen Wang et.al. 2403.18969v1 null
2024-03-27 Reshaping Free-Text Radiology Notes Into Structured Reports With Generative Transformers Laura Bergomi et.al. 2403.18938v1 link
2024-03-27 Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation Mateusz Klimaszewski et.al. 2403.18804v1 link
2024-03-27 3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation Ehsan Latif et.al. 2403.18778v1 null
2024-03-27 Transformers-based architectures for stroke segmentation: A review Yalda Zafari-Ghadim et.al. 2403.18637v1 null
2024-03-27 Debiasing Sentence Embedders through Contrastive Word Pairs Philip Kenneweg et.al. 2403.18555v1 link
2024-03-27 Neural Architecture Search for Sentence Classification with BERT Philip Kenneweg et.al. 2403.18547v1 link
2024-03-27 Faster Convergence for Transformer Fine-tuning with Line Search Methods Philip Kenneweg et.al. 2403.18506v1 link
2024-03-27 SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks Brian Formento et.al. 2403.18423v1 link
2024-03-27 Improving Attributed Text Generation of Large Language Models via Preference Learning Dongfang Li et.al. 2403.18381v1 null
2024-03-27 mALBERT: Is a Compact Multilingual BERT Model Still Worth It? Christophe Servan et.al. 2403.18338v1 null
2024-03-27 RankMamba, Benchmarking Mamba's Document Ranking Performance in the Era of Transformers Zhichao Xu et.al. 2403.18276v1 link
2024-03-26 OmniVid: A Generative Framework for Universal Video Understanding Junke Wang et.al. 2403.17935v1 link
2024-03-26 Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications Philip Lippmann et.al. 2403.17860v1 null
2024-03-26 ArabicaQA: A Comprehensive Dataset for Arabic Question Answering Abdelrahman Abdallah et.al. 2403.17848v1 link
2024-03-26 Graph Language Model (GLM): A new graph-based approach to detect social instabilities Wallyson Lemes de Oliveira et.al. 2403.17816v1 null
2024-03-26 Are Compressed Language Models Less Subgroup Robust? Leonidas Gee et.al. 2403.17811v1 link
2024-03-26 A Survey on Deep Learning and State-of-the-arts Applications Mohd Halim Mohd Noor et.al. 2403.17561v1 null
2024-03-26 Practical Applications of Advanced Cloud Services and Generative AI Systems in Medical Image Analysis Jingyu Xu et.al. 2403.17549v1 null
2024-03-26 An Empirical Study of ChatGPT-related projects on GitHub Zheng Lin et.al. 2403.17437v1 null
2024-03-26 Transcribing Bengali Text with Regional Dialects to IPA using District Guided Tokens S M Jishanul Islam et.al. 2403.17407v1 null
2024-03-26 Extracting Biomedical Entities from Noisy Audio Transcripts Nima Ebadi et.al. 2403.17363v1 null
2024-03-25 Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning? Shaoxiong Ji et.al. 2403.16777v1 null
2024-03-25 NSINA: A News Corpus for Sinhala Hansi Hettiarachchi et.al. 2403.16571v1 link
2024-03-25 Harnessing the power of LLMs for normative reasoning in MASs Bastin Tony Roy Savarimuthu et.al. 2403.16524v1 null
2024-03-25 Linguistically Differentiating Acts and Recalls of Racial Microaggressions on Social Media Uma Sushmitha Gunturi et.al. 2403.16514v1 null
2024-03-25 $\textit{LinkPrompt}$ : Natural and Universal Adversarial Attacks on Prompt-based Language Models Yue Xu et.al. 2403.16432v1 link
2024-03-24 Large Language Models in Biomedical and Health Informatics: A Bibliometric Review Huizi Yu et.al. 2403.16303v1 null
2024-03-24 Image Captioning in news report scenario Tianrui Liu et.al. 2403.16209v1 null
2024-03-24 Korean Bio-Medical Corpus (KBMC) for Medical Named Entity Recognition Sungjoo Byun et.al. 2403.16158v1 null
2024-03-24 A Survey on Lexical Ambiguity Detection and Word Sense Disambiguation Miuru Abeysiriwardana et.al. 2403.16129v1 null
2024-03-23 LlamBERT: Large-scale low-cost data annotation in NLP Bálint Csanády et.al. 2403.15938v1 link
2024-03-23 RAAMove: A Corpus for Analyzing Moves in Research Article Abstracts Hongzheng Li et.al. 2403.15872v1 null
2024-03-22 Towards Deep Learning Enabled Cybersecurity Risk Assessment for Microservice Architectures Majid Abdulsatar et.al. 2403.15169v1 null
2024-03-22 CHisIEC: An Information Extraction Corpus for Ancient Chinese History Xuemei Tang et.al. 2403.15088v1 null
2024-03-22 Construction of a Japanese Financial Benchmark for Large Language Models Masanori Hirano et.al. 2403.15062v1 link
2024-03-22 LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement Nicholas Lee et.al. 2403.15042v1 link
2024-03-22 On Zero-Shot Counterspeech Generation by LLMs Punyajoy Saha et.al. 2403.14938v1 link
2024-03-21 Reversible Jump Attack to Textual Classifiers with Modification Reduction Mingze Ni et.al. 2403.14731v1 link
2024-03-21 PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model Zheng Zhang et.al. 2403.14598v1 link
2024-03-21 ChatGPT Alternative Solutions: Large Language Models Survey Hanieh Alipour et.al. 2403.14469v1 null
2024-03-21 From Perils to Possibilities: Understanding how Human (and AI) Biases affect Online Fora Virginia Morini et.al. 2403.14298v1 null
2024-03-21 Dermacen Analytica: A Novel Methodology Integrating Multi-Modal Large Language Models with Machine Learning in tele-dermatology Dimitrios P. Panagoulias et.al. 2403.14243v1 null
2024-03-21 Extracting Emotion Phrases from Tweets using BART Mahdi Rezapour et.al. 2403.14050v1 null
2024-03-21 The NeurIPS 2023 Machine Learning for Audio Workshop: Affective Audio Benchmarks and Novel Data Alice Baird et.al. 2403.14048v1 null
2024-03-20 Leveraging Linguistically Enhanced Embeddings for Open Information Extraction Fauzan Farooqui et.al. 2403.13903v1 null
2024-03-20 EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation Atnafu Lambebo Tonja et.al. 2403.13737v1 null
2024-03-20 Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models Chengzhe Feng et.al. 2403.13588v1 null
2024-03-20 How Gender Interacts with Political Values: A Case Study on Czech BERT Models Adnan Al Ali et.al. 2403.13514v1 null
2024-03-20 Community Needs and Assets: A Computational Analysis of Community Conversations Md Towhidul Absar Chowdhury et.al. 2403.13272v1 link
2024-03-19 AdaFish: Fast low-rank parameter-efficient fine-tuning by using second-order information Jiang Hu et.al. 2403.13128v1 null
2024-03-19 Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Texts Sai Ashish Somayajula et.al. 2403.12918v1 link
2024-03-19 Comparing Explanation Faithfulness between Multilingual and Monolingual Fine-tuned Language Models Zhixue Zhao et.al. 2403.12809v1 link
2024-03-19 Quantixar: High-performance Vector Data Management System Gulshan Yadav et.al. 2403.12583v1 null
2024-03-19 Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices Sara Abdali et.al. 2403.12503v1 null
2024-03-19 Third-Party Language Model Performance Prediction from Instruction Rahul Nadkarni et.al. 2403.12413v1 link
2024-03-19 Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question Answering Yuan Gao et.al. 2403.12393v1 null
2024-03-19 AraPoemBERT: A Pretrained Language Model for Arabic Poetry Analysis Faisal Qarah et.al. 2403.12392v1 null
2024-03-19 Improving Generalizability of Extracting Social Determinants of Health Using Large Language Models through Prompt-tuning Cheng Peng et.al. 2403.12374v1 null
2024-03-18 Leveraging Large Language Models to Extract Information on Substance Use Disorder Severity from Clinical Notes: A Zero-shot Learning Approach Maria Mahbub et.al. 2403.12297v1 null
2024-03-18 Evaluating Named Entity Recognition: Comparative Analysis of Mono- and Multilingual Transformer Models on Brazilian Corporate Earnings Call Transcriptions Ramon Abilio et.al. 2403.12212v1 link
2024-03-17 ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization Mengsha Liu et.al. 2403.11236v1 link
2024-03-17 Multi-Objective Evolutionary Neural Architecture Search for Recurrent Neural Networks Reinhard Booysen et.al. 2403.11173v1 link
2024-03-17 Exploring Tokenization Strategies and Vocabulary Sizes for Enhanced Arabic Language Models Mohamed Taher Alrefaie et.al. 2403.11130v1 null
2024-03-17 RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning Javad Rafiei Asl et.al. 2403.11082v1 null
2024-03-17 Deep Learning-based Sentiment Analysis in Persian Language Mohammad Heydari et.al. 2403.11069v1 null
2024-03-16 DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages Fahim Faisal et.al. 2403.11009v1 link
2024-03-16 Energy-Based Models with Applications to Speech and Language Processing Zhijian Ou et.al. 2403.10961v1 null
2024-03-16 A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment Tianhe Wu et.al. 2403.10854v1 link
2024-03-16 Detecting Bias in Large Language Models: Fine-tuned KcBERT J. K. Lee et.al. 2403.10774v1 null
2024-03-15 A Multilingual Perspective on Probing Gender Bias Karolina Stańczak et.al. 2403.10699v1 null
2024-03-15 ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment Xiaofeng Wu et.al. 2403.10504v1 null
2024-03-15 TriSum: Learning Summarization Ability from Large Language Models with Structured Rationale Pengcheng Jiang et.al. 2403.10351v1 null
2024-03-15 NLP Verification: Towards a General Methodology for Certifying Robustness Marco Casadio et.al. 2403.10144v1 null
2024-03-15 Identifying Health Risks from Family History: A Survey of Natural Language Processing Techniques Xiang Dai et.al. 2403.09997v1 null
2024-03-15 ViTCN: Vision Transformer Contrastive Network For Reasoning Bo Song et.al. 2403.09962v1 null
2024-03-14 Fisher Mask Nodes for Language Model Merging Thennal D K et.al. 2403.09891v1 link
2024-03-14 Scaling Behavior of Machine Translation with Large Language Models under Prompt Injection Attacks Zhifan Sun et.al. 2403.09832v1 link
2024-03-14 Emotional Intelligence Through Artificial Intelligence : NLP and Deep Learning in the Analysis of Healthcare Texts Prashant Kumar Nag et.al. 2403.09762v1 null
2024-03-14 Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey Xiaoyu Liu et.al. 2403.09606v1 null
2024-03-14 PreConfig: A Pretrained Model for Automating Network Configuration Fuliang Li et.al. 2403.09369v1 null
2024-03-14 Exploring the Capabilities and Limitations of Large Language Models in the Electric Energy Sector Lin Dong et.al. 2403.09125v1 null
2024-03-14 Information Extraction: An application to the domain of hyper-local financial data on developing countries Abuzar Royesh et.al. 2403.09077v1 null
2024-03-13 Ethos: Rectifying Language Models in Orthogonal Parameter Space Lei Gao et.al. 2403.08994v1 null
2024-03-13 Predictive Analysis of Tuberculosis Treatment Outcomes Using Machine Learning: A Karnataka TB Data Study at a Scale SeshaSai Nath Chinagudaba et.al. 2403.08834v1 null
2024-03-13 SoK: Reducing the Vulnerability of Fine-tuned Language Models to Membership Inference Attacks Guy Amit et.al. 2403.08481v1 null
2024-03-13 Specification Overfitting in Artificial Intelligence Benjamin Roth et.al. 2403.08425v1 null
2024-03-12 VANP: Learning Where to See for Navigation with Self-Supervised Vision-Action Pre-Training Mohammad Nazeri et.al. 2403.08109v1 null
2024-03-12 Mechanics of Next Token Prediction with Self-Attention Yingcong Li et.al. 2403.08081v1 null
2024-03-12 Exploring Safety Generalization Challenges of Large Language Models via Code Qibing Ren et.al. 2403.07865v1 null
2024-03-12 Fine-tuning Neural Network Quantum States Riccardo Rende et.al. 2403.07795v1 null
2024-03-12 MoralBERT: Detecting Moral Values in Social Discourse Vjosa Preniqi et.al. 2403.07678v1 null
2024-03-12 A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions Quoc-Vinh Lai-Dang et.al. 2403.07542v1 null
2024-03-12 Generalised Graph Grammars for Natural Language Processing Oliver Robert Fox et.al. 2403.07481v1 null
2024-03-12 Knowledge Graph Large Language Model (KG-LLM) for Link Prediction Dong Shu et.al. 2403.07311v1 null
2024-03-11 LSTM-Based Text Generation: A Study on Historical Datasets Mustafa Abbas Hussein Hussein et.al. 2403.07087v1 null
2024-03-11 ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis Yanming Liu et.al. 2403.06932v1 link
2024-03-11 Application of Quantum Tensor Networks for Protein Classification Debarshi Kundu et.al. 2403.06890v1 null
2024-03-11 Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting Wenting Chen et.al. 2403.06835v1 null
2024-03-11 ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model Zhiwei Liu et.al. 2403.06765v1 link
2024-03-11 NLP4RE Tools: Classification, Overview, and Management Julian Frattini et.al. 2403.06685v1 null
2024-03-11 QuantTune: Optimizing Model Quantization with Adaptive Outlier-Driven Fine Tuning Jiun-Man Chen et.al. 2403.06497v1 null
2024-03-11 'One size doesn't fit all': Learning how many Examples to use for In-Context Learning for Improved Text Classification Manish Chandra et.al. 2403.06402v1 null
2024-03-11 Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource Languages Michael Andersland et.al. 2403.06354v1 link
2024-03-10 ArgMed-Agents: Explainable Clinical Decision Reasoning with Large Language Models via Argumentation Schemes Shengxin Hong et.al. 2403.06294v1 null
2024-03-10 In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model Junhui Yin et.al. 2403.06126v1 null
2024-03-08 Debiasing Large Visual Language Models Yi-Fan Zhang et.al. 2403.05262v1 link
2024-03-08 Benchmarking Large Language Models for Molecule Prediction Tasks Zhiqiang Zhong et.al. 2403.05075v1 link
2024-03-08 Can we obtain significant success in RST discourse parsing by using Large Language Models? Aru Maekawa et.al. 2403.05065v1 link
2024-03-07 Analysis of Systems' Performance in Natural Language Processing Competitions Sergio Nava-Muñoz et.al. 2403.04693v1 null
2024-03-07 Classist Tools: Social Class Correlates with Performance in NLP Amanda Cercas Curry et.al. 2403.04445v1 null
2024-03-07 Advancing Biomedical Text Mining with Community Challenges Hui Zong et.al. 2403.04261v1 null
2024-03-06 Enhancing Instructional Quality: Leveraging Computer-Assisted Textual Analysis to Generate In-Depth Insights from Educational Artifacts Zewei Tian et.al. 2403.03920v1 null
2024-03-06 Impoverished Language Technology: The Lack of (Social) Class in NLP Amanda Cercas Curry et.al. 2403.03874v1 null
2024-03-06 German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset Laura Mascarell et.al. 2403.03750v1 link
2024-03-06 Probabilistic Topic Modelling with Transformer Representations Arik Reuter et.al. 2403.03737v1 link
2024-03-06 Enhancing ASD detection accuracy: a combined approach of machine learning and deep learning models with natural language processing Sergio Rubio-Martín et.al. 2403.03581v1 null
2024-03-06 Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem Yuhong Sun et.al. 2403.03558v1 link
2024-03-05 TTPXHunter: Actionable Threat Intelligence Extraction as TTPs form Finished Cyber Threat Reports Nanda Rani et.al. 2403.03267v1 null
2024-03-05 Detecting Concrete Visual Tokens for Multimodal Machine Translation Braeden Bowen et.al. 2403.03075v1 null
2024-03-05 Data Augmentation using LLMs: Data Perspectives, Learning Paradigms and Challenges Bosheng Ding et.al. 2403.02990v1 null
2024-03-05 A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching Dong Yao et.al. 2403.02975v1 null
2024-03-05 SimuCourt: Building Judicial Decision-Making Agents with Real-world Judgement Documents Zhitao He et.al. 2403.02959v1 link
2024-03-05 A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods Hanlei Jin et.al. 2403.02901v1 null
2024-03-05 Quantum Mixed-State Self-Attention Network Fu Chen et.al. 2403.02871v1 null
2024-03-05 Emerging Synergies Between Large Language Models and Machine Learning in Ecommerce Recommendations Xiaonan Xu et.al. 2403.02760v1 null
2024-03-05 Causal Prompting: Debiasing Large Language Model Prompting based on Front-Door Adjustment Congzhi Zhang et.al. 2403.02738v1 null
2024-03-05 Privacy-Aware Semantic Cache for Large Language Models Waris Gill et.al. 2403.02694v1 null
2024-03-04 A Tutorial on the Pretrain-Finetune Paradigm for Natural Language Processing Yu Wang et.al. 2403.02504v1 null
2024-03-02 LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems Tasnim Ahmed et.al. 2403.01342v1 null
2024-03-02 VNLP: Turkish NLP Package Meliksah Turker et.al. 2403.01309v1 null
2024-03-02 VBART: The Turkish LLM Meliksah Turker et.al. 2403.01308v1 null
2024-03-02 IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact Ruikang Liu et.al. 2403.01241v1 null
2024-03-02 Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions Flor Miriam Plaza-del-Arco et.al. 2403.01222v1 link
2024-03-02 Evaluating Large Language Models as Virtual Annotators for Time-series Physical Sensing Data Aritra Hota et.al. 2403.01133v1 null
2024-03-01 Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning Christian Raymond et.al. 2403.00865v1 link
2024-03-01 Beyond Single-Model Views for Deep Learning: Optimization versus Generalizability of Stochastic Optimization Algorithms Toki Tahmid Inan et.al. 2403.00574v1 null
2024-03-01 Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese Yuqi Chen et.al. 2403.00509v1 null
2024-03-01 Gender Bias in Large Language Models across Multiple Languages Jinman Zhao et.al. 2403.00277v1 null
2024-02-29 Accelerating materials discovery for polymer solar cells: Data-driven insights enabled by natural language processing Pranav Shetty et.al. 2402.19462v1 link
2024-02-29 Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines Lijia Ma et.al. 2402.19421v1 null
2024-02-29 Here's a Free Lunch: Sanitizing Backdoored Models with Model Merge Ansh Arora et.al. 2402.19334v1 null
2024-02-29 Improving Legal Judgement Prediction in Romanian with Long Text Encoders Mihai Masala et.al. 2402.19170v1 null
2024-02-29 Beyond Language Models: Byte Models are Digital World Simulators Shangda Wu et.al. 2402.19155v1 null
2024-02-29 Enhancing Steganographic Text Extraction: Evaluating the Impact of NLP Models on Accuracy and Semantic Coherence Mingyang Li et.al. 2402.18849v1 null
2024-02-29 MPAT: Building Robust Deep Neural Networks against Textual Adversarial Attacks Fangyuan Zhang et.al. 2402.18792v1 null
2024-02-28 Learning to Compress Prompt in Natural Language Formats Yu-Neng Chuang et.al. 2402.18700v1 null
2024-02-28 Large Language Models and Games: A Survey and Roadmap Roberto Gallotta et.al. 2402.18659v1 null
2024-02-28 Tokenization Is More Than Compression Craig W. Schmidt et.al. 2402.18376v1 null
2024-02-28 Towards Better Understanding of Contrastive Sentence Representation Learning: A Unified Paradigm for Gradient Mingxin Li et.al. 2402.18281v1 null
2024-02-28 Challenges in Pre-Training Graph Neural Networks for Context-Based Fake News Detection: An Evaluation of Current Strategies and Resource Limitations Gregor Donabauer et.al. 2402.18179v1 link
2024-02-28 Learning Intrinsic Dimension via Information Bottleneck for Explainable Aspect-based Sentiment Analysis Zhenxiao Cheng et.al. 2402.18145v1 null
2024-02-28 Saving the legacy of Hero Ibash: Evaluating Four Language Models for Aminoacian Yunze Xiao et.al. 2402.18121v1 null
2024-02-28 Using Text Embeddings for Deductive Qualitative Research at Scale in Physics Education Tor Ole B. Odden et.al. 2402.18087v1 link
2024-02-28 Data augmentation method for modeling health records with applications to clopidogrel treatment failure detection Sunwoong Choi et.al. 2402.18046v1 null
2024-02-28 Crisis talk: analysis of the public debate around the energy crisis and cost of living Rrubaa Panchendrarajan et.al. 2402.18043v1 null
2024-02-28 Datasets for Large Language Models: A Comprehensive Survey Yang Liu et.al. 2402.18041v1 link
2024-02-28 Gradient-Free Adaptive Global Pruning for Pre-trained Language Models Guangji Bai et.al. 2402.17946v1 link
2024-02-27 Navigator: A Decentralized Scheduler for Latency-Sensitive ML Workflows Yuting Yang et.al. 2402.17652v1 null
2024-02-27 From Text Segmentation to Smart Chaptering: A Novel Benchmark for Structuring Video Transcriptions Fabian Retkowski et.al. 2402.17633v1 null
2024-02-27 Neural Automated Writing Evaluation with Corrective Feedback Izia Xiaoxiao Wang et.al. 2402.17613v1 null
2024-02-27 Extreme Miscalibration and the Illusion of Adversarial Robustness Vyas Raina et.al. 2402.17509v1 null
2024-02-27 Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey Dinh-Viet-Toan Le et.al. 2402.17467v1 link
2024-02-27 Benchmarking GPT-4 on Algorithmic Problems: A Systematic Evaluation of Prompting Strategies Flavio Petruzzellis et.al. 2402.17396v1 null
2024-02-27 FairBelief - Assessing Harmful Beliefs in Language Models Mattia Setzu et.al. 2402.17389v1 null
2024-02-27 Curriculum Learning Meets Directed Acyclic Graph for Multimodal Emotion Recognition Cam-Van Thi Nguyen et.al. 2402.17269v1 link
2024-02-27 Deep Learning-Based Speech and Vision Synthesis to Improve Phishing Attack Detection through a Multi-layer Adaptive Framework Tosin Ige et.al. 2402.17249v1 null
2024-02-27 Does Negative Sampling Matter? A Review with Insights into its Theory and Applications Zhen Yang et.al. 2402.17238v1 null
2024-02-26 ProLLaMA: A Protein Large Language Model for Multi-Task Protein Language Processing Liuzhenghao Lv et.al. 2402.16445v1 link
2024-02-26 MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property Shiwen Ni et.al. 2402.16389v1 link
2024-02-25 From Text to Transformation: A Comprehensive Review of Large Language Models' Versatility Pravneet Kaur et.al. 2402.16142v1 null
2024-02-25 Deep Learning Approaches for Improving Question Answering Systems in Hepatocellular Carcinoma Research Shuning Huo et.al. 2402.16038v1 null
2024-02-25 $C^3$ : Confidence Calibration Model Cascade for Inference-Efficient Cross-Lingual Natural Language Understanding Taixi Lu et.al. 2402.15991v1 null
2024-02-24 SportQA: A Benchmark for Sports Understanding in Large Language Models Haotian Xia et.al. 2402.15862v1 null
2024-02-24 Prompt Perturbation Consistency Learning for Robust Language Models Yao Qiang et.al. 2402.15833v1 null
2024-02-24 Linguistic Intelligence in Large Language Models for Telecommunications Tasnim Ahmed et.al. 2402.15818v1 null
2024-02-23 Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models Yanzheng Xiang et.al. 2402.15637v1 null
2024-02-23 Transformers are Expressive, But Are They Expressive Enough for Regression? Swaroop Nath et.al. 2402.15478v1 link
2024-02-23 United We Pretrain, Divided We Fail! Representation Learning for Time Series by Pretraining on 75 Datasets at Once Maurice Kraus et.al. 2402.15404v1 null
2024-02-23 Fine-Grained Detoxification via Instance-Level Prefixes for Large Language Models Xin Yi et.al. 2402.15202v1 null
2024-02-23 Improving Sentence Embeddings with an Automatically Generated NLI Dataset Soma Sato et.al. 2402.15132v1 null
2024-02-23 Descripción automática de secciones delgadas de rocas: una aplicación Web Stalyn Paucar et.al. 2402.15039v1 null
2024-02-22 Ar-Spider: Text-to-SQL in Arabic Saleh Almohaimeed et.al. 2402.15012v1 null
2024-02-22 How Important Is Tokenization in French Medical Masked Language Models? Yanis Labrak et.al. 2402.15010v1 null
2024-02-22 LLMs with Industrial Lens: Deciphering the Challenges and Prospects -- A Survey Ashok Urlana et.al. 2402.14558v1 null
2024-02-22 Should We Respect LLMs? A Cross-Lingual Study on the Influence of Prompt Politeness on LLM Performance Ziqi Yin et.al. 2402.14531v1 null
2024-02-22 Malaysian English News Decoded: A Linguistic Resource for Named Entity and Relation Extraction Mohan Raj Chanthran et.al. 2402.14521v1 link
2024-02-22 SpanSeq: Similarity-based sequence data splitting method for improved development and assessment of deep learning projects Alfred Ferrer Florensa et.al. 2402.14482v1 link
2024-02-22 Novi jezički modeli za srpski jezik Mihailo Škorić et.al. 2402.14379v1 null
2024-02-22 Vision-Language Navigation with Embodied Intelligence: A Survey Peng Gao et.al. 2402.14304v1 null
2024-02-22 Mitigating Biases of Large Language Models in Stance Detection with Calibration Ang Li et.al. 2402.14296v1 null
2024-02-22 Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education Rui Yang et.al. 2402.14293v1 link
2024-02-22 Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding Yu-Qi Yang et.al. 2402.14215v1 link
2024-02-22 Content Conditional Debiasing for Fair Text Embedding Wenlong Deng et.al. 2402.14208v1 null
2024-02-21 Beyond Probabilities: Unveiling the Misalignment in Evaluating Large Language Models Chenyang Lyu et.al. 2402.13887v1 null
2024-02-21 Using Large Language Models for Natural Language Processing Tasks in Requirements Engineering: A Systematic Guideline Andreas Vogelsang et.al. 2402.13823v1 null
2024-02-21 Beyond Hate Speech: NLP's Challenges and Opportunities in Uncovering Dehumanizing Language Hezhao Zhang et.al. 2402.13818v1 null
2024-02-21 From Text to CQL: Bridging Natural Language and Corpus Search Engine Luming Lu et.al. 2402.13740v1 null
2024-02-21 RESTRuler: Towards Automatically Identifying Violations of RESTful Design Rules in Web APIs Justus Bogner et.al. 2402.13710v1 null
2024-02-21 CMNER: A Chinese Multimodal NER Dataset based on Social Media Yuanze Ji et.al. 2402.13693v1 link
2024-02-21 An Augmented Lagrangian Method for Training Recurrent Neural Networks Yue Wang et.al. 2402.13687v1 null
2024-02-21 Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning Zhaorui Yang et.al. 2402.13669v1 link
2024-02-21 Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions Lei Pan et.al. 2402.13647v1 null
2024-02-21 Overview of the VLSP 2023 -- ComOM Shared Task: A Data Challenge for Comparative Opinion Mining from Vietnamese Product Reviews Hoang-Quynh Le et.al. 2402.13613v1 null
2024-02-20 CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models Yizhi LI et.al. 2402.13109v1 null
2024-02-20 Few shot clinical entity recognition in three languages: Masked language models outperform LLM prompting Marco Naguib et.al. 2402.12801v1 null
2024-02-19 Predicting trucking accidents with truck drivers 'safety climate perception across companies: A transfer learning approach Kailai Sun et.al. 2402.12417v1 null
2024-02-19 Analysis of Persian News Agencies on Instagram, A Words Co-occurrence Graph-based Approach Mohammad Heydari et.al. 2402.12272v1 null
2024-02-19 Synthetic location trajectory generation using categorical diffusion models Simon Dirmeier et.al. 2402.12242v1 link
2024-02-19 Language Model Adaptation to Specialized Domains through Selective Masking based on Genre and Topical Characteristics Anas Belfathi et.al. 2402.12036v1 link
2024-02-19 Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space Zongru Wu et.al. 2402.12026v1 null
2024-02-19 Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing? Marco Gaido et.al. 2402.12025v1 null
2024-02-19 What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects Verena Blaschke et.al. 2402.11968v1 null
2024-02-19 DB-LLM: Accurate Dual-Binarization for Efficient LLMs Hong Chen et.al. 2402.11960v1 null
2024-02-19 AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization Jiyao Li et.al. 2402.11940v1 null
2024-02-19 Semantic Textual Similarity Assessment in Chest X-ray Reports Using a Domain-Specific Cosine-Based Metric Sayeh Gholipour Picha et.al. 2402.11908v1 link
2024-02-19 InMD-X: Large Language Models for Internal Medicine Doctors Hansle Gwon et.al. 2402.11883v1 null
2024-02-16 Construction of a Syntactic Analysis Map for Yi Shui School through Text Mining and Natural Language Processing Research Hanqing Zhao et.al. 2402.10743v1 null
2024-02-16 Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm Yuanzhen Xie et.al. 2402.10671v1 link
2024-02-16 Fine Tuning Named Entity Extraction Models for the Fantasy Domain Aravinth Sivaganeshan et.al. 2402.10662v1 null
2024-02-16 Linear Transformers with Learnable Kernel Functions are Better In-Context Models Yaroslav Aksenov et.al. 2402.10644v1 link
2024-02-16 BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation Dayou Du et.al. 2402.10631v1 link
2024-02-16 Zero-shot sampling of adversarial entities in biomedical question answering R. Patrick Xian et.al. 2402.10527v1 null
2024-02-16 Parametric Augmentation for Time Series Contrastive Learning Xu Zheng et.al. 2402.10434v1 link
2024-02-16 Understanding In-Context Learning with a Pelican Soup Framework Ting-Rui Chiang et.al. 2402.10424v1 null
2024-02-16 LogELECTRA: Self-supervised Anomaly Detection for Unstructured Logs Yuuki Yamanaka et.al. 2402.10397v1 null
2024-02-15 Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention Romain Ilbert et.al. 2402.10198v1 link
2024-02-15 Reusing Softmax Hardware Unit for GELU Computation in Transformers Christodoulos Peltekis et.al. 2402.10118v1 link
2024-02-15 Balancing the Causal Effects in Class-Incremental Learning Junhao Zheng et.al. 2402.10063v1 null
2024-02-15 Fast Vocabulary Transfer for Language Model Compression Leonidas Gee et.al. 2402.09977v1 null
2024-02-15 Multi-Word Tokenization for Sequence Compression Leonidas Gee et.al. 2402.09949v1 link
2024-02-15 BUSTER: a "BUSiness Transaction Entity Recognition" dataset Andrea Zugarini et.al. 2402.09916v1 null
2024-02-15 Camouflage is all you need: Evaluating and Enhancing Language Model Robustness Against Camouflage Adversarial Attacks Álvaro Huertas-García et.al. 2402.09874v1 null
2024-02-15 Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Quentin Gallouédec et.al. 2402.09844v1 link
2024-02-15 All in One and One for All: A Simple yet Effective Method towards Cross-domain Graph Pretraining Haihong Zhao et.al. 2402.09834v1 null
2024-02-14 LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset Botao Yu et.al. 2402.09391v1 link
2024-02-14 Leveraging Large Language Models for Enhanced NLP Task Performance through Knowledge Distillation and Optimized Training Strategies Yining Huang et.al. 2402.09282v1 null
2024-02-14 Personalized Large Language Models Stanisław Woźniak et.al. 2402.09269v1 null
2024-02-14 Advancing NLP Models with Strategic Text Augmentation: A Comprehensive Study of Augmentation Methods and Curriculum Strategies Himmet Toprak Kesgin et.al. 2402.09141v1 null
2024-02-14 SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks Jiwon Song et.al. 2402.09025v1 link
2024-02-14 Research and application of Transformer based anomaly detection model: A literature review Mingrui Ma et.al. 2402.08975v1 null
2024-02-13 BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation Omid Nejati Manzari et.al. 2402.08793v1 link
2024-02-13 COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability Xingang Guo et.al. 2402.08679v1 link
2024-02-13 Online Foundation Model Selection in Robotics Po-han Li et.al. 2402.08570v1 null
2024-02-13 Intriguing Differences Between Zero-Shot and Systematic Evaluations of Vision-Language Transformer Models Shaeke Salman et.al. 2402.08473v1 null
2024-02-13 Generating Java Methods: An Empirical Assessment of Four AI-Based Code Assistants Vincenzo Corso et.al. 2402.08431v1 null
2024-02-13 Visual Question Answering Instruction: Unlocking Multimodal Large Language Model To Domain-Specific Visual Multitasks Jusung Lee et.al. 2402.08360v1 null
2024-02-13 Explicit References to Social Values in Fairy Tales: A Comparison between Three European Cultures Alba Morollon Diaz-Faes et.al. 2402.08318v1 link
2024-02-13 QuApprox: A Framework for Benchmarking the Approximability of Variational Quantum Circuit Jinyang Li et.al. 2402.08261v1 null
2024-02-13 A survey of recent methods for addressing AI fairness and bias in biomedicine Yifan Yang et.al. 2402.08250v1 null
2024-02-12 Enhancing Amharic-LLaMA: Integrating Task Specific and Generative Datasets Israel Abebe Azime et.al. 2402.08015v1 null
2024-02-12 Empowering Federated Learning for Massive Models with NVIDIA FLARE Holger R. Roth et.al. 2402.07792v1 null
2024-02-12 Text Detoxification as Style Transfer in English and Hindi Sourabrata Mukherjee et.al. 2402.07767v1 null
2024-02-12 AraSpider: Democratizing Arabic-to-SQL Ahmed Heakl et.al. 2402.07448v1 link
2024-02-12 Dólares or Dollars? Unraveling the Bilingual Prowess of Financial LLMs Between Spanish and English Xiao Zhang et.al. 2402.07405v1 link
2024-02-12 Beyond the Headlines: Understanding Sentiments and Morals Impacting Female Employment in Spain Oscar Araque et.al. 2402.07339v1 null
2024-02-11 Differentially Private Training of Mixture of Experts Models Pierre Tholoniat et.al. 2402.07334v1 null
2024-02-11 Insights into Natural Language Database Query Errors: From Attention Misalignment to User Handling Strategies Zheng Ning et.al. 2402.07304v1 null
2024-02-11 TransGPT: Multi-modal Generative Pre-trained Transformer for Transportation Peng Wang et.al. 2402.07233v1 null
2024-02-11 Learning by Watching: A Review of Video-based Learning Approaches for Robot Manipulation Chrisantus Eze et.al. 2402.07127v1 null
2024-02-10 In-Context Data Distillation with TabPFN Junwei Ma et.al. 2402.06971v1 null
2024-02-09 Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Shivalika Singh et.al. 2402.06619v1 null
2024-02-09 FaBERT: Pre-training BERT on Persian Blogs Mostafa Masumi et.al. 2402.06617v1 null
2024-02-09 TIC: Translate-Infer-Compile for accurate 'text to plan' using LLMs and logical intermediate representations Sudhir Agarwal et.al. 2402.06608v1 null
2024-02-09 G-SciEdBERT: A Contextualized LLM for Science Assessment Tasks in German Ehsan Latif et.al. 2402.06584v1 null
2024-02-09 A Unified Causal View of Instruction Tuning Lu Chen et.al. 2402.06220v1 null
2024-02-08 On the Convergence of Zeroth-Order Federated Tuning in Large Language Models Zhenqing Ling et.al. 2402.05926v1 null
2024-02-08 FAQ-Gen: An automated system to generate domain-specific FAQs to aid content comprehension Sahil Kale et.al. 2402.05812v1 null
2024-02-08 Efficient Models for the Detection of Hate, Abuse and Profanity Christoph Tillmann et.al. 2402.05624v1 null
2024-02-08 Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings Elena Senger et.al. 2402.05617v1 null
2024-02-08 Benchmarking Large Language Models on Communicative Medical Coaching: a Novel System and Dataset Hengguan Huang et.al. 2402.05547v1 null
2024-02-08 GPT-4 Generated Narratives of Life Events using a Structured Narrative Prompt: A Validation Study Christopher J. Lynch et.al. 2402.05435v1 null
2024-02-07 PAC Learnability under Explanation-Preserving Graph Perturbations Xu Zheng et.al. 2402.05039v1 null
2024-02-07 An Enhanced Prompt-Based LLM Reasoning Scheme via Knowledge Graph-Integrated Collaboration Yihao Li et.al. 2402.04978v1 null
2024-02-07 Chatbots in Knowledge-Intensive Contexts: Comparing Intent and LLM-Based Systems Samuel Kernan Freire et.al. 2402.04955v1 null
2024-02-07 SPARQL Generation: an analysis on fine-tuning OpenLLaMA for Question Answering over a Life Science Knowledge Graph Julio C. Rangel et.al. 2402.04627v1 link
2024-02-07 Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models Chirag Agarwal et.al. 2402.04614v1 null
2024-02-07 RA-Rec: An Efficient ID Representation Alignment Framework for LLM-based Recommendation Xiaohan Yu et.al. 2402.04527v1 null
2024-02-07 Developments in Sheaf-Theoretic Models of Natural Language Ambiguities Kin Ian Lo et.al. 2402.04505v1 null
2024-02-06 Adaptive Inference: Theoretical Limits and Unexplored Opportunities Soheil Hor et.al. 2402.04359v1 null
2024-02-06 LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text Dor Bernsohn et.al. 2402.04335v1 link
2024-02-06 Explaining Autonomy: Enhancing Human-Robot Interaction through Explanation Generation with Large Language Models David Sobrín-Hidalgo et.al. 2402.04206v1 null
2024-02-06 Scientific Language Modeling: A Quantitative Review of Large Language Models in Molecular Science Pengfei Liu et.al. 2402.04119v1 link
2024-02-06 The Use of a Large Language Model for Cyberbullying Detection Bayode Ogunleye et.al. 2402.04088v1 null
2024-02-06 Systematic Biases in LLM Simulations of Debates Amir Taubenfeld et.al. 2402.04049v1 null
2024-02-06 AlbNews: A Corpus of Headlines for Topic Modeling in Albanian Erion Çano et.al. 2402.04028v1 link
2024-02-06 Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs Simone Balloccu et.al. 2402.03927v1 null
2024-02-06 Intensive Vision-guided Network for Radiology Report Generation Fudan Zheng et.al. 2402.03754v1 null
2024-02-06 Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies Zhixuan Chu et.al. 2402.03628v1 null
2024-02-06 Partially Recentralization Softmax Loss for Vision-Language Models Robustness Hao Wang et.al. 2402.03627v1 null
2024-02-05 Is Mamba Capable of In-Context Learning? Riccardo Grazzi et.al. 2402.03170v1 link
2024-02-05 EEVEE: An Easy Annotation Tool for Natural Language Processing Axel Sorensen et.al. 2402.02864v1 null
2024-02-05 Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate Can Jin et.al. 2402.02769v1 link
2024-02-04 It's how you do things that matters": Attending to Process to Better Serve Indigenous Communities with Language Technologies Ned Cooper et.al. 2402.02639v1 null
2024-02-04 Predicting Machine Translation Performance on Low-Resource Languages: The Role of Domain Similarity Eric Khiu et.al. 2402.02633v1 null
2024-02-04 DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging Matteo Pagliardini et.al. 2402.02622v1 null
2024-02-04 ClipFormer: Key-Value Clipping of Transformers on Memristive Crossbars for Write Noise Mitigation Abhiroop Bhattacharjee et.al. 2402.02586v1 null
2024-02-04 A Quantitative Discourse Analysis of Asian Workers in the US Historical Newspapers Jaihyun Park et.al. 2402.02572v1 null
2024-02-04 Integration of cognitive tasks into artificial general intelligence test for large models Youzhi Qu et.al. 2402.02547v1 null
2024-02-04 Absolute convergence and error thresholds in non-active adaptive sampling Manuel Vilares Ferro et.al. 2402.02522v1 null
2024-02-02 Code-Switched Language Identification is Harder Than You Think Laurie Burchell et.al. 2402.01505v1 link
2024-02-02 From Words to Molecules: A Survey of Large Language Models in Chemistry Chang Liao et.al. 2402.01439v1 null
2024-02-02 Beyond the Answers: Reviewing the Rationality of Multiple Choice Question Answering for the Evaluation of Large Language Models Haochun Wang et.al. 2402.01349v1 null
2024-02-02 Efficient Prompt Caching via Embedding Similarity Hanlin Zhu et.al. 2402.01173v1 null
2024-02-02 A Survey for Foundation Models in Autonomous Driving Haoxiang Gao et.al. 2402.01105v1 null
2024-02-01 Domain-Independent Deception: A New Taxonomy and Linguistic Analysis Rakesh M. Verma et.al. 2402.01019v1 null
2024-02-01 HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent Weijie Xu et.al. 2402.01018v1 link
2024-02-01 An Information-Theoretic Approach to Analyze NLP Classification Tasks Luran Wang et.al. 2402.00978v1 link
2024-02-01 SPARQL Generation with Entity Pre-trained GPT for KG Question Answering Diego Bustamante et.al. 2402.00969v1 link
2024-02-01 Can Large Language Models Understand Context? Yilun Zhu et.al. 2402.00858v1 null
2024-02-01 ReAGent: Towards A Model-agnostic Feature Attribution Method for Generative Language Models Zhixue Zhao et.al. 2402.00794v1 link
2024-02-01 Neural Policy Style Transfer Raul Fernandez-Fernandez et.al. 2402.00677v1 null
2024-02-01 SA-MDKIF: A Scalable and Adaptable Medical Domain Knowledge Injection Framework for Large Language Models Tianhan Xu et.al. 2402.00474v1 null
2024-01-31 Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research Luca Soldaini et.al. 2402.00159v1 link
2024-01-31 Entity Linking in the Job Market Domain Mike Zhang et.al. 2401.17979v1 link
2024-01-31 SNNLP: Energy-Efficient Natural Language Processing Using Spiking Neural Networks R. Alexander Knipper et.al. 2401.17911v1 link
2024-01-31 Employing Label Models on ChatGPT Answers Improves Legal Text Entailment Performance Chau Nguyen et.al. 2401.17897v1 null
2024-01-31 Document Structure in Long Document Transformers Jan Buchmann et.al. 2401.17658v1 null
2024-01-31 Assertion Detection Large Language Model In-context Learning LoRA Fine-tuning Yuelyu Ji et.al. 2401.17602v1 link
2024-01-31 Scavenging Hyena: Distilling Transformers into Long Convolution Models Tokiniaina Raharison Ralambomihanta et.al. 2401.17574v1 null
2024-01-31 Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels Negar Arabzadeh et.al. 2401.17543v1 null
2024-01-30 Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks Savas Yildirim et.al. 2401.17396v1 null
2024-01-30 Gazetteer-Enhanced Bangla Named Entity Recognition with BanglaBERT Semantic Embeddings K-Means-Infused CRF Model Niloy Farhan et.al. 2401.17206v1 link
2024-01-30 Large Language Model Evaluation via Matrix Entropy Lai Wei et.al. 2401.17139v1 link
2024-01-30 SAL-PIM: A Subarray-level Processing-in-Memory Architecture with LUT-based Linear Interpolation for Transformer-based Text Generation Wontak Han et.al. 2401.17005v1 null
2024-01-30 SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics Takaaki Saeki et.al. 2401.16812v1 link
2024-01-30 Engineering A Large Language Model From Scratch Abiodun Finbarrs Oketunji et.al. 2401.16736v1 null
2024-01-30 TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese Nicholas Kluge Corrêa et.al. 2401.16640v1 link
2024-01-30 Breaking Free Transformer Models: Task-specific Context Attribution Promises Improved Generalizability Without Fine-tuning Pre-trained LLMs Stepan Tytarenko et.al. 2401.16638v1 link
2024-01-29 Dynamic Electro-Optic Analog Memory for Neuromorphic Photonic Computing Sean Lam et.al. 2401.16515v1 null
2024-01-29 ViLexNorm: A Lexical Normalization Corpus for Vietnamese Social Media Text Thanh-Nhi Nguyen et.al. 2401.16403v1 link
2024-01-29 CO2: Efficient Distributed Training with Full Communication-Computation Overlap Weigao Sun et.al. 2401.16265v1 link
2024-01-29 Towards Red Teaming in Multimodal and Multilingual Translation Christophe Ropers et.al. 2401.16247v1 null
2024-01-29 A Survey on Structure-Preserving Graph Transformers Van Thuy Hoang et.al. 2401.16176v1 null
2024-01-29 E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models Jinchang Hou et.al. 2401.15927v1 link
2024-01-29 Unrestricted Error-Type Codebook Generation for Error Correction Code in DNA Storage Inspired by NLP Yi Lu et.al. 2401.15915v1 link
2024-01-29 DrBERT: Unveiling the Potential of Masked Language Modeling Decoder in BERT pretraining Wen Liang et.al. 2401.15861v1 null
2024-01-28 Fine-Tuned Large Language Models for Symptom Recognition from Spanish Clinical Text Mai A. Shaaban et.al. 2401.15780v1 null
2024-01-27 FloodLense: A Framework for ChatGPT-based Real-time Flood Detection Pranath Reddy Kumbam et.al. 2401.15501v1 null
2024-01-27 A Survey on Data Augmentation in Large Model Era Yue Zhou et.al. 2401.15422v1 link
2024-01-26 SliceGPT: Compress Large Language Models by Deleting Rows and Columns Saleh Ashkboos et.al. 2401.15024v1 link
2024-01-26 Memory-Inspired Temporal Prompt Interaction for Text-Image Classification Xinyao Yu et.al. 2401.14856v1 null
2024-01-26 Adaptive Point Transformer Alessandro Baiocchi et.al. 2401.14845v1 null
2024-01-26 ChemDFM: Dialogue Foundation Model for Chemistry Zihan Zhao et.al. 2401.14818v1 null
2024-01-26 Large Language Model Adaptation for Financial Sentiment Analysis Pau Rodriguez Inserte et.al. 2401.14777v1 null
2024-01-26 Topology-Aware Exploration of Energy-Based Models Equilibrium: Toric QC-LDPC Codes and Hyperbolic MET QC-LDPC Codes Vasiliy Usatyuk et.al. 2401.14749v1 null
2024-01-26 Listening to the Voices: Describing Ethical Caveats of Conversational User Interfaces According to Experts and Frequent Users Thomas Mildner et.al. 2401.14746v1 null
2024-01-26 An Empirical Investigation of Domain Adaptation Ability for Chinese Spelling Check Models Xi Wang et.al. 2401.14630v1 null
2024-01-25 TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation Gökçe Uludoğan et.al. 2401.14373v1 link
2024-01-25 Topologies of Reasoning: Demystifying Chains, Trees, and Graphs of Thoughts Maciej Besta et.al. 2401.14295v1 null
2024-01-25 Improving Natural Language Capability of Code Large Language Model Wei Li et.al. 2401.14242v1 link
2024-01-25 Parameter-Efficient Conversational Recommender System as a Language Processing Task Mathieu Ravaut et.al. 2401.14194v1 link
2024-01-25 How Can Large Language Models Understand Spatial-Temporal Data? Lei Liu et.al. 2401.14192v1 null
2024-01-25 Convolutional Neural Networks can achieve binary bail judgement classification Amit Barman et.al. 2401.14135v1 null
2024-01-25 (Chat)GPT v BERT: Dawn of Justice for Semantic Change Detection Francesco Periti et.al. 2401.14040v1 link
2024-01-25 Accelerating Retrieval-Augmented Language Model Serving with Speculation Zhihao Zhang et.al. 2401.14021v1 null
2024-01-25 ChatGPT and Human Synergy in Black-Box Testing: A Comparative Analysis Hiroyuki Kirinuki et.al. 2401.13924v1 null
2024-01-24 Investigating the Efficacy of Large Language Models for Code Clone Detection Mohamad Khajezade et.al. 2401.13802v1 link
2024-01-24 CNN architecture extraction on edge GPU Peter Horvath et.al. 2401.13575v1 null
2024-01-24 SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation Zhaohu Xing et.al. 2401.13560v1 link
2024-01-24 Research about the Ability of LLM in the Tamper-Detection Area Xinyu Yang et.al. 2401.13504v1 null
2024-01-24 Text Categorization Can Enhance Domain-Agnostic Stopword Extraction Houcemeddine Turki et.al. 2401.13398v1 null
2024-01-24 MaLA-500: Massive Language Adaptation of Large Language Models Peiqin Lin et.al. 2401.13303v1 null
2024-01-24 SpecLLM: Exploring Generation and Review of VLSI Design Specification with Large Language Model Mengming Li et.al. 2401.13266v1 link
2024-01-24 From Random to Informed Data Selection: A Diversity-Based Approach to Optimize Human Annotation and Few-Shot Learning Alexandre Alcoforado et.al. 2401.13229v1 null
2024-01-23 Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains Yu Zhang et.al. 2401.13129v1 link
2024-01-23 Free Form Medical Visual Question Answering in Radiology Abhishek Narayanan et.al. 2401.13081v1 null
2024-01-23 From Understanding to Utilization: A Survey on Explainability for Large Language Models Haoyan Luo et.al. 2401.12874v1 null
2024-01-23 KAM-CoT: Knowledge Augmented Multimodal Chain-of-Thoughts Reasoning Debjyoti Mondal et.al. 2401.12863v1 null
2024-01-23 Benchmarking LLMs via Uncertainty Quantification Fanghua Ye et.al. 2401.12794v1 link
2024-01-23 A Comprehensive View of the Biases of Toxicity and Sentiment Analysis Methods Towards Utterances with African American English Expressions Guilherme H. Resende et.al. 2401.12720v1 null
2024-01-23 From Numbers to Words: Multi-Modal Bankruptcy Prediction Using the ECL Dataset Henri Arno et.al. 2401.12652v1 link
2024-01-23 Key Information Retrieval to Classify the Unstructured Data Content of Preferential Trade Agreements Jiahui Zhao et.al. 2401.12520v1 null
2024-01-23 Digital cloning of online social networks for language-sensitive agent-based modeling of misinformation spread Prateek Puri et.al. 2401.12509v1 null
2024-01-23 Comparing Human-Centered Language Modeling: Is it Better to Model Groups, Individual Traits, or Both? Nikita Soni et.al. 2401.12492v1 null
2024-01-23 Assessing and Understanding Creativity in Large Language Models Yunpu Zhao et.al. 2401.12491v1 null
2024-01-23 Contrastive Learning in Distilled Models Valerie Lim et.al. 2401.12472v1 link
2024-01-22 Temporal Blind Spots in Large Language Models Jonas Wallat et.al. 2401.12078v1 link
2024-01-22 NLP-based Relation Extraction Methods in RE Quim Motger et.al. 2401.12075v1 null
2024-01-22 Cross-lingual Transfer Learning for Javanese Dependency Parsing Fadli Aulawi Al Ghiffari et.al. 2401.12072v1 null
2024-01-22 Synergizing Machine Learning & Symbolic Methods: A Survey on Hybrid Approaches to Natural Language Processing Rrubaa Panchendrarajan et.al. 2401.11972v1 null
2024-01-22 Knowledge Navigation: Inferring the Interlocking Map of Knowledge from Research Trajectories Shibing Xiang et.al. 2401.11742v1 link
2024-01-22 Revolutionizing Finance with LLMs: An Overview of Applications and Insights Huaqin Zhao et.al. 2401.11641v1 null
2024-01-21 Simple Domain Adaptation for Sparse Retrievers Mathias Vast et.al. 2401.11509v1 null
2024-01-21 Integration of Large Language Models in Control of EHD Pumps for Precise Color Synthesis Yanhong Peng et.al. 2401.11500v1 null
2024-01-21 Towards Better Inclusivity: A Diverse Tweet Corpus of English Varieties Nhi Pham et.al. 2401.11487v1 link
2024-01-21 AttentionLego: An Open-Source Building Block For Spatially-Scalable Large Language Model Accelerator With Processing-In-Memory Technology Rongqing Cong et.al. 2401.11459v1 null
2024-01-19 Advancements in eHealth Data Analytics through Natural Language Processing and Deep Learning Elena-Simona Apostol et.al. 2401.10850v1 null
2024-01-19 Data Augmentation for Traffic Classification Chao Wang et.al. 2401.10754v1 null
2024-01-19 Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models Mayank Agarwal et.al. 2401.10716v1 null
2024-01-19 Attentive Fusion: A Transformer-based Approach to Multimodal Hate Speech Detection Atanu Mandal et.al. 2401.10653v1 link
2024-01-19 The "Colonial Impulse" of Natural Language Processing: An Audit of Bengali Sentiment Analysis Tools and Their Identity-based Biases Dipto Das et.al. 2401.10535v1 null
2024-01-18 Learning High-Quality and General-Purpose Phrase Representations Lihu Chen et.al. 2401.10407v1 link
2024-01-18 Supervised Fine-tuning in turn Improves Visual Foundation Models Xiaohu Jiang et.al. 2401.10222v1 link
2024-01-18 Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap Xingyu Wu et.al. 2401.10034v1 null
2024-01-18 Framing Analysis of Health-Related Narratives: Conspiracy versus Mainstream Media Markus Reiter-Haas et.al. 2401.10030v1 null
2024-01-19 Better Explain Transformers by Illuminating Important Information Linxin Song et.al. 2401.09972v2 link
2024-01-18 A Survey on Hardware Accelerators for Large Language Models Christoforos Kachris et.al. 2401.09890v1 link
2024-01-18 Decades of Transformation: Evolution of the NASA Astrophysics Data System's Infrastructure Alberto Accomazzi et.al. 2401.09685v1 null
2024-01-17 Learning Shortcuts: On the Misleading Promise of NLU in Language Models Geetanjali Bihani et.al. 2401.09615v1 null
2024-01-17 BERTologyNavigator: Advanced Question Answering with BERT-based Semantics Shreya Rajpal et.al. 2401.09553v1 null
2024-01-17 Learning from Emotions, Demographic Information and Implicit User Feedback in Task-Oriented Document-Grounded Dialogues Dominic Petrak et.al. 2401.09248v1 link
2024-01-17 Dynamic Relation Transformer for Contextual Text Block Detection Jiawei Wang et.al. 2401.09232v1 null
2024-01-17 Narratives of Collective Action in YouTube's Discourse on Veganism Arianna Pera et.al. 2401.09210v1 link
2024-01-17 LLMs for Relational Reasoning: How Far are We? Zhiming Li et.al. 2401.09042v1 null
2024-01-16 EmoLLMs: A Series of Emotional Large Language Models and Annotation Tools for Comprehensive Affective Analysis Zhiwei Liu et.al. 2401.08508v1 link
2024-01-16 Content-Aware Tweet Location Inference using Quadtree Spatial Partitioning and Jaccard-Cosine Word Embedding Oluwaseun Ajao et.al. 2401.08506v1 null
2024-01-16 Machine Translation with Large Language Models: Prompt Engineering for Persian, English, and Russian Directions Nooshin Pourkamali et.al. 2401.08429v1 null
2024-01-16 Cross-lingual neural fuzzy matching for exploiting target-language monolingual corpora in computer-aided translation Miquel Esplà-Gomis et.al. 2401.08374v1 link
2024-01-16 Application of LLM Agents in Recruitment: A Novel Framework for Resume Screening Chengguang Gan et.al. 2401.08315v1 null
2024-01-15 The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey Saurav Pawar et.al. 2401.07872v1 null
2024-01-15 Quantum Transfer Learning for Acceptability Judgements Giuseppe Buonaiuto et.al. 2401.07777v1 null
2024-01-15 On the importance of Data Scale in Pretraining Arabic Language Models Abbas Ghaddar et.al. 2401.07760v1 link
2024-01-15 Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends Yunshi Lan et.al. 2401.07518v1 link
2024-01-15 Developing ChatGPT for Biology and Medicine: A Complete Review of Biomedical Question Answering Qing Li et.al. 2401.07510v1 null
2024-01-15 Graph database while computationally efficient filters out quickly the ESG integrated equities in investment management Partha Sen et.al. 2401.07483v1 null
2024-01-15 GWPT: A Green Word-Embedding-based POS Tagger Chengwei Wei et.al. 2401.07475v1 null
2024-01-15 Leveraging the power of transformers for guilt detection in text Abdul Gafar Manuel Meque et.al. 2401.07414v1 null
2024-01-12 Stylometry Analysis of Multi-authored Documents for Authorship and Author Style Change Detection Muhammad Tayyab Zamir et.al. 2401.06752v1 null
2024-01-12 Reframing Tax Law Entailment as Analogical Reasoning Xinrui Zou et.al. 2401.06715v1 null
2024-01-12 Cyborgs for strategic communication on social media Lynnette Hui Xian Ng et.al. 2401.06582v1 null
2024-01-12 INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning Yutao Zhu et.al. 2401.06532v1 link
2024-01-12 An investigation of structures responsible for gender bias in BERT and DistilBERT Thibaud Leteno et.al. 2401.06495v1 null
2024-01-12 Adapting Large Language Models for Document-Level Machine Translation Minghao Wu et.al. 2401.06468v1 null
2024-01-12 SamLP: A Customized Segment Anything Model for License Plate Detection Haoxuan Ding et.al. 2401.06374v1 link
2024-01-12 MuGI: Enhancing Information Retrieval through Multi-Text Generation Intergration with Large Language Models Le Zhang et.al. 2401.06311v1 link
2024-01-11 Axis Tour: Word Tour Determines the Order of Axes in ICA-transformed Embeddings Hiroaki Yamagiwa et.al. 2401.06112v1 link
2024-01-11 How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes Sabina Elkins et.al. 2401.05914v1 null
2024-01-11 Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems Tianyu Cui et.al. 2401.05778v1 null
2024-01-11 ConcEPT: Concept-Enhanced Pre-Training for Language Models Xintao Wang et.al. 2401.05669v1 null
2024-01-11 Natural Language Processing for Dialects of a Language: A Survey Aditya Joshi et.al. 2401.05632v1 null
2024-01-10 TrustLLM: Trustworthiness in Large Language Models Lichao Sun et.al. 2401.05561v1 link
2024-01-10 CADgpt: Harnessing Natural Language Processing for 3D Modelling to Enhance Computer-Aided Design Workflows Timo Kapsalis et.al. 2401.05476v1 null
2024-01-10 MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector Marta R. Costa-jussà et.al. 2401.05060v1 link
2024-01-09 Entity Recognition from Colloquial Text Tamara Babaian et.al. 2401.04853v1 null
2024-01-09 MoSECroT: Model Stitching with Static Word Embeddings for Crosslingual Zero-shot Transfer Haotian Ye et.al. 2401.04821v1 null
2024-01-09 Phishing Website Detection through Multi-Model Analysis of HTML Content Furkan Çolhak et.al. 2401.04820v1 null
2024-01-10 Low-Resource Vision Challenges for Foundation Models Yunhua Zhang et.al. 2401.04716v2 null
2024-01-09 TechGPT-2.0: A large language model project to solve the task of knowledge graph construction Jiaqi Wang et.al. 2401.04507v1 link
2024-01-09 LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training Khoi M. Le et.al. 2401.04348v1 link
2024-01-09 Know Your Needs Better: Towards Structured Understanding of Marketer Demands with Analogical Reasoning Augmented LLMs Junjie Wang et.al. 2401.04319v1 link
2024-01-08 Attention versus Contrastive Learning of Tabular Data -- A Data-centric Benchmarking Shourav B. Rabbani et.al. 2401.04266v1 null
2024-01-08 Large language models in bioinformatics: applications and perspectives Jiajia Liu et.al. 2401.04155v1 null
2024-01-08 Empirical Analysis of Efficient Fine-Tuning Methods for Large Pre-Trained Language Models Nigel Doering et.al. 2401.04051v1 null
2024-01-08 IDoFew: Intermediate Training Using Dual-Clustering in Language Models for Few Labels Text Classification Abdullah Alsuhaibani et.al. 2401.04025v1 null
2024-01-08 Aligned with LLM: a new multi-modal training paradigm for encoding fMRI activity in visual cortex Shuxiao Ma et.al. 2401.03851v1 null
2024-01-08 We Need to Talk About Classification Evaluation Metrics in NLP Peter Vickers et.al. 2401.03831v1 null
2024-01-08 Anatomy of Neural Language Models Majd Saleh et.al. 2401.03797v1 link
2024-01-08 Overview of the 2023 ICON Shared Task on Gendered Abuse Detection in Indic Languages Aatman Vaidya et.al. 2401.03677v1 null
2024-01-07 Is there really a Citation Age Bias in NLP? Hoa Nguyen et.al. 2401.03545v1 null
2024-01-07 Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions Yichi Zhang et.al. 2401.03495v1 link
2024-01-07 Maintaining Journalistic Integrity in the Digital Age: A Comprehensive NLP Framework for Evaluating Online News Content Ljubisa Bojic et.al. 2401.03467v1 null
2024-01-07 Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects Yuheng Cheng et.al. 2401.03428v1 link
2024-01-05 Introducing Bode: A Fine-Tuned Large Language Model for Portuguese Prompt-Based Task Gabriel Lino Garcia et.al. 2401.02909v1 null
2024-01-05 Nonlinear functional regression by functional deep neural network with kernel embedding Zhongjie Shi et.al. 2401.02890v1 null
2024-01-05 Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks Haoyuan Wu et.al. 2401.02731v1 link
2024-01-05 Beyond Fidelity: Explaining Vulnerability Localization of Learning-based Detectors Baijun Cheng et.al. 2401.02686v1 link
2024-01-05 Training and Serving System of Foundation Models: A Comprehensive Survey Jiahang Zhou et.al. 2401.02643v1 null
2024-01-04 L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages Aishwarya Mirashi et.al. 2401.02254v1 link
2024-01-04 SwitchTab: Switched Autoencoders Are Effective Tabular Learners Jing Wu et.al. 2401.02013v1 null
2024-01-03 Mining Temporal Attack Patterns from Cyberthreat Intelligence Reports Md Rayhanur Rahman et.al. 2401.01883v1 null
2024-01-03 Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling Himmet Toprak Kesgin et.al. 2401.01830v1 null
2024-01-03 Text mining arXiv: a look through quantitative finance papers Michele Leonardo Bianchi et.al. 2401.01751v1 null
2024-01-03 Predicting challenge moments from students' discourse: A comparison of GPT-4 to two traditional natural language processing approaches Wannapon Suraworachet et.al. 2401.01692v1 null
2024-01-04 Towards a Foundation Purchasing Model: Pretrained Generative Autoregression on Transaction Sequences Piotr Skalski et.al. 2401.01641v2 link
2024-01-03 Test-Time Personalization with Meta Prompt for Gaze Estimation Huan Liu et.al. 2401.01577v1 link
2024-01-03 Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models Rita Frieske et.al. 2401.01572v1 null
2024-01-03 LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training Rujiao Long et.al. 2401.01522v1 null
2024-01-03 Practical Guidelines for the Selection and Evaluation of NLP Techniques in RE Mehrdad Sabetzadeh et.al. 2401.01508v1 null
2024-01-03 Natural Language Processing and Multimodal Stock Price Prediction Kevin Taylor et.al. 2401.01487v1 null
2024-01-02 LLM Harmony: Multi-Agent Communication for Problem Solving Sumedh Rasal et.al. 2401.01312v1 link
2024-01-02 Fairness Certification for Natural Language Processing and Large Language Models Vincent Freiberger et.al. 2401.01262v1 null
2024-01-02 Imperio: Language-Guided Backdoor Attacks for Arbitrary Model Control Ka-Ho Chow et.al. 2401.01085v1 link
2024-01-02 Vietnamese Poem Generation & The Prospect Of Cross-Language Poem-To-Poem Translation Triet Huynh Minh et.al. 2401.01078v1 link
2024-01-02 Cheetah: Natural Language Generation for 517 African Languages Ife Adebara et.al. 2401.01053v1 null
2024-01-02 Safety and Performance, Why Not Both? Bi-Objective Optimized Model Compression against Heterogeneous Attacks Toward AI Software Deployment Jie Zhu et.al. 2401.00996v1 link
2024-01-01 Temporal Validity Change Prediction Georg Wenzel et.al. 2401.00779v1 null
2024-01-01 Large language model for Bible sentiment analysis: Sermon on the Mount Mahek Vora et.al. 2401.00689v1 link
2024-01-01 Predicting Anti-microbial Resistance using Large Language Models Hyunwoo Yoo et.al. 2401.00642v1 null
2023-12-31 Exploring the Effectiveness of Instruction Tuning in Biomedical Language Processing Omid Rohanian et.al. 2401.00579v1 null
2023-12-29 Action-Item-Driven Summarization of Long Meeting Transcripts Logan Golia et.al. 2312.17581v1 link
2023-12-29 Overview of the PromptCBLUE Shared Task in CHIP2023 Wei Zhu et.al. 2312.17522v1 link
2023-12-28 GitAgent: Facilitating Autonomous Agent with GitHub by Tool Extension Bohan Lyu et.al. 2312.17294v1 null
2023-12-29 Length Extrapolation of Transformers: A Survey from the Perspective of Position Encoding Liang Zhao et.al. 2312.17044v2 null
2023-12-28 Few-shot learning for automated content analysis: Efficient coding of arguments and claims in the debate on arms deliveries to Ukraine Jonas Rieger et.al. 2312.16975v1 null
2023-12-27 A proposed new metric for the conceptual diversity of a text İlknur Dönmez Phd et.al. 2312.16548v1 null
2023-12-26 Zur Darstellung eines mehrstufigen Prototypbegriffs in der multilingualen automatischen Sprachgenerierung: vom Korpus über word embeddings bis hin zum automatischen Wörterbuch María José Domínguez Vázquez et.al. 2312.16311v1 null
2023-12-26 Social-Transmotion: Promptable Human Trajectory Prediction Saeed Saadatnejad et.al. 2312.16168v1 link
2023-12-26 Dotless Representation of Arabic Text: Analysis and Modeling Maged S. Al-Shaibani et.al. 2312.16104v1 null
2023-12-26 FedMS: Federated Learning with Mixture of Sparsely Activated Foundations Models Panlong Wu et.al. 2312.15926v1 null
2023-12-26 Think and Retrieval: A Hypothesis Knowledge Graph Enhanced Medical Large Language Models Xinke Jiang et.al. 2312.15883v1 null
2023-12-26 Heterogeneous Encoders Scaling In The Transformer For Neural Machine Translation Jia Cheng Hu et.al. 2312.15872v1 null
2023-12-26 Punctuation Matters! Stealthy Backdoor Attack for Language Models Xuan Sheng et.al. 2312.15867v1 null
2023-12-26 Hypergraph Enhanced Knowledge Tree Prompt Learning for Next-Basket Recommendation Zi-Feng Mai et.al. 2312.15851v1 null
2023-12-25 Design and Implementation of a Tool for Extracting Uzbek Syllables Ulugbek Salaev et.al. 2312.15779v1 null
2023-12-25 Large Language Models are Not Stable Recommender Systems Tianhui Ma et.al. 2312.15746v1 null
2023-12-25 PersianLLaMA: Towards Building First Persian Large Language Model Mohammad Amin Abbasi et.al. 2312.15713v1 null
2023-12-22 YAYI 2: Multilingual Open-Source Large Language Models Yin Luo et.al. 2312.14862v1 null
2023-12-22 Large Language Model (LLM) Bias Index -- LLMBI Abiodun Finbarrs Oketunji et.al. 2312.14769v1 null
2023-12-22 Zero-shot Causal Graph Extrapolation from Text via LLMs Alessandro Antonucci et.al. 2312.14670v1 link
2023-12-22 Training Neural Networks with Internal State, Unconstrained Connectivity, and Discrete Activations Alexander Grushin et.al. 2312.14359v1 null
2023-12-21 Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs Juraj Vladika et.al. 2312.13881v1 null
2023-12-21 kNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels Jiaming Zhou et.al. 2312.13560v1 link
2023-12-21 Empowering Few-Shot Recommender Systems with Large Language Models -- Enhanced Representations Zhoumeng Wang et.al. 2312.13557v1 link
2023-12-20 AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation Dong Huang et.al. 2312.13010v1 link
2023-12-20 Segmenting Messy Text: Detecting Boundaries in Text Derived from Historical Newspaper Images Carol Anderson et.al. 2312.12773v1 null
2023-12-21 A Case Study on Test Case Construction with Large Language Models: Unveiling Practical Insights and Challenges Roberto Francisco de Lima Junior et.al. 2312.12598v2 null
2023-12-19 Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation Sihan Liu et.al. 2312.12470v1 link
2023-12-19 Geo-located Aspect Based Sentiment Analysis (ABSA) for Crowdsourced Evaluation of Urban Environments Demircan Tas et.al. 2312.12253v1 null
2023-12-19 Parameter-Efficient Fine-Tuning Methods for Pretrained Language Models: A Critical Review and Assessment Lingling Xu et.al. 2312.12148v1 null
2023-12-19 Designing Guiding Principles for NLP for Healthcare: A Case Study of Maternal Health Maria Antoniak et.al. 2312.11803v1 link
2023-12-19 MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA Lang Yu et.al. 2312.11795v1 link
2023-12-19 MineObserver 2.0: A Deep Learning & In-Game Framework for Assessing Natural Language Descriptions of Minecraft Imagery Jay Mahajan et.al. 2312.11761v1 null
2023-12-18 A Heterogeneous Chiplet Architecture for Accelerating End-to-End Transformer Models Harsh Sharma et.al. 2312.11750v1 null
2023-12-18 Agent-based Learning of Materials Datasets from Scientific Literature Mehrad Ansari et.al. 2312.11690v1 link
2023-12-18 From Generalized Laughter to Personalized Chuckles: Unleashing the Power of Data Fusion in Subjective Humor Detection Julita Bielaniewicz et.al. 2312.11296v1 null
2023-12-18 Structure-Preserving Transformers for Learning Parametrized Hamiltonian Systems Benedikt Brantner et.al. 2312.11166v1 link
2023-12-18 Efficiency-oriented approaches for self-supervised speech representation learning Luis Lugo et.al. 2312.11142v1 null
2023-12-17 Validation of Rigorous Requirements Specifications and Document Automation with the ITLingo RSL Language Andre Rodrigues et.al. 2312.10822v1 null
2023-12-17 What Makes Digital Support Effective? How Therapeutic Skills Affect Clinical Well-Being Anna Fang et.al. 2312.10775v1 null
2023-12-17 Identification of Knowledge Neurons in Protein Language Models Divya Nori et.al. 2312.10770v1 null
2023-12-17 Can persistent homology whiten Transformer-based black-box models? A case study on BERT compression Luis Balderas et.al. 2312.10702v1 null
2023-12-17 Cross-Domain Robustness of Transformer-based Keyphrase Generation Anna Glazkova et.al. 2312.10700v1 null
2023-12-17 Wikiformer: Pre-training with Structured Information of Wikipedia for Ad-hoc Retrieval Weihang Su et.al. 2312.10661v1 link
2023-12-17 Decoding Concerns: Multi-label Classification of Vaccine Sentiments in Social Media Somsubhra De et.al. 2312.10626v1 link
2023-12-16 CoCoGen: Physically-Consistent and Conditioned Score-based Generative Models for Forward and Inverse Problems Christian Jacobsen et.al. 2312.10527v1 null
2023-12-16 Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning Kaiyou Song et.al. 2312.10457v1 link
2023-12-16 An Attentive Inductive Bias for Sequential Recommendation Beyond the Self-Attention Yehjin Shin et.al. 2312.10325v1 link
2023-12-15 Faithful Persona-based Conversational Dataset Generation with Large Language Models Pegah Jandaghi et.al. 2312.10007v1 link
2023-12-15 LLaMAntino: LLaMA 2 Models for Effective Text Generation in Italian Language Pierpaolo Basile et.al. 2312.09993v1 null
2023-12-15 A Novel Dataset for Financial Education Text Simplification in Spanish Nelson Perez-Rojas et.al. 2312.09897v1 null
2023-12-15 Deep Unsupervised Domain Adaptation for Time Series Classification: a Benchmark Hassan Ismail Fawaz et.al. 2312.09857v1 link
2023-12-15 Algorithms for automatic intents extraction and utterances classification for goal-oriented dialogue systems Leonid Legashev et.al. 2312.09658v1 null
2023-12-15 Picking the Underused Heads: A Network Pruning Perspective of Attention Head Selection for Fusing Dialogue Coreference Information Zhengyuan Liu et.al. 2312.09541v1 null
2023-12-15 Riveter: Measuring Power and Social Dynamics Between Entities Maria Antoniak et.al. 2312.09536v1 link
2023-12-14 Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision Collin Burns et.al. 2312.09390v1 null
2023-12-13 N-Gram Unsupervised Compoundation and Feature Injection for Better Symbolic Music Understanding Jinhao Tian et.al. 2312.08931v1 link
2023-12-15 Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis Yafei Hu et.al. 2312.08782v2 null
2023-12-15 VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding Yi Xin et.al. 2312.08733v2 null
2023-12-14 A Comparative Analysis of Fine-Tuned LLMs and Few-Shot Learning of LLMs for Financial Sentiment Analysis Sorouralsadat Fatemi et.al. 2312.08725v1 null
2023-12-14 ChatSOS: LLM-based knowledge Q&A system for safety engineering Haiyang Tang et.al. 2312.08629v1 null
2023-12-13 A Survey of Generative AI for Intelligent Transportation Systems Huan Yan et.al. 2312.08248v1 null
2023-12-13 LAMM: Label Alignment for Multi-Modal Prompt Learning Jingsheng Gao et.al. 2312.08212v1 link
2023-12-13 Towards Model-Based Data Acquisition for Subjective Multi-Task NLP Problems Kamil Kanclerz et.al. 2312.08198v1 link
2023-12-13 CIDR: A Cooperative Integrated Dynamic Refining Method for Minimal Feature Removal Problem Qian Chen et.al. 2312.08157v1 link
2023-12-13 Efficient Representation of the Activation Space in Deep Neural Networks Tanya Akumu et.al. 2312.08143v1 null
2023-12-13 Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models Junhao Zheng et.al. 2312.07887v1 link
2023-12-13 Abusive Span Detection for Vietnamese Narrative Texts Nhu-Thanh Nguyen et.al. 2312.07831v1 null
2023-12-13 A Deep Learning-Based System for Automatic Case Summarization Minh Duong et.al. 2312.07824v1 null
2023-12-12 Estimation of embedding vectors in high dimensions Golara Ahmadi Azar et.al. 2312.07802v1 null
2023-12-12 Sentiment analysis in Tourism: Fine-tuning BERT or sentence embeddings concatenation? Ibrahim Bouabdallaoui et.al. 2312.07797v1 null
2023-12-12 MS-Twins: Multi-Scale Deep Self-Attention Networks for Medical Image Segmentation Jing Xu et.al. 2312.07128v1 null
2023-12-12 Towards Enhanced Human Activity Recognition through Natural Language Generation and Pose Estimation Nikhil Kashyap et.al. 2312.06965v1 null
2023-12-11 Self-supervised Machine Learning Based Approach to Orbit Modelling Applied to Space Traffic Management Emma Stevenson et.al. 2312.06854v1 null
2023-12-11 TaCo: Targeted Concept Removal in Output Embeddings for NLP via Information Theory and Explainability Fanny Jourdan et.al. 2312.06499v1 link
2023-12-11 Survey on Memory-Augmented Neural Networks: Cognitive Insights to AI Applications Savya Khosla et.al. 2312.06141v1 null
2023-12-11 Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need Cheng Peng et.al. 2312.06099v1 null
2023-12-11 SECNN: Squeeze-and-Excitation Convolutional Neural Network for Sentence Classification Shandong Yuan et.al. 2312.06088v1 null
2023-12-11 IEKG: A Commonsense Knowledge Graph for Idiomatic Expressions Ziheng Zeng et.al. 2312.06053v1 link
2023-12-10 Modeling Uncertainty in Personalized Emotion Prediction with Normalizing Flows Piotr Miłkowski et.al. 2312.06034v1 link
2023-12-10 Large Language Models on Lexical Semantic Change Detection: An Evaluation Ruiyu Wang et.al. 2312.06002v1 null
2023-12-10 Natural Interaction Modalities for Human-CPS Interaction in Construction Progress Monitoring Srijeet Halder et.al. 2312.05988v1 null
2023-12-10 FP8-BERT: Post-Training Quantization for Transformer Jianwei Li et.al. 2312.05725v1 null
2023-12-09 NLLG Quarterly arXiv Report 09/23: What are the most influential current AI Papers? Ran Zhang et.al. 2312.05688v1 link
2023-12-08 HALO: An Ontology for Representing Hallucinations in Generative Models Navapat Nananukul et.al. 2312.05209v1 null
2023-12-08 Converting Epics/Stories into Pseudocode using Transformers Gaurav Kolhatkar et.al. 2312.05047v1 null
2023-12-08 Illicit Darkweb Classification via Natural-language Processing: Classifying Illicit Content of Webpages based on Textual Information Giuseppe Cascavilla et.al. 2312.04944v1 null
2023-12-08 Ophtha-LLaMA2: A Large Language Model for Ophthalmology Huan Zhao et.al. 2312.04906v1 null
2023-12-08 How to Determine the Most Powerful Pre-trained Language Model without Brute Force Fine-tuning? An Empirical Survey Jun Bai et.al. 2312.04775v1 link
2023-12-07 The Impact of AI Innovations on U.S. Occupations Ali Akbar Septiandri et.al. 2312.04714v1 null
2023-12-07 Simul-LLM: A Framework for Exploring High-Quality Simultaneous Translation with Large Language Models Victor Agostinelli et.al. 2312.04691v1 link
2023-12-07 PyThaiNLP: Thai Natural Language Processing in Python Wannaphong Phatthiyaphaibun et.al. 2312.04649v1 link
2023-12-07 Leveraging Transformer-based Language Models to Automate Requirements Satisfaction Assessment Amrit Poudel et.al. 2312.04463v1 null
2023-12-07 CLadder: A Benchmark to Assess Causal Reasoning Capabilities of Language Models Zhijing Jin et.al. 2312.04350v1 link
2023-12-07 Beyond Surface: Probing LLaMA Across Scales and Layers Nuo Chen et.al. 2312.04333v1 link
2023-12-07 nerblackbox: A High-level Library for Named Entity Recognition in Python Felix Stollenwerk et.al. 2312.04306v1 link
2023-12-07 Graph Convolutions Enrich the Self-Attention in Transformers! Jeongwhan Choi et.al. 2312.04234v1 null
2023-12-07 CODEX: A Cluster-Based Method for Explainable Reinforcement Learning Timothy K. Mathes et.al. 2312.04216v1 link
2023-12-07 Language Model Knowledge Distillation for Efficient Question Answering in Spanish Adrián Bazaga et.al. 2312.04193v1 link
2023-12-07 Series2Vec: Similarity-based Self-supervised Representation Learning for Time Series Classification Navid Mohammadi Foumani et.al. 2312.03998v1 link
2023-12-06 Collaboration or Corporate Capture? Quantifying NLP's Reliance on Industry Artifacts and Contributions Will Aitken et.al. 2312.03912v1 null
2023-12-07 Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers Umberto Cappellazzo et.al. 2312.03694v2 link
2023-12-06 KhabarChin: Automatic Detection of Important News in the Persian Language Hamed Hematian Hemati et.al. 2312.03361v1 link
2023-12-06 Measuring Misogyny in Natural Language Generation: Preliminary Results from a Case Study on two Reddit Communities Aaron J. Snoswell et.al. 2312.03330v1 null
2023-12-06 Detecting Rumor Veracity with Only Textual Information by Double-Channel Structure Alex Kim et.al. 2312.03195v1 null
2023-12-06 Corporate Bankruptcy Prediction with Domain-Adapted BERT Alex Kim et.al. 2312.03194v1 null
2023-12-05 Inherent limitations of LLMs regarding spatial information He Yan et.al. 2312.03042v1 link
2023-12-05 Concept Drift Adaptation in Text Stream Mining Settings: A Comprehensive Review Cristiano Mesquita Garcia et.al. 2312.02901v1 null
2023-12-05 Large Language Models on Graphs: A Comprehensive Survey Bowen Jin et.al. 2312.02783v1 link
2023-12-05 Empathy and Distress Detection using Ensembles of Transformer Models Tanmay Chavan et.al. 2312.02578v1 null
2023-12-05 Towards More Unified In-context Visual Understanding Dianmo Sheng et.al. 2312.02520v1 null
2023-12-05 MKA: A Scalable Medical Knowledge Assisted Mechanism for Generative Models on Medical Conversation Tasks Ke Liang et.al. 2312.02496v1 link
2023-12-04 Measuring Distributional Shifts in Text: The Advantage of Language Model-Based Embeddings Gyandev Gupta et.al. 2312.02337v1 null
2023-12-04 Revisiting Topic-Guided Language Models Carolina Zheng et.al. 2312.02331v1 link
2023-12-04 LLMs Accelerate Annotation for Medical Information Extraction Akshay Goel et.al. 2312.02296v1 null
2023-12-04 TPPoet: Transformer-Based Persian Poem Generation using Minimal Data and Advanced Decoding Techniques Amir Panahandeh et.al. 2312.02125v1 null
2023-12-04 Wild-Tab: A Benchmark For Out-Of-Distribution Generalization In Tabular Regression Sergey Kolesnikov et.al. 2312.01792v1 null
2023-12-04 Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites Lei Wang et.al. 2312.01701v1 link
2023-12-04 AGD: an Auto-switchable Optimizer using Stepwise Gradient Difference for Preconditioning Matrix Yun Yue et.al. 2312.01658v1 link
2023-12-03 AI-Powered Arabic Crossword Puzzle Generation for Educational Applications Kamyar Zeinalipour et.al. 2312.01339v1 null
2023-12-03 NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian Peng Liu et.al. 2312.01314v1 null
2023-12-03 Multiscale Topology in Interactomic Network: From Transcriptome to Antiaddiction Drug Repurposing Hongyan Du et.al. 2312.01272v1 null
2023-12-02 Enabling Quantum Natural Language Processing for Hindi Language Naman Srivastava et.al. 2312.01221v1 null
2023-12-02 Understanding Opinions Towards Climate Change on Social Media Yashaswi Pupneja et.al. 2312.01217v1 null
2023-12-02 From Voices to Validity: Leveraging Large Language Models (LLMs) for Textual Analysis of Policy Stakeholder Interviews Alex Liu et.al. 2312.01202v1 null
2023-12-01 Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals Tam Nguyen et.al. 2312.00751v1 null
2023-12-01 Infrared Image Super-Resolution via GAN Yongsong Huang et.al. 2312.00689v1 null
2023-12-01 Towards Transparency in Coreference Resolution: A Quantum-Inspired Approach Hadi Wazni et.al. 2312.00688v1 link
2023-12-01 Contextualized word senses: from attention to compositionality Pablo Gamallo et.al. 2312.00680v1 null
2023-12-01 Nonparametric Variational Regularisation of Pretrained Transformers Fabio Fehr et.al. 2312.00662v1 null
2023-12-01 CoLLiE: Collaborative Training of Large Language Models in an Efficient Way Kai Lv et.al. 2312.00407v1 link
2023-11-30 Towards Unsupervised Representation Learning: Learning, Evaluating and Transferring Visual Representations Bonifaz Stuhr et.al. 2312.00101v1 link
2023-11-30 Introducing Rhetorical Parallelism Detection: A New Task with Datasets, Metrics, and Baselines Stephen Bothwell et.al. 2312.00100v1 link
2023-11-30 CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation Pei Ke et.al. 2311.18702v1 link
2023-11-30 ESG Accountability Made Easy: DocQA at Your Service Lokesh Mishra et.al. 2311.18481v1 null
2023-11-30 Lessons from Building CodeBuddy: A Contextualized AI Coding Assistant gustavo Pinto et.al. 2311.18450v1 null
2023-11-30 Automatic Construction of a Korean Toxic Instruction Dataset for Ethical Tuning of Large Language Models Sungjoo Byun et.al. 2311.18215v1 null
2023-11-29 Uncertainty Guided Global Memory Improves Multi-Hop Question Answering Alsu Sagirova et.al. 2311.18151v1 link
2023-11-29 Mukhyansh: A Headline Generation Dataset for Indic Languages Lokesh Madasu et.al. 2311.17743v1 link
2023-11-29 AviationGPT: A Large Language Model for the Aviation Domain Liya Wang et.al. 2311.17686v1 null
2023-11-29 Introduction to Transformers: an NLP Perspective Tong Xiao et.al. 2311.17633v1 link
2023-11-29 Model Performance Prediction for Hyperparameter Optimization of Deep Learning Models Using High Performance Computing and Quantum Annealing Juan Pablo García Amboage et.al. 2311.17508v1 null
2023-11-30 Grounding Foundation Models through Federated Transfer Learning: A General Framework Yan Kang et.al. 2311.17431v2 null
2023-11-29 Improving the Robustness of Transformer-based Large Language Models with Dynamic Attention Lujia Shen et.al. 2311.17400v1 null
2023-11-29 Are Large Language Models Good Fact Checkers: A Preliminary Study Han Cao et.al. 2311.17355v1 null
2023-11-29 A natural language processing-based approach: mapping human perception by understanding deep semantic features in street view images Haoran Ma et.al. 2311.17354v1 null
2023-11-29 Elo Uncovered: Robustness and Best Practices in Language Model Evaluation Meriem Boubdir et.al. 2311.17295v1 null
2023-11-28 Quantifying the redundancy between prosody and text Lukas Wolf et.al. 2311.17233v1 link
2023-11-28 Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis Aman Yadav et.al. 2311.16965v1 null
2023-11-28 A Benchmark for Evaluating Machine Translation Metrics on Dialects Without Standard Orthography Noëmi Aepli et.al. 2311.16865v1 link
2023-11-28 The curse of language biases in remote sensing VQA: the role of spatial attributes, language diversity, and the need for clear evaluation Christel Chappuis et.al. 2311.16782v1 null
2023-11-28 RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement Longhui Zhang et.al. 2311.16720v1 link
2023-11-28 Large Language Models Meet Computer Vision: A Brief Survey Raby Hamadi et.al. 2311.16673v1 null
2023-11-28 MedGen: A Python Natural Language Processing Toolkit for Medical Text Processing Rui Yang et.al. 2311.16588v1 link
2023-11-28 Graph Prompt Learning: A Comprehensive Survey and Beyond Xiangguo Sun et.al. 2311.16534v1 link
2023-11-27 Leveraging deep active learning to identify low-resource mobility functioning information in public clinical notes Tuan-Dung Le et.al. 2311.15946v1 null
2023-11-27 PIPE : Parallelized Inference Through Post-Training Quantization Ensembling of Residual Expansions Edouard Yvinec et.al. 2311.15806v1 null
2023-11-27 Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs Simone Conia et.al. 2311.15781v1 link
2023-11-27 Knowledge Unlearning for LLMs: Tasks, Methods, and Challenges Nianwen Si et.al. 2311.15766v1 null
2023-11-27 Italian Crossword Generator: Enhancing Education through Interactive Word Puzzles Kamyar Zeinalipour et.al. 2311.15723v1 null
2023-11-27 Cerbero-7B: A Leap Forward in Language-Specific LLMs Through Enhanced Chat Corpus Generation and Evaluation Federico A. Galatolo et.al. 2311.15698v1 link
2023-11-27 RoboGPT: an intelligent agent of making embodied long-term decisions for daily instruction tasks Yaran Chen et.al. 2311.15649v1 null
2023-11-27 Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text Finbarrs Oketunji et.al. 2311.15565v1 null
2023-11-27 A Comparative and Experimental Study on Automatic Question Answering Systems and its Robustness against Word Jumbling Shashidhar Reddy Javaji et.al. 2311.15513v1 null
2023-11-26 Local Convergence of Approximate Newton Method for Two Layer Nonlinear Regression Zhihang Li et.al. 2311.15390v1 null
2023-11-24 GPT Struct Me: Probing GPT Models on Narrative Entity Extraction Hugo Sousa et.al. 2311.14583v1 link
2023-11-24 CMed-GPT: Prompt Tuning for Entity-Aware Chinese Medical Dialogue Generation Zhijie Qu et.al. 2311.14539v1 null
2023-11-24 Narratives from GPT-derived Networks of News, and a link to Financial Markets Dislocations Deborah Miori et.al. 2311.14419v1 null
2023-11-24 LLamol: A Dynamic Multi-Conditional Generative Transformer for De Novo Molecular Design Niklas Dobberstein et.al. 2311.14407v1 link
2023-11-24 Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs Shengyin Sun et.al. 2311.14324v1 null
2023-11-24 Cosine Similarity Knowledge Distillation for Individual Class Information Transfer Gyeongdo Ham et.al. 2311.14307v1 null
2023-11-23 Uncovering Gender Stereotypes in Video Game Character Designs: A Multi-Modal Analysis of Honor of Kings Bingqing Liu et.al. 2311.14226v1 null
2023-11-23 Towards Explainable Strategy Templates using NLP Transformers Pallavi Bagga et.al. 2311.14061v1 null
2023-11-23 Efficient Trigger Word Insertion Yueqi Zeng et.al. 2311.13957v1 null
2023-11-22 Comparison of pipeline, sequence-to-sequence, and GPT models for end-to-end relation extraction: experiments with the rare disease use-case Shashank Gupta et.al. 2311.13729v1 link
2023-11-22 A Survey of Serverless Machine Learning Model Inference Kamil Kojs et.al. 2311.13587v1 null
2023-11-22 Machine Translation to Control Formality Features in the Target Language Harshita Tyagi et.al. 2311.13475v1 null
2023-11-22 Confidant: Customizing Transformer-based LLMs via Collaborative Edge Training Yuhao Chen et.al. 2311.13381v1 null
2023-11-22 Combatting Human Trafficking in the Cyberspace: A Natural Language Processing-Based Methodology to Analyze the Language in Online Advertisements Alejandro Rodriguez Perez et.al. 2311.13118v1 null
2023-11-21 A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs Jiageng Zhong et.al. 2311.12893v1 null
2023-11-21 Alpha Zero for Physics: Application of Symbolic Regression with Alpha Zero to find the analytical methods in physics Yoshihiro Michishita et.al. 2311.12713v1 null
2023-11-21 MathGloss: Building mathematical glossaries from text Lucy Horowitz et.al. 2311.12649v1 link
2023-11-21 Classification of Tabular Data by Text Processing Keshav Ramani et.al. 2311.12521v1 null
2023-11-21 Extracting Definienda in Mathematical Scholarly Articles with Transformers Shufan Jiang et.al. 2311.12448v1 link
2023-11-21 A Survey on Large Language Models for Personalized and Explainable Recommendations Junyi Chen et.al. 2311.12338v1 null
2023-11-21 AcademicGPT: Empowering Academic Research Shufa Wei et.al. 2311.12315v1 null
2023-11-21 ATLANTIC: Structure-Aware Retrieval-Augmented Language Model for Interdisciplinary Science Sai Munikoti et.al. 2311.12289v1 null
2023-11-21 Equipping Pretrained Unconditional Music Transformers with Instrument and Genre Controls Weihan Xu et.al. 2311.12257v1 null
2023-11-20 Applications of Large Scale Foundation Models for Autonomous Driving Yu Huang et.al. 2311.12144v1 null
2023-11-20 Generating Valid and Natural Adversarial Examples with Large Language Models Zimu Wang et.al. 2311.11861v1 null
2023-11-20 Web News Timeline Generation with Extended Task Prompting Sha Wang et.al. 2311.11652v1 null
2023-11-20 Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks Ling Luo et.al. 2311.11608v1 link
2023-11-20 Exploring Prompting Large Language Models as Explainable Metrics Ghazaleh Mahmoudi et.al. 2311.11552v1 link
2023-11-20 Which AI Technique Is Better to Classify Requirements? An Experiment with SVM, LSTM, and ChatGPT Abdelkarim El-Hajjami et.al. 2311.11547v1 null
2023-11-20 ADAPTER-RL: Adaptation of Any Agent using Reinforcement Learning Yizhao Jin et.al. 2311.11537v1 null
2023-11-19 Self-Distilled Representation Learning for Time Series Felix Pieper et.al. 2311.11335v1 null
2023-11-19 Portuguese FAQ for Financial Services Paulo Finardi et.al. 2311.11331v1 null
2023-11-19 Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters Yinghui Li et.al. 2311.11268v1 link
2023-11-18 Hate speech and hate crimes: a data-driven study of evolving discourse around marginalized groups Malvina Bozhidarova et.al. 2311.11163v1 link
2023-11-17 Detection of Offensive and Threatening Online Content in a Low Resource Language Fatima Muhammad Adam et.al. 2311.10541v1 null
2023-11-17 ReuseSense: With Great Reuse Comes Greater Efficiency; Effectively Employing Computation Reuse on General-Purpose CPUs Nitesh Narayana GS et.al. 2311.10487v1 null
2023-11-17 Sinhala-English Word Embedding Alignment: Introducing Datasets and Benchmark for a Low Resource Language Kasun Wickramasinghe et.al. 2311.10436v1 null
2023-11-17 Causal Graph in Language Model Rediscovers Cortical Hierarchy in Human Narrative Processing Zhengqi He et.al. 2311.10431v1 null
2023-11-16 The Analysis and Extraction of Structure from Organizational Charts Nikhil Manali et.al. 2311.10234v1 null
2023-11-16 Revolutionizing Customer Interactions: Insights and Challenges in Deploying ChatGPT and Generative Chatbots for FAQs Feriel Khennouche et.al. 2311.09976v1 null
2023-11-16 OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking Chia-Hsuan Lee et.al. 2311.09758v1 null
2023-11-16 Trustworthy Large Models in Vision: A Survey Ziyan Guo et.al. 2311.09680v1 null
2023-11-17 FunctionMarker: Watermarking Language Datasets via Knowledge Injection Shuai Li et.al. 2311.09535v2 null
2023-11-16 AMRFact: Enhancing Summarization Factuality Evaluation with AMR-driven Training Data Generation Haoyi Qiu et.al. 2311.09521v1 link
2023-11-16 Atoms as Words: A Novel Approach to Deciphering Material Properties using NLP-inspired Machine Learning on Crystallographic Information Files (CIFs) Lalit Yadav et.al. 2311.09508v1 null
2023-11-16 SegMix: A Simple Structure-Aware Data Augmentation Method Yuxin Pei et.al. 2311.09505v1 null
2023-11-16 Show Your Work with Confidence: Confidence Bands for Tuning Curves Nicholas Lourie et.al. 2311.09480v1 link
2023-11-15 Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science Sridevi Wagle et.al. 2311.09358v1 link
2023-11-15 Mind's Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models Weize Liu et.al. 2311.09214v1 link
2023-11-15 Exploring the Potential of Large Language Models in Computational Argumentation Guizhen Chen et.al. 2311.09022v1 link
2023-11-15 Large Language Models are legal but they are not: Making the case for a powerful LegalLLM Thanmay Jayakumar et.al. 2311.08890v1 null
2023-11-15 Thread of Thought Unraveling Chaotic Contexts Yucheng Zhou et.al. 2311.08734v1 null
2023-11-15 Enabling CMF Estimation in Data-Constrained Scenarios: A Semantic-Encoding Knowledge Mining Model Yanlin Qi et.al. 2311.08690v1 null
2023-11-16 MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration Lin Xu et.al. 2311.08562v2 link
2023-11-14 Natural Language Processing for Financial Regulation Ixandra Achitouv et.al. 2311.08533v1 null
2023-11-14 GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer Urchade Zaratiana et.al. 2311.08526v1 link
2023-11-14 Functionality learning through specification instructions Pedro Henrique Luz de Araujo et.al. 2311.08481v1 null
2023-11-14 A Material Lens on Coloniality in NLP William Held et.al. 2311.08391v1 null
2023-11-14 Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration Zhenran Xu et.al. 2311.08152v1 link
2023-11-14 How to get better embeddings with code pre-trained models? An empirical study Yu Zhao et.al. 2311.08066v1 null
2023-11-14 How Well Do Text Embedding Models Understand Syntax? Yan Zhang et.al. 2311.07996v1 link
2023-11-14 How good are Large Language Models on African Languages? Jessica Ojo et.al. 2311.07978v1 null
2023-11-14 Towards Improving Robustness Against Common Corruptions in Object Detectors Using Adversarial Contrastive Learning Shashank Kotyan et.al. 2311.07928v1 null
2023-11-13 GreekT5: A Series of Greek Sequence-to-Sequence Models for News Summarization Nikolaos Giarelis et.al. 2311.07767v1 link
2023-11-13 MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks Sanchit Ahuja et.al. 2311.07463v1 null
2023-11-13 The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4 Microsoft Research AI4Science et.al. 2311.07361v1 null
2023-11-13 calamanCy: A Tagalog Natural Language Processing Toolkit Lester James V. Miranda et.al. 2311.07171v1 link
2023-11-13 STEER: Unified Style Transfer with Expert Reinforcement Skyler Hallinan et.al. 2311.07167v1 link
2023-11-12 Simulating Public Administration Crisis: A Novel Generative Agent-Based Simulation System to Lower Technology Barriers in Social Science Research Bushi Xiao et.al. 2311.06957v1 null
2023-11-12 Retrieval and Generative Approaches for a Pregnancy Chatbot in Nepali with Stemmed and Non-Stemmed Data : A Comparative Study Sujan Poudel et.al. 2311.06898v1 null
2023-11-12 GIELLM: Japanese General Information Extraction Large Language Model Utilizing Mutual Reinforcement Effect Chengguang Gan et.al. 2311.06838v1 null
2023-11-12 Explainability of Vision Transformers: A Comprehensive Review and New Perspectives Rojina Kashefi et.al. 2311.06786v1 null
2023-11-12 Detecting and Correcting Hate Speech in Multimodal Memes with Large Visual Language Model Minh-Hao Van et.al. 2311.06737v1 null
2023-11-12 Simple and Effective Input Reformulations for Translation Brian Yu et.al. 2311.06696v1 link
2023-11-10 BanglaBait: Semi-Supervised Adversarial Approach for Clickbait Detection on Bangla Clickbait Dataset Md. Motahar Mahtab et.al. 2311.06204v1 link
2023-11-10 Is it indeed bigger better? The comprehensive study of claim detection LMs applied for disinformation tackling Martin Hyben et.al. 2311.06121v1 null
2023-11-10 ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences Yuanhe Tian et.al. 2311.06025v1 link
2023-11-10 Hiformer: Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems Huan Gui et.al. 2311.05884v1 null
2023-11-10 Exploring Fine-tuning ChatGPT for News Recommendation Xinyi Li et.al. 2311.05850v1 null
2023-11-09 Long-Horizon Dialogue Understanding for Role Identification in the Game of Avalon with Large Language Models Simon Stepputtis et.al. 2311.05720v1 null
2023-11-09 A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions Lei Huang et.al. 2311.05232v1 link
2023-11-09 Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization Jangwhan Lee et.al. 2311.05161v1 null
2023-11-09 Mental Health Diagnosis in the Digital Age: Harnessing Sentiment Analysis on Social Media Platforms upon Ultra-Sparse Feature Content Haijian Shao et.al. 2311.05075v1 null
2023-11-08 Towards Effective Paraphrasing for Information Disguise Anmol Agarwal et.al. 2311.05018v1 link
2023-11-08 Interpreting Pretrained Language Models via Concept Bottlenecks Zhen Tan et.al. 2311.05014v1 link
2023-11-08 Evaluating Generative Ad Hoc Information Retrieval Lukas Gienapp et.al. 2311.04694v1 null
2023-11-09 Evaluating Diverse Large Language Models for Automatic and General Bug Reproduction Sungmin Kang et.al. 2311.04532v2 link
2023-11-08 Multi-label and Multi-target Sampling of Machine Annotation for Computational Stance Detection Zhengyuan Liu et.al. 2311.04495v1 link
2023-11-08 Twitter Sentiment Analysis of Covid Vacciness Wenbo Zhu et.al. 2311.04479v1 null
2023-11-07 Formal Aspects of Language Modeling Ryan Cotterell et.al. 2311.04329v1 null
2023-11-07 SpaDeLeF: A Dataset for Hierarchical Classification of Lexical Functions for Collocations in Spanish Yevhen Kostiuk et.al. 2311.04189v1 null
2023-11-07 Perturbed examples reveal invariances shared by language models Ruchit Rawal et.al. 2311.04166v1 null
2023-11-07 Unveiling Safety Vulnerabilities of Large Language Models George Kour et.al. 2311.04124v1 null
2023-11-07 DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding Kehinde Ajayi et.al. 2311.04098v1 link
2023-11-07 Personality Style Recognition via Machine Learning: Identifying Anaclitic and Introjective Personality Styles from Patients' Speech Semere Kiros Bitew et.al. 2311.04088v1 null
2023-11-07 Cup Curriculum: Curriculum Learning on Model Capacity Luca Scharr et.al. 2311.03956v1 link
2023-11-07 Conversations in Galician: a Large Language Model for an Underrepresented Language Eliseo Bao et.al. 2311.03812v1 link
2023-11-07 Loss Balancing for Fair Supervised Learning Mohammad Mahdi Khalili et.al. 2311.03714v1 link
2023-11-07 Generalization of NLP Models: Notion and Causation Aparna Elangovan et.al. 2311.03663v1 null
2023-11-07 Instruct Me More! Random Prompting for Visual In-Context Learning Jiahao Zhang et.al. 2311.03648v1 link
2023-11-06 Tackling Concept Shift in Text Classification using Entailment-style Modeling Sumegh Roychowdhury et.al. 2311.03320v1 null
2023-11-06 Architectural Sweet Spots for Modeling Human Label Variation by the Example of Argument Quality: It's Best to Relate Perspectives! Philipp Heinisch et.al. 2311.03153v1 link
2023-11-06 BanLemma: A Word Formation Dependent Rule and Dictionary Based Bangla Lemmatizer Sadia Afrin et.al. 2311.03078v1 link
2023-11-06 Zero-shot Bilingual App Reviews Mining with Large Language Models Jialiang Wei et.al. 2311.03058v1 link
2023-11-06 GLEN: Generative Retrieval via Lexical Index Learning Sunkyung Lee et.al. 2311.03057v1 link
2023-11-06 Adapting Pre-trained Generative Models for Extractive Question Answering Prabir Mallick et.al. 2311.02961v1 null
2023-11-06 Incorporating Worker Perspectives into MTurk Annotation Practices for NLP Olivia Huang et.al. 2311.02802v1 null
2023-11-05 Pyclipse, a library for deidentification of free-text clinical notes Callandra Moore et.al. 2311.02748v1 null
2023-11-05 mahaNLP: A Marathi Natural Language Processing Library Vidula Magdum et.al. 2311.02579v1 link
2023-11-05 Relation Extraction Model Based on Semantic Enhancement Mechanism Peiyu Liu et.al. 2311.02564v1 null
2023-11-03 Grounded Intuition of GPT-Vision's Abilities with Scientific Images Alyssa Hwang et.al. 2311.02069v1 link
2023-11-03 Hardness of Low Rank Approximation of Entrywise Transformed Matrix Products Tamas Sarlos et.al. 2311.01960v1 null
2023-11-03 Constructing Temporal Dynamic Knowledge Graphs from Interactive Text-based Games Keunwoo Peter Yu et.al. 2311.01928v1 link
2023-11-03 Enhancing search engine precision and user experience through sentiment-based polysemy resolution Mike Nkongolo et.al. 2311.01895v1 null
2023-11-03 TCM-GPT: Efficient Pre-training of Large Language Models for Domain Adaptation in Traditional Chinese Medicine Guoxing Yang et.al. 2311.01786v1 null
2023-11-03 Indo LEGO-ABSA: A Multitask Generative Aspect Based Sentiment Analysis for Indonesian Language Randy Zakya Suchrady et.al. 2311.01757v1 link
2023-11-03 Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models Sean Xie et.al. 2311.01732v1 link
2023-11-02 ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos Te-Lin Wu et.al. 2311.01620v1 link
2023-11-02 Divergent Token Metrics: Measuring degradation to prune away LLM components -- and optimize quantization Björn Deiseroth et.al. 2311.01544v1 null
2023-11-02 A Comprehensive Study of Governance Issues in Decentralized Finance Applications Wei Ma et.al. 2311.01433v1 null
2023-11-02 Efficient Vision Transformer for Accurate Traffic Sign Detection Javad Mirzapour Kaleybar et.al. 2311.01429v1 null
2023-11-02 Finding Common Ground: Annotating and Predicting Common Ground in Spoken Conversations Magdalena Markowska et.al. 2311.01273v1 link
2023-11-02 Generating QM1B with PySCF $_{\text{IPU}}$ Alexander Mathiasen et.al. 2311.01135v1 link
2023-11-02 Noise-Robust Fine-Tuning of Pretrained Language Models via External Guidance Song Wang et.al. 2311.01108v1 null
2023-11-02 On the Concerns of Developers When Using GitHub Copilot Xiyu Zhou et.al. 2311.01020v1 null
2023-11-01 Crosslingual Retrieval Augmented In-context Learning for Bangla Xiaoqian Li et.al. 2311.00587v1 null
2023-11-01 On the Opportunities of Green Computing: A Survey You Zhou et.al. 2311.00447v1 null
2023-11-01 From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities Md Farhan Ishmam et.al. 2311.00308v1 null
2023-11-01 Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models Ran Xu et.al. 2311.00287v1 link
2023-11-01 Transformers as Recognizers of Formal Languages: A Survey on Expressivity Lena Strobl et.al. 2311.00208v1 null
2023-10-31 Defining a New NLP Playground Sha Li et.al. 2310.20633v1 null
2023-10-31 ACL Anthology Helper: A Tool to Retrieve and Manage Literature from ACL Anthology Chen Tang et.al. 2310.20467v1 null
2023-10-31 The SourceData-NLP dataset: integrating curation into scientific publishing for training large language models Jorge Abreu-Vicente et.al. 2310.20440v1 link
2023-10-31 AMERICANO: Argument Generation with Discourse-driven Decomposition and Agent Interaction Zhe Hu et.al. 2310.20352v1 null
2023-10-30 Integrating Summarization and Retrieval for Enhanced Personalization via Large Language Models Chris Richardson et.al. 2310.20081v1 null
2023-10-30 Partial Tensorized Transformers for Natural Language Processing Subhadra Vadlamannati et.al. 2310.20077v1 null
2023-10-30 Evaluation Framework for Understanding Sensitive Attribute Association Bias in Latent Factor Recommendation Algorithms Lex Beattie et.al. 2310.20061v1 null
2023-10-30 BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing Hieu Tran et.al. 2310.19975v1 null
2023-10-30 Deep Learning-Enabled Text Semantic Communication under Interference: An Empirical Study Tilahun M. Getu et.al. 2310.19974v1 null
2023-10-30 BTRec: BERT-Based Trajectory Recommendation for Personalized Tours Ngai Lam Ho et.al. 2310.19886v1 link
2023-10-30 Adapter Pruning using Tropical Characterization Rishabh Bhardwaj et.al. 2310.19232v1 null
2023-10-29 A Survey on Recent Named Entity Recognition and Relation Classification Methods with Focus on Few-Shot Learning Approaches Sakher Alqaaidi et.al. 2310.19055v1 null
2023-10-29 Bipartite Graph Pre-training for Unsupervised Extractive Summarization with Graph Convolutional Auto-Encoders Qianren Mao et.al. 2310.18992v1 link
2023-10-29 A Multimodal Ecological Civilization Pattern Recommendation Method Based on Large Language Models and Knowledge Graph Zhihang Yu et.al. 2310.18951v1 null
2023-10-29 A foundational neural operator that continuously learns without forgetting Tapas Tripura et.al. 2310.18885v1 null
2023-10-29 Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition Isaac Slaughter et.al. 2310.18877v1 link
2023-10-28 Translating away Translationese without Parallel Data Rricha Jalota et.al. 2310.18830v1 null
2023-10-28 Setting the Trap: Capturing and Defeating Backdoors in Pretrained Language Models through Honeypots Ruixiang Tang et.al. 2310.18633v1 null
2023-10-27 Maximizing Equitable Reach and Accessibility of ETDs William A. Ingram et.al. 2310.18427v1 null
2023-10-27 On General Language Understanding David Schlangen et.al. 2310.18038v1 null
2023-10-27 NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark Oscar Sainz et.al. 2310.18018v1 null
2023-10-27 SOUL: Towards Sentiment and Opinion Understanding of Language Yue Deng et.al. 2310.17924v1 link
2023-10-27 Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method Yukun Zhao et.al. 2310.17918v1 null
2023-10-27 Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey Weixu Zhang et.al. 2310.17894v1 null
2023-10-26 BERT-PIN: A BERT-based Framework for Recovering Missing Data Segments in Time-series Load Profiles Yi Hu et.al. 2310.17742v1 null
2023-10-26 Is Explanation the Cure? Misinformation Mitigation in the Short Term and Long Term Yi-Li Hsu et.al. 2310.17711v1 null
2023-10-26 Sliceformer: Make Multi-head Attention as Simple as Sorting in Discriminative Tasks Shen Yuan et.al. 2310.17683v1 link
2023-10-26 torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP Yoshitomo Matsubara et.al. 2310.17644v1 link
2023-10-26 A Survey on Transferability of Adversarial Examples across Deep Neural Networks Jindong Gu et.al. 2310.17626v1 link
2023-10-26 De-novo Chemical Reaction Generation by Means of Temporarily Convolutional Neural Networks Andrei Buin et.al. 2310.17341v1 null
2023-10-26 Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance? Ahmed Alajrami et.al. 2310.17271v1 null
2023-10-26 miditok: A Python package for MIDI file tokenization Nathan Fradet et.al. 2310.17202v1 link
2023-10-26 M2C: Towards Automatic Multimodal Manga Complement Hongcheng Guo et.al. 2310.17130v1 link
2023-10-26 A Method for Network Intrusion Detection Using Flow Sequence and BERT Framework Loc Gia Nguyen et.al. 2310.17127v1 null
2023-10-25 This Reads Like That: Deep Learning for Interpretable Natural Language Processing Claudio Fanconi et.al. 2310.17010v1 link
2023-10-25 Understanding Social Structures from Contemporary Literary Fiction using Character Interaction Graph -- Half Century Chronology of Influential Bengali Writers Nafis Irtiza Tripto et.al. 2310.16968v1 null
2023-10-25 Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks Aradhana Sinha et.al. 2310.16955v1 null
2023-10-25 From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction Nima Shoghi et.al. 2310.16802v1 link
2023-10-25 HANSEN: Human and AI Spoken Text Benchmark for Authorship Analysis Nafis Irtiza Tripto et.al. 2310.16746v1 null
2023-10-25 SkyMath: Technical Report Liu Yang et.al. 2310.16713v1 null
2023-10-25 SSLCL: An Efficient Model-Agnostic Supervised Contrastive Learning Framework for Emotion Recognition in Conversations Tao Shi et.al. 2310.16676v1 link
2023-10-25 Exploring Large Language Models for Code Explanation Paheli Bhattacharya et.al. 2310.16673v1 null
2023-10-25 WSDMS: Debunk Fake News via Weakly Supervised Detection of Misinforming Sentences with Contextualized Social Wisdom Ruichao Yang et.al. 2310.16579v1 link
2023-10-25 FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning Jaemin Shin et.al. 2310.16538v1 null
2023-10-25 OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models Mingfeng Xue et.al. 2310.16517v1 link
2023-10-25 A Comprehensive Python Library for Deep Learning-Based Event Detection in Multivariate Time Series Data and Information Retrieval in NLP Menouar Azib et.al. 2310.16485v1 link
2023-10-25 Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training Max Müller-Eberstein et.al. 2310.16484v1 null
2023-10-24 Instruct and Extract: Instruction Tuning for On-Demand Information Extraction Yizhu Jiao et.al. 2310.16040v1 link
2023-10-24 This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models Iker García-Ferrero et.al. 2310.15941v1 link
2023-10-24 Ensemble of Task-Specific Language Models for Brain Encoding Sanjai Kumaran et.al. 2310.15720v1 link
2023-10-24 CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation Minzhi Li et.al. 2310.15638v1 link
2023-10-24 Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression Jiduan Liu et.al. 2310.15594v1 null
2023-10-24 Natural Language Processing for Drug Discovery Knowledge Graphs: promises and pitfalls J. Charles G. Jeynes et.al. 2310.15572v1 null
2023-10-24 Improving Language Models Meaning Understanding and Consistency by Learning Conceptual Roles from Dictionary Myeongjun Erik Jang et.al. 2310.15541v1 null
2023-10-24 Continual Event Extraction with Semantic Confusion Rectification Zitao Wang et.al. 2310.15470v1 link
2023-10-23 Specialist or Generalist? Instruction Tuning for Specific NLP Tasks Chufan Shi et.al. 2310.15326v1 null
2023-10-23 HetGPT: Harnessing the Power of Prompt Tuning in Pre-Trained Heterogeneous Graph Neural Networks Yihong Ma et.al. 2310.15318v1 null
2023-10-23 TableQAKit: A Comprehensive and Practical Toolkit for Table-based Question Answering Fangyu Lei et.al. 2310.15075v1 null
2023-10-23 Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge Te-Lin Wu et.al. 2310.15066v1 link
2023-10-23 From Proprietary to High-Level Trigger-Action Programming Rules: A Natural Language Processing Approach Ekene Attoh et.al. 2310.15024v1 null
2023-10-23 Efficient Data Learning for Open Information Extraction with Pre-trained Language Models Zhiyuan Fan et.al. 2310.15021v1 null
2023-10-23 We are Who We Cite: Bridges of Influence Between Natural Language Processing and Other Academic Fields Jan Philip Wahle et.al. 2310.14870v1 link
2023-10-23 Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing Sai Koneru et.al. 2310.14855v1 null
2023-10-23 ULTRA-DP: Unifying Graph Pre-training with Multi-task Graph Dual Prompt Mouxiang Chen et.al. 2310.14845v1 link
2023-10-23 Generative Pre-trained Transformer for Vietnamese Community-based COVID-19 Question Answering Tam Minh Vo et.al. 2310.14602v1 null
2023-10-23 Learning to Correct Noisy Labels for Fine-Grained Entity Typing via Co-Prediction Prompt Tuning Minghao Tang et.al. 2310.14596v1 link
2023-10-23 Exploring the Boundaries of GPT-4 in Radiology Qianchu Liu et.al. 2310.14573v1 null
2023-10-20 Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations Jihyoung Jang et.al. 2310.13420v1 null
2023-10-20 Democratizing Reasoning Ability: Tailored Learning from Large Language Model Zhaoyang Wang et.al. 2310.13332v1 link
2023-10-20 Anomaly Detection of Command Shell Sessions based on DistilBERT: Unsupervised and Supervised Approaches Zefang Liu et.al. 2310.13247v1 null
2023-10-20 The GitHub Recent Bugs Dataset for Evaluating LLM-based Debugging Applications Jae Yong Lee et.al. 2310.13229v1 link
2023-10-20 The Less the Merrier? Investigating Language Representation in Multilingual Models Hellina Hailu Nigatu et.al. 2310.13228v1 null
2023-10-19 A Use Case: Reformulating Query Rewriting as a Statistical Machine Translation Problem Abdullah Can Algan et.al. 2310.13031v1 null
2023-10-19 TabuLa: Harnessing Language Models for Tabular Data Synthesis Zilong Zhao et.al. 2310.12746v1 link
2023-10-19 Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing Yue Guo et.al. 2310.12664v1 null
2023-10-19 Towards Real-World Streaming Speech Translation for Code-Switched Speech Belen Alastruey et.al. 2310.12648v1 link
2023-10-19 An Exploration of In-Context Learning for Speech Language Model Ming-Hao Hsu et.al. 2310.12477v1 null
2023-10-19 Unmasking Transformers: A Theoretical Approach to Data Recovery via Attention Weights Yichuan Deng et.al. 2310.12462v1 null
2023-10-19 Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer Qingru Zhang et.al. 2310.12442v1 null
2023-10-19 Metadata for Scientific Experiment Reporting: A Case Study in Metal-Organic Frameworks Xintong Zhao et.al. 2310.12417v1 null
2023-10-19 LoMAE: Low-level Vision Masked Autoencoders for Low-dose CT Denoising Dayang Wang et.al. 2310.12405v1 null
2023-10-18 SPEED: Speculative Pipelined Execution for Efficient Decoding Coleman Hooper et.al. 2310.12072v1 null
2023-10-19 Transformers for scientific data: a pedagogical review for astronomers Dimitrios Tanoglidis et.al. 2310.12069v2 null
2023-10-18 Evaluating the Symbol Binding Ability of Large Language Models for Multiple-Choice Questions in Vietnamese General Education Duc-Vu Nguyen et.al. 2310.12059v1 null
2023-10-18 Removing Spurious Concepts from Neural Network Representations via Joint Subspace Estimation Floris Holstege et.al. 2310.11991v1 null
2023-10-18 Towards Graph Foundation Models: A Survey and Beyond Jiawei Liu et.al. 2310.11829v1 null
2023-10-18 Telecom AI Native Systems in the Age of Generative AI -- An Engineering Perspective Ricardo Britto et.al. 2310.11770v1 null
2023-10-18 Superiority of Softmax: Unveiling the Performance Edge Over Linear Attention Yichuan Deng et.al. 2310.11685v1 null
2023-10-18 Field-testing items using artificial intelligence: Natural language processing with transformers Hotaka Maeda et.al. 2310.11655v1 null
2023-10-17 Automatic News Summerization Kavach Dheer et.al. 2310.11520v1 null
2023-10-17 Neural Attention: Enhancing QKV Calculation in Self-Attention Mechanism with Neural Networks Muhan Zhang et.al. 2310.11398v1 link
2023-10-17 Last One Standing: A Comparative Analysis of Security and Privacy of Soft Prompt Tuning, LoRA, and In-Context Learning Rui Wen et.al. 2310.11397v1 null
2023-10-17 DialogueLLM: Context and Emotion Knowledge-Tuned LLaMA Models for Emotion Recognition in Conversations Yazhou Zhang et.al. 2310.11374v1 link
2023-10-17 Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations Shiyuan Huang et.al. 2310.11207v1 null
2023-10-17 ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing Quoc-Nam Nguyen et.al. 2310.11166v1 link
2023-10-17 Unsupervised Pre-Training Using Masked Autoencoders for ECG Analysis Guoxin Wang et.al. 2310.11153v1 null
2023-10-17 The Quo Vadis of the Relationship between Language and Large Language Models Evelina Leivada et.al. 2310.11146v1 null
2023-10-17 Core Building Blocks: Next Gen Geo Spatial GPT Application Ashley Fernandez et.al. 2310.11029v1 null
2023-10-17 Enhancing Deep Neural Network Training Efficiency and Performance through Linear Prediction Hejie Ying et.al. 2310.10958v1 null
2023-10-17 Enhanced Transformer Architecture for Natural Language Processing Woohyeon Moon et.al. 2310.10930v1 null
2023-10-16 "Mistakes Help Us Grow": Facilitating and Evaluating Growth Mindset Supportive Language in Classrooms Kunal Handa et.al. 2310.10637v1 null
2023-10-16 Unifying Image Processing as Visual Prompting Question Answering Yihao Liu et.al. 2310.10513v1 null
2023-10-16 Text Summarization Using Large Language Models: A Comparative Study of MPT-7b-instruct, Falcon-7b-instruct, and OpenAI Chat-GPT Models Lochan Basyal et.al. 2310.10449v1 link
2023-10-16 Prompt Tuning for Multi-View Graph Contrastive Learning Chenghua Gong et.al. 2310.10362v1 null
2023-10-16 NLP for Crypto-Asset Regulation: A Roadmap Carolina Camassa et.al. 2310.10333v1 null
2023-10-16 VIBE: Topic-Driven Temporal Adaptation for Twitter Classification Yuji Zhang et.al. 2310.10191v1 null
2023-10-16 Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset Arthur Amalvy et.al. 2310.10118v1 link
2023-10-16 Verbosity Bias in Preference Labeling by Large Language Models Keita Saito et.al. 2310.10076v1 null
2023-10-16 EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge Tom Bryan et.al. 2310.10050v1 null
2023-10-16 Empirical Study of Zero-Shot NER with ChatGPT Tingyu Xie et.al. 2310.10035v1 link
2023-10-12 A Survey on Heterogeneous Transfer Learning Runxue Bao et.al. 2310.08459v1 link
2023-10-12 Reconstructing Materials Tetrahedron: Challenges in Materials Information Extraction Kausik Hira et.al. 2310.08383v1 link
2023-10-12 Learn From Model Beyond Fine-Tuning: A Survey Hongling Zheng et.al. 2310.08184v1 link
2023-10-12 Who Wrote it and Why? Prompting Large-Language Models for Authorship Verification Chia-Yu Hung et.al. 2310.08123v1 null
2023-10-12 ClimateNLP: Analyzing Public Sentiment Towards Climate Change Using Natural Language Processing Ajay Krishnan T. K. et.al. 2310.08099v1 null
2023-10-11 Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention Huiyin Xue et.al. 2310.07911v1 null
2023-10-11 Hierarchical Pretraining on Multimodal Electronic Health Records Xiaochen Wang et.al. 2310.07871v1 link
2023-10-11 Framework for Question-Answering in Sanskrit through Automated Construction of Knowledge Graphs Hrishikesh Terdalkar et.al. 2310.07848v1 null
2023-10-11 Does Synthetic Data Make Large Language Models More Efficient? Sia Gholami et.al. 2310.07830v1 null
2023-10-11 Antarlekhaka: A Comprehensive Tool for Multi-task Natural Language Annotation Hrishikesh Terdalkar et.al. 2310.07826v1 link
2023-10-11 To Build Our Future, We Must Know Our Past: Contextualizing Paradigm Shifts in Natural Language Processing Sireesh Gururaja et.al. 2310.07715v1 null
2023-10-11 Composite Backdoor Attacks Against Large Language Models Hai Huang et.al. 2310.07676v1 link
2023-10-11 The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values Hannah Rose Kirk et.al. 2310.07629v1 null
2023-10-11 PHYDI: Initializing Parameterized Hypercomplex Neural Networks as Identity Functions Matteo Mancanelli et.al. 2310.07612v1 link
2023-10-11 Energy Estimates Across Layers of Computing: From Devices to Large-Scale Applications in Machine Learning for Natural Language Processing, Scientific Computing, and Cryptocurrency Mining Sadasivan Shankar et.al. 2310.07516v1 null
2023-10-11 KwaiYiiMath: Technical Report Jiayi Fu et.al. 2310.07488v1 null
2023-10-11 uxSense: Supporting User Experience Analysis with Visualization and Computer Vision Andrea Batch et.al. 2310.07300v1 link
2023-10-12 An Analysis on Large Language Models in Healthcare: A Case Study of BioBERT Shyni Sharaf et.al. 2310.07282v2 null
2023-10-11 BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations Qizhi Pei et.al. 2310.07276v1 link
2023-10-11 A Comparative Study of Pre-trained CNNs and GRU-Based Attention for Image Caption Generation Rashid Khan et.al. 2310.07252v1 null
2023-10-10 Topic-DPR: Topic-based Prompts for Dense Passage Retrieval Qingfa Xiao et.al. 2310.06626v1 null
2023-10-10 FTFT: efficient and robust Fine-Tuning by transFerring Training dynamics Yupei Du et.al. 2310.06588v1 link
2023-10-10 Watt For What: Rethinking Deep Learning's Energy-Performance Relationship Shreyank N Gowda et.al. 2310.06522v1 null
2023-10-10 Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling Task Guanting Dong et.al. 2310.06504v1 link
2023-10-10 Evolution of Natural Language Processing Technology: Not Just Language Processing Towards General Purpose AI Masahiro Yamamoto et.al. 2310.06228v1 null
2023-10-09 From Text to Knowledge with Graphs: modelling, querying and exploiting textual content Genoveva Vargas-Solar et.al. 2310.06122v1 null
2023-10-09 Improving Summarization with Human Edits Zonghai Yao et.al. 2310.05857v1 link
2023-10-10 Are Large Language Models Post Hoc Explainers? Nicholas Kroeger et.al. 2310.05797v2 link
2023-10-09 Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions Lucie-Aimée Kaffee et.al. 2310.05779v1 link
2023-10-09 Larth: Dataset and Machine Translation for Etruscan Gianluca Vico et.al. 2310.05688v1 link
2023-10-09 ViTs are Everywhere: A Comprehensive Study Showcasing Vision Transformers in Different Domain Md Sohag Mia et.al. 2310.05664v1 null
2023-10-09 Regulation and NLP (RegNLP): Taming Large Language Models Catalina Goanta et.al. 2310.05553v1 null
2023-10-09 Generative Judge for Evaluating Alignment Junlong Li et.al. 2310.05470v1 link
2023-10-09 Establishing Trustworthiness: Rethinking Tasks and Model Evaluation Robert Litschko et.al. 2310.05442v1 null
2023-10-09 Resolving the Imbalance Issue in Hierarchical Disciplinary Topic Inference via LLM-based Data Augmentation Xunxin Cai et.al. 2310.05318v1 null
2023-10-09 Enhancing Long-form Text Generation in Mental Health\ with Task-adaptive Tokenization Siyang Liu et.al. 2310.05317v1 link
2023-10-06 Multi-Industry Simplex : A Probabilistic Extension of GICS Maksim Papenkov et.al. 2310.04280v1 null
2023-10-06 Analysis of the Reasoning with Redundant Information Provided Ability of Large Language Models Wenbei Xie et.al. 2310.04039v1 null
2023-10-06 Quantized Transformer Language Model Implementations on Edge Devices Mohammad Wali Ur Rahman et.al. 2310.03971v1 null
2023-10-05 Multitask Learning for Time Series Data\with 2D Convolution Chin-Chia Michael Yeh et.al. 2310.03925v1 null
2023-10-05 The Anatomy of Deception: Technical and Human Perspectives on a Large-scale Phishing Campaign Anargyros Chrysanthou et.al. 2310.03498v1 null
2023-10-05 Procedural Text Mining with Large Language Models Anisa Rula et.al. 2310.03376v1 link
2023-10-05 A Formalism and Approach for Improving Robustness of Large Language Models Using Risk-Adjusted Confidence Scores Ke Shen et.al. 2310.03283v1 null
2023-10-05 InstructProtein: Aligning Human and Protein Language via Knowledge Instruction Zeyuan Wang et.al. 2310.03269v1 null
2023-10-05 Sparse Deep Learning for Time Series Data: Theory and Applications Mingxuan Zhang et.al. 2310.03243v1 null
2023-10-05 Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs Yijia Xiao et.al. 2310.03221v1 link
2023-10-04 Neural architecture impact on identifying temporally extended Reinforcement Learning tasks Victor Vadakechirayath George et.al. 2310.03161v1 null
2023-10-04 Federated Fine-Tuning of LLMs on the Very Edge: The Good, the Bad, the Ugly Herbert Woisetschläger et.al. 2310.03150v1 null
2023-10-04 MetaTool Benchmark: Deciding Whether to Use Tools and Which to Use Yue Huang et.al. 2310.03128v1 link
2023-10-04 Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making Jeonghye Kim et.al. 2310.03022v1 null
2023-10-04 DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning Jiong Xiong et.al. 2310.02954v1 link
2023-10-04 Low Resource Summarization using Pre-trained Language Models Mubashir Munaf et.al. 2310.02790v1 null
2023-10-04 SALSA: Semantically-Aware Latent Space Autoencoder Kathryn E. Kirchoff et.al. 2310.02744v1 null
2023-10-04 AGIR: Automating Cyber Threat Intelligence Reporting with Natural Language Generation Filippo Perrina et.al. 2310.02655v1 link
2023-10-03 Backdoor Adjustment of Confounding by Provenance for Robust Text Classification of Multi-institutional Clinical Notes Xiruo Ding et.al. 2310.02451v1 null
2023-10-03 A method to assess trustworthiness of machine coding at scale Rebeckah K. Fussell et.al. 2310.02335v1 null
2023-10-03 MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens Kaizhi Zheng et.al. 2310.02239v1 link
2023-10-03 Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View Jintian Zhang et.al. 2310.02124v1 link
2023-10-03 dFlow: A Domain Specific Language for the Rapid Development of open-source Virtual Assistants Nikolaos Malamas et.al. 2310.02102v1 null
2023-10-03 Jury: A Comprehensive Evaluation Toolkit Devrim Cavusoglu et.al. 2310.02040v1 link
2023-10-03 Hierarchical Evaluation Framework: Best Practices for Human Evaluation Iva Bojic et.al. 2310.01917v1 null
2023-10-03 Effective and Parameter-Efficient Reusing Fine-Tuned Models Weisen Jiang et.al. 2310.01886v1 null
2023-10-03 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models Ming Jin et.al. 2310.01728v1 link
2023-10-02 Transformers are efficient hierarchical chemical graph learners Zihan Pengmei et.al. 2310.01704v1 link
2023-10-02 Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language Models Zijun Wu et.al. 2310.01691v1 link
2023-10-02 A Review of Digital Learning Environments for Teaching Natural Language Processing in K-12 Education Xiaoyi Tian et.al. 2310.01603v1 null
2023-09-29 A Large Language Model Approach to Educational Survey Feedback Analysis Michael J. Parker et.al. 2309.17447v1 null
2023-09-29 Network Memory Footprint Compression Through Jointly Learnable Codebooks and Mappings Edouard Yvinec et.al. 2309.17361v1 null
2023-09-29 Overview of the BioLaySumm 2023 Shared Task on Lay Summarization of Biomedical Research Articles Tomsa Goldsack et.al. 2309.17332v1 null
2023-09-29 Benchmarking the Abilities of Large Language Models for RDF Knowledge Graph Creation and Comprehension: How Well Do LLMs Speak Turtle? Johannes Frey et.al. 2309.17122v1 link
2023-09-29 Interpretable Long-Form Legal Question Answering with Retrieval-Augmented Large Language Models Antoine Louis et.al. 2309.17050v1 link
2023-09-28 DeBERTinha: A Multistep Approach to Adapt DebertaV3 XSmall for Brazilian Portuguese Natural Language Processing Task Israel Campiotti et.al. 2309.16844v1 null
2023-09-28 How many words does ChatGPT know? The answer is ChatWords Gonzalo Martínez et.al. 2309.16777v1 link
2023-09-28 Neural scaling laws for phenotypic drug discovery Drew Linsley et.al. 2309.16773v1 null
2023-09-28 Qwen Technical Report Jinze Bai et.al. 2309.16609v1 link
2023-09-28 Augmenting LLMs with Knowledge: A survey on hallucination prevention Konstantinos Andriopoulos et.al. 2309.16459v1 null
2023-09-28 A Comprehensive Survey of Document-level Relation Extraction (2016-2022) Julien Delaunay et.al. 2309.16396v1 null
2023-09-27 ChatGPT-BCI: Word-Level Neural State Classification Using GPT, EEG, and Eye-Tracking Biomarkers in Semantic Inference Reading Comprehension Yuhong Zhang et.al. 2309.15714v1 null
2023-09-27 NLPBench: Evaluating Large Language Models on Solving NLP Problems Linxin Song et.al. 2309.15630v1 link
2023-09-27 Tackling VQA with Pretrained Foundation Models without Further Training Alvin De Jun Tan et.al. 2309.15487v1 null
2023-09-27 A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future Zheng Chu et.al. 2309.15402v1 link
2023-09-26 VPA: Fully Test-Time Visual Prompt Adaptation Jiachen Sun et.al. 2309.15251v1 null
2023-09-26 Eve Said Yes: AirBone Authentication for Head-Wearable Smart Voice Assistant Chenpei Huang et.al. 2309.15203v1 null
2023-09-26 The Role of Document Embedding in Research Paper Recommender Systems: To Breakdown or to Bolster Disciplinary Borders? Eoghan Cunningham et.al. 2309.14984v1 null
2023-09-27 Text-to-Image Generation for Abstract Concepts Jiayi Liao et.al. 2309.14623v2 null
2023-09-26 Confidence Intervals for the F1 Score: A Comparison of Four Methods Kevin Fu Yuan Lam et.al. 2309.14621v1 null
2023-09-25 When Automated Assessment Meets Automated Content Generation: Examining Text Quality in the Era of GPTs Marialena Bevilacqua et.al. 2309.14488v1 link
2023-09-25 Urdu Poetry Generated by Using Deep Learning Techniques Muhammad Shoaib Farooq et.al. 2309.14233v1 null
2023-09-25 Comprehensive Overview of Named Entity Recognition: Models, Domain-Specific Applications and Challenges Kalyani Pakhale et.al. 2309.14084v1 null
2023-09-25 Graph Representation Learning Towards Patents Network Analysis Mohammad Heydari et.al. 2309.13888v1 null
2023-09-24 Text Classification: A Perspective of Deep Learning Methods Zhongwei Wan et.al. 2309.13761v1 null
2023-09-24 Arabic Sentiment Analysis with Noisy Deep Explainable Model Md. Atabuzzaman et.al. 2309.13731v1 null
2023-09-24 Skill Check: Some Considerations on the Evaluation of Gamemastering Models for Role-playing Games Santiago Góngora et.al. 2309.13702v1 link
2023-09-24 Accelerating Large Batch Training via Gradient Signal to Noise Ratio (GSNR) Guo-qing Jiang et.al. 2309.13681v1 null
2023-09-23 Spanish Resource Grammar version 2023 Olga Zamaraeva et.al. 2309.13318v1 null
2023-09-23 Natural Language Processing for Requirements Formalization: How to Derive New Approaches? Viju Sudhi et.al. 2309.13272v1 link
2023-09-23 A Survey of Document-Level Information Extraction Hanwen Zheng et.al. 2309.13249v1 null
2023-09-22 Decoding Affect in Dyadic Conversations: Leveraging Semantic Similarity through Sentence Embedding Chen-Wei Yu et.al. 2309.12646v1 null
2023-09-22 Construction contract risk identification based on knowledge-augmented language model Saika Wong et.al. 2309.12626v1 null
2023-09-21 Understanding the language of molecules: Predicting pure component parameters for the PC-SAFT equation of state from SMILES Benedikt Winter et.al. 2309.12404v1 null
2023-09-21 Improving VTE Identification through Adaptive NLP Model Selection and Clinical Expert Rule-based Classifier from Radiology Reports Jamie Deng et.al. 2309.12273v1 null
2023-09-22 Rethinking the Evaluating Framework for Natural Language Understanding in AI Systems: Language Acquisition as a Core for Future Metrics Patricio Vera et.al. 2309.11981v2 null
2023-09-21 Stock Market Sentiment Classification and Backtesting via Fine-tuned BERT Jiashu Lou et.al. 2309.11979v1 null
2023-09-20 Transformers versus LSTMs for electronic trading Paul Bilokon et.al. 2309.11400v1 link
2023-09-20 Studying Lobby Influence in the European Parliament Aswin Suresh et.al. 2309.11381v1 null
2023-09-20 When to Trust AI: Advances and Challenges for Certification of Neural Networks Marta Kwiatkowska et.al. 2309.11196v1 null
2023-09-20 Prototype of a robotic system to assist the learning process of English language with text-generation through DNN Carlos Morales-Torres et.al. 2309.11142v1 null
2023-09-20 Language-Oriented Communication with Semantic Coding and Knowledge Distillation for Text-to-Image Generation Hyelin Nam et.al. 2309.11127v1 null
2023-09-20 AttentionMix: Data augmentation method that relies on BERT attention mechanism Dominik Lewy et.al. 2309.11104v1 null
2023-09-21 fakenewsbr: A Fake News Detection Platform for Brazilian Portuguese Luiz Giordani et.al. 2309.11052v2 null
2023-09-20 Making Small Language Models Better Multi-task Learners with Mixture-of-Task-Adapters Yukang Xie et.al. 2309.11042v1 null
2023-09-19 LMDX: Language Model-based Document Information Extraction and Localization Vincent Perot et.al. 2309.10952v1 null
2023-09-19 Artificial Intelligence-Enabled Intelligent Assistant for Personalized and Adaptive Learning in Higher Education Ramteja Sajja et.al. 2309.10892v1 null
2023-09-19 FRASIMED: a Clinical French Annotated Resource Produced through Crosslingual BERT-Based Annotation Projection Jamil Zaghir et.al. 2309.10770v1 null
2023-09-19 OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch Juntao Li et.al. 2309.10706v1 link
2023-09-19 NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages Samuel Cahyawijaya et.al. 2309.10661v1 link
2023-09-19 CFGPT: Chinese Financial Assistant with Large Language Model Jiangtong Li et.al. 2309.10654v1 link
2023-09-19 FRACAS: A FRench Annotated Corpus of Attribution relations in newS Ange Richard et.al. 2309.10604v1 null
2023-09-19 Mixed-Distil-BERT: Code-mixed Language Modeling for Bangla, English, and Hindi Md Nishat Raihan et.al. 2309.10272v1 null
2023-09-18 Stabilizing RLHF through Advantage Model and Selective Rehearsal Baolin Peng et.al. 2309.10202v1 null
2023-09-18 Automated Interviewer or Augmented Survey? Collecting Social Data with Large Language Models Alejandro Cuevas Villalba et.al. 2309.10187v1 link
2023-09-19 Watch the Speakers: A Hybrid Continuous Attribution Network for Emotion Recognition in Conversation With Emotion Disentanglement Shanglin Lei et.al. 2309.09799v2 null
2023-09-18 FedLALR: Client-Specific Adaptive Learning Rates Achieve Linear Speedup for Non-IID Data Hao Sun et.al. 2309.09719v1 null
2023-09-18 Do learned speech symbols follow Zipf's law? Shinnosuke Takamichi et.al. 2309.09690v1 null
2023-09-18 FactoFormer: Factorized Hyperspectral Transformers with Self-Supervised Pre-Training Shaheer Mohamed et.al. 2309.09431v1 link
2023-09-17 CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages Thuat Nguyen et.al. 2309.09400v1 null
2023-09-17 OWL: A Large Language Model for IT Operations Hongcheng Guo et.al. 2309.09298v1 null
2023-09-16 Constructing a Knowledge Graph for Vietnamese Legal Cases with Heterogeneous Graphs Thi-Hai-Yen Vuong et.al. 2309.09069v1 null
2023-09-16 Context-aware Adversarial Attack on Named Entity Recognition Shuguang Chen et.al. 2309.08999v1 null
2023-09-16 RMP: A Random Mask Pretrain Framework for Motion Prediction Yi Yang et.al. 2309.08989v1 link
2023-09-16 Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT) Parsa Kavehzadeh et.al. 2309.08968v1 null
2023-09-15 VulnSense: Efficient Vulnerability Detection in Ethereum Smart Contracts by Multimodal Learning with Graph Neural Network and Language Model Phan The Duy et.al. 2309.08474v1 null
2023-09-15 Understanding the limitations of self-supervised learning for tabular anomaly detection Kimberly T. Mai et.al. 2309.08374v1 null
2023-09-15 Exploring the Potential of ChatGPT in Automated Code Refinement: An Empirical Study Qi Guo et.al. 2309.08221v1 null
2023-09-14 An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing Sonish Sivarajkumar et.al. 2309.08008v1 null
2023-09-14 A Multi-In and Multi-Out Dendritic Neuron Model and its Optimization Yu Ding et.al. 2309.07791v1 null
2023-09-14 Complexity Scaling for Speech Denoising Hangting Chen et.al. 2309.07757v1 null
2023-09-14 Generative AI Text Classification using Ensemble LLM Approaches Harika Abburi et.al. 2309.07755v1 null
2023-09-14 NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation Jiaqi Zhang et.al. 2309.07705v1 link
2023-09-14 Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? Rishav Hada et.al. 2309.07462v1 null
2023-09-14 SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects David Ifeoluwa Adelani et.al. 2309.07445v1 link
2023-09-14 Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts Dave Van Veen et.al. 2309.07430v1 link
2023-09-14 Multi-Grade Deep Learning for Partial Differential Equations with Applications to the Burgers Equation Yuesheng Xu et.al. 2309.07401v1 null
2023-09-14 Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS Yifan Yang et.al. 2309.07377v1 link
2023-09-13 Traveling Words: A Geometric Interpretation of Transformers Raul Molina et.al. 2309.07315v1 link
2023-09-13 Beyond original Research Articles Categorization via NLP Rosanna Turrisi et.al. 2309.07020v1 link
2023-09-13 Comparative Analysis of Contextual Relation Extraction based on Deep Learning Models R. Priyadharshini et.al. 2309.06814v1 null
2023-09-13 Electricity Demand Forecasting through Natural Language Processing with Long Short-Term Memory Networks Yun Bai et.al. 2309.06793v1 null
2023-09-13 Bias Amplification Enhances Minority Group Performance Gaotang Li et.al. 2309.06717v1 link
2023-09-13 Simultaneous Machine Translation with Large Language Models Minghan Wang et.al. 2309.06706v1 null
2023-09-12 Narrowing the Gap between Supervised and Unsupervised Sentence Representation Learning with Large Language Model Mingxin Li et.al. 2309.06453v1 link
2023-09-12 Grounded Language Acquisition From Object and Action Imagery James Robert Kubricht et.al. 2309.06335v1 null
2023-09-12 Improving and Evaluating the Detection of Fragmentation in News Recommendations with the Clustering of News Story Chains Alessandra Polimeno et.al. 2309.06192v1 null
2023-09-13 Backdoor Attacks and Countermeasures in Natural Language Processing Models: A Comprehensive Security Review Pengzhou Cheng et.al. 2309.06055v2 null
2023-09-11 Black-Box Analysis: GPTs Across Time in Legal Textual Entailment Task Ha-Thanh Nguyen et.al. 2309.05501v1 null
2023-09-11 NeCo@ALQAC 2023: Legal Domain Knowledge Acquisition for Low-Resource Languages through Data Enrichment Hai-Long Nguyen et.al. 2309.05500v1 null
2023-09-11 LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech Titouan Parcollet et.al. 2309.05472v1 null
2023-09-11 Improving Information Extraction on Business Documents with Specific Pre-Training Tasks Thibault Douzon et.al. 2309.05429v1 link
2023-09-11 Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach Tae Jin Park et.al. 2309.05248v1 null
2023-09-11 DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning Zhengxiang Shi et.al. 2309.05173v1 link
2023-09-10 What's Hard in English RST Parsing? Predictive Models for Error Analysis Yang Janet Liu et.al. 2309.04940v1 link
2023-09-10 Unsupervised Chunking with Hierarchical RNN Zijun Wu et.al. 2309.04919v1 link
2023-09-09 Distributional Data Augmentation Methods for Low Resource Language Mosleh Mahamud et.al. 2309.04862v1 link
2023-09-09 Leveraging Large Language Models for Exploiting ASR Uncertainty Pranay Dighe et.al. 2309.04842v1 null
2023-09-08 Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning David Yunis et.al. 2309.04459v1 null
2023-09-08 Active Learning for Classifying 2D Grid-Based Level Completability Mahsa Bazzaz et.al. 2309.04367v1 link
2023-09-08 Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts Erik Daxberger et.al. 2309.04354v1 null
2023-09-08 Fuzzy Fingerprinting Transformer Language-Models for Emotion Recognition in Conversations Patrícia Pereira et.al. 2309.04292v1 null
2023-09-08 LLMCad: Fast and Scalable On-device Large Language Model Inference Daliang Xu et.al. 2309.04255v1 null
2023-09-08 Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese Haochun Wang et.al. 2309.04175v1 null
2023-09-07 Conformal Autoregressive Generation: Beam Search with Coverage Guarantees Nicolas Deutschmann et.al. 2309.03797v1 null
2023-09-07 USA: Universal Sentiment Analysis Model & Construction of Japanese Sentiment Text Classification and Part of Speech Dataset Chengguang Gan et.al. 2309.03787v1 link
2023-09-07 Machine Learning for Tangible Effects: Natural Language Processing for Uncovering the Illicit Massage Industry & Computer Vision for Tactile Sensing Rui Ouyang et.al. 2309.03470v1 null
2023-09-06 J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News Tharindu Kumarage et.al. 2309.03164v1 link
2023-09-06 Leave no Place Behind: Improved Geolocation in Humanitarian Documents Enrico M. Belliardo et.al. 2309.02914v1 null
2023-09-06 ViCGCN: Graph Convolutional Network with Contextualized Language Models for Social Media Mining in Vietnamese Chau-Thang Phan et.al. 2309.02902v1 link
2023-09-07 Aligning Large Language Models for Clinical Tasks Supun Manathunga et.al. 2309.02884v2 link
2023-09-05 A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges Maryam Zare et.al. 2309.02473v1 null
2023-09-05 Sample Size in Natural Language Processing within Healthcare Research Jaya Chaturvedi et.al. 2309.02237v1 null
2023-09-05 Incorporating Dictionaries into a Neural Network Architecture to Extract COVID-19 Medical Concepts From Social Media Abul Hasan et.al. 2309.02188v1 null
2023-09-05 Bridging Emotion Role Labeling and Appraisal-based Emotion Analysis Roman Klinger et.al. 2309.02092v1 null
2023-09-05 Enhance Multi-domain Sentiment Analysis of Review Texts through Prompting Strategies Yajing Wang et.al. 2309.02045v1 null
2023-09-05 Bilevel Scheduled Sampling for Dialogue Generation Jiawen Liu et.al. 2309.01953v1 null
2023-09-04 Into the Single Cell Multiverse: an End-to-End Dataset for Procedural Knowledge Extraction in Biomedical Texts Ruth Dannenfelser et.al. 2309.01812v1 link
2023-09-04 Prompting or Fine-tuning? A Comparative Study of Large Language Models for Taxonomy Construction Boqi Chen et.al. 2309.01715v1 link
2023-09-04 ChatRule: Mining Logical Rules with Large Language Models for Knowledge Graph Reasoning Linhao Luo et.al. 2309.01538v1 link
2023-09-03 A Visual Interpretation-Based Self-Improved Classification System Using Virtual Adversarial Training Shuai Jiang et.al. 2309.01196v1 null
2023-09-03 Large Language Models for Generative Recommendation: A Survey and Visionary Discussions Lei Li et.al. 2309.01157v1 null
2023-09-01 When Do Discourse Markers Affect Computational Sentence Understanding? Ruiqi Li et.al. 2309.00368v1 null
2023-09-01 Comparative Topic Modeling for Determinants of Divergent Report Results Applied to Macular Degeneration Studies Lucas Cassiel Jacaruso et.al. 2309.00312v1 null
2023-09-01 FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Automated Fact-Checking Tsun-Hin Cheung et.al. 2309.00240v1 null
2023-09-01 ALJP: An Arabic Legal Judgment Prediction in Personal Status Cases Using Machine Learning Models Salwa Abbara et.al. 2309.00238v1 null
2023-08-31 Predicting Financial Market Trends using Time Series Analysis and Natural Language Processing Ali Asgarov et.al. 2309.00136v1 null
2023-08-31 PointLLM: Empowering Large Language Models to Understand Point Clouds Runsen Xu et.al. 2308.16911v1 link
2023-08-31 Using Large Language Models to Automate Category and Trend Analysis of Scientific Articles: An Application in Ophthalmology Hina Raja et.al. 2308.16688v1 null
2023-08-31 High Accuracy Location Information Extraction from Social Network Texts Using Natural Language Processing Lossan Bonde et.al. 2308.16615v1 null
2023-08-31 Link Prediction for Wikipedia Articles as a Natural Language Inference Task Chau-Thang Phan et.al. 2308.16469v1 link
2023-08-30 Debunking Disinformation: Revolutionizing Truth with NLP in Fake News Detection Li He et.al. 2308.16328v1 null
2023-08-30 Materials Informatics Transformer: A Language Model for Interpretable Materials Properties Prediction Hongshuo Huang et.al. 2308.16259v1 link
2023-08-30 Automatic assessment of text-based responses in post-secondary education: A systematic review Rujun Gao et.al. 2308.16151v1 null
2023-08-30 Conti Inc.: Understanding the Internal Discussions of a large Ransomware-as-a-Service Operator with Machine Learning Estelle Ruellan et.al. 2308.16061v1 null
2023-08-30 DTrOCR: Decoder-only Transformer for Optical Character Recognition Masato Fujitake et.al. 2308.15996v1 null
2023-08-30 AI-powered Fraud Detection in Decentralized Finance: A Project Life Cycle Perspective Bingqiao Luo et.al. 2308.15992v1 null
2023-08-30 WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model Tianyu Wang et.al. 2308.15962v1 null
2023-08-30 Benchmarking Multilabel Topic Classification in the Kyrgyz Language Anton Alekseev et.al. 2308.15952v1 link
2023-08-30 The Janus System: Multi-paradigm Programming in Prolog and Python Theresa Swift et.al. 2308.15893v1 null
2023-08-30 HAlf-MAsked Model for Named Entity Sentiment analysis Anton Kabaev et.al. 2308.15793v1 null
2023-08-29 Document AI: A Comparative Study of Transformer-Based, Graph-Based Models, and Convolutional Neural Networks For Document Layout Analysis Sotirios Kastanas et.al. 2308.15517v1 link
2023-08-29 Vulgar Remarks Detection in Chittagonian Dialect of Bangla Tanjim Mahmud et.al. 2308.15448v1 null
2023-08-29 Historical patterns of rice farming explain modern-day language use in China and Japan more than modernization and urbanization Sharath Chandra Guntuku et.al. 2308.15352v1 null
2023-08-29 A Framework for Responsible Development of Automated Student Feedback with Generative AI Euan D Lindsay et.al. 2308.15334v1 null
2023-08-29 CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs Hiroyuki Ootomo et.al. 2308.15136v1 link
2023-08-29 Large Language Models on the Chessboard: A Study on ChatGPT's Formal Language Comprehension and Complex Reasoning Skills Mu-Tien Kuo et.al. 2308.15118v1 null
2023-08-29 Serving MoE Models on Resource-constrained Edge Devices via Dynamic Expert Swapping Rui Kong et.al. 2308.15030v1 null
2023-08-29 TransPrompt v2: A Transferable Prompting Framework for Cross-task Text Classification Jianing Wang et.al. 2308.15010v1 null
2023-08-29 CEFHRI: A Communication Efficient Federated Learning Framework for Recognizing Industrial Human-Robot Interaction Umar Khalid et.al. 2308.14965v1 link
2023-08-28 Diversified Ensemble of Independent Sub-Networks for Robust Self-Supervised Representation Learning Amirhossein Vahidi et.al. 2308.14705v1 null
2023-08-28 ANER: Arabic and Arabizi Named Entity Recognition using Transformer-Based Approach Abdelrahman "Boda" Sadallah et.al. 2308.14669v1 null
2023-08-28 Large Graph Models: A Perspective Ziwei Zhang et.al. 2308.14522v1 link
2023-08-28 Biomedical Entity Linking with Triple-aware Pre-Training Xi Yan et.al. 2308.14429v1 null
2023-08-28 Rethinking Mobile AI Ecosystem in the LLM Era Jinliang Yuan et.al. 2308.14363v1 link
2023-08-28 Can Transformer and GNN Help Each Other? Peiyan Zhang et.al. 2308.14355v1 null
2023-08-28 FonMTL: Towards Multitask Learning for the Fon Language Bonaventure F. P. Dossou et.al. 2308.14280v1 link
2023-08-28 Goodhart's Law Applies to NLP's Explanation Benchmarks Jennifer Hsia et.al. 2308.14272v1 null
2023-08-27 Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models Kaiyuan Gao et.al. 2308.14149v1 link
2023-08-27 Detecting Language Model Attacks with Perplexity Gabriel Alon et.al. 2308.14132v1 null
2023-08-25 ChatGPT as Data Augmentation for Compositional Generalization: A Case Study in Open Intent Detection Yihao Fang et.al. 2308.13517v1 link
2023-08-25 Ngambay-French Neural Machine Translation (sba-Fr) Sakayo Toadoum Sari et.al. 2308.13497v1 link
2023-08-25 Leveraging Knowledge and Reinforcement Learning for Enhanced Reliability of Language Models Nancy Tyagi et.al. 2308.13467v1 null
2023-08-25 ARTIST: ARTificial Intelligence for Simplified Text Lorenzo Corti et.al. 2308.13458v1 link
2023-08-25 QKSAN: A Quantum Kernel Self-Attention Network Ren-Xin Zhao et.al. 2308.13422v1 null
2023-08-25 In-context learning for model-free system identification Marco Forgione et.al. 2308.13380v1 link
2023-08-25 Construction Grammar and Language Models Harish Tayyar Madabushi et.al. 2308.13315v1 null
2023-08-25 LLM2KB: Constructing Knowledge Bases using instruction tuned context aware Large Language Models Anmol Nayak et.al. 2308.13207v1 link
2023-08-25 Chunk, Align, Select: A Simple Long-sequence Processing Method for Transformers Jiawen Xie et.al. 2308.13191v1 null
2023-08-25 OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models Wenqi Shao et.al. 2308.13137v1 link
2023-08-24 Text Similarity from Image Contents using Statistical and Semantic Analysis Techniques Sagar Kulkarni et.al. 2308.12842v1 null
2023-08-24 Sparks of Large Audio Models: A Survey and Outlook Siddique Latif et.al. 2308.12792v1 link
2023-08-24 Pre-training Code Representation with Semantic Flow Graph for Effective Bug Localization Yali Du et.al. 2308.12773v1 link
2023-08-23 Simple is Better and Large is Not Enough: Towards Ensembling of Foundational Language Models Nancy Tyagi et.al. 2308.12272v1 null
2023-08-23 Curriculum Learning with Adam: The Devil Is in the Wrong Details Lucas Weber et.al. 2308.12202v1 null
2023-08-23 Out of the Cage: How Stochastic Parrots Win in Cyber Security Environments Maria Rigaki et.al. 2308.12086v1 link
2023-08-23 Bridging the Gap: Deciphering Tabular Data Using Large Language Model Hengyuan Zhang et.al. 2308.11891v1 null
2023-08-22 Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model Yuezhou Zhang et.al. 2308.11773v1 null
2023-08-24 Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models Mohamed Elaraby et.al. 2308.11764v2 link
2023-08-22 Uncertainty Estimation of Transformers' Predictions via Topological Analysis of the Attention Matrices Elizaveta Kostenok et.al. 2308.11295v1 null
2023-08-22 The Software Heritage License Dataset (2022 Edition) Jesús M. González-Barahona et.al. 2308.11258v1 null
2023-08-22 ConcatPlexer: Additional Dim1 Batching for Faster ViTs Donghoon Han et.al. 2308.11199v1 null
2023-08-22 ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation Jianghao Lin et.al. 2308.11131v1 link
2023-08-21 Unlocking Hardware Security Assurance: The Potential of LLMs Xingyu Meng et.al. 2308.11042v1 null
2023-08-21 Practical Parallel Algorithms for Non-Monotone Submodular Maximization Shuang Cui et.al. 2308.10656v1 null
2023-08-21 Exploring Equation as a Better Intermediate Meaning Representation for Numerical Reasoning Dingzirui Wang et.al. 2308.10585v1 link
2023-08-22 An Effective Method using Phrase Mechanism in Neural Machine Translation Phuong Minh Nguyen et.al. 2308.10482v2 link
2023-08-21 Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP Concepts Fan Gao et.al. 2308.10410v1 link
2023-08-20 How Good Are Large Language Models at Out-of-Distribution Detection? Bo Liu et.al. 2308.10261v1 link
2023-08-20 ChatEDA: A Large Language Model Powered Autonomous Agent for EDA Zhuolun He et.al. 2308.10204v1 null
2023-08-19 Deep Generative Modeling-based Data Augmentation with Demonstration using the BFBT Benchmark Void Fraction Datasets Farah Alsafadi et.al. 2308.10120v1 null
2023-08-19 FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models Liwen Zhang et.al. 2308.09975v1 link
2023-08-19 A Transformer-based Framework For Multi-variate Time Series: A Remaining Useful Life Prediction Use Case Oluwaseyi Ogunfowora et.al. 2308.09884v1 null
2023-08-19 Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders Jie Cheng et.al. 2308.09882v1 link
2023-08-18 WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct Haipeng Luo et.al. 2308.09583v1 link
2023-08-18 Learnt Contrastive Concept Embeddings for Sign Recognition Ryan Wong et.al. 2308.09515v1 null
2023-08-18 Exploring Sampling Techniques for Generating Melodies with a Transformer Language Model Mathias Rose Bjare et.al. 2308.09454v1 null
2023-08-18 Differentiable Retrieval Augmentation via Generative Language Modeling for E-commerce Query Intent Classification Chenyu Zhao et.al. 2308.09308v1 null
2023-08-17 Characterizing Information Seeking Events in Health-Related Social Discourse Omar Sharif et.al. 2308.09156v1 null
2023-08-17 Enhancing API Documentation through BERTopic Modeling and Summarization AmirHossein Naghshzan et.al. 2308.09070v1 link
2023-08-17 Don't lose the message while paraphrasing: A study on content preserving style transfer Nikolay Babakov et.al. 2308.09055v1 link
2023-08-17 CodeCoT and Beyond: Learning to Program and Test like a Developer Dong Huang et.al. 2308.08784v1 null
2023-08-17 Real-Time Construction Algorithm of Co-Occurrence Network Based on Inverted Index Jiahao Cheng et.al. 2308.08756v1 null
2023-08-17 Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase Extraction Yuanzhen Luo et.al. 2308.08739v1 null
2023-08-16 Can Transformers Learn Optimal Filtering for Unknown Systems? Haldun Balim et.al. 2308.08536v1 link
2023-08-16 LLM4TS: Two-Stage Fine-Tuning for Time-Series Forecasting with Pre-Trained LLMs Ching Chang et.al. 2308.08469v1 null
2023-08-16 Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey Lovre Torbarina et.al. 2308.08234v1 null
2023-08-16 Fast Training of NMT Model with Data Sorting Daniela N. Rim et.al. 2308.08153v1 null
2023-08-15 Using Artificial Populations to Study Psychological Phenomena in Neural Models Jesse Roberts et.al. 2308.08032v1 link
2023-08-15 Through the Lens of Core Competency: Survey on Evaluation of Large Language Models Ziyu Zhuang et.al. 2308.07902v1 null
2023-08-15 Emotion Embeddings $\unicode{x2014}$ Learning Stable and Homogeneous Abstractions from Heterogeneous Affective Datasets Sven Buechel et.al. 2308.07871v1 null
2023-08-15 Attention Is Not All You Need Anymore Zhe Chen et.al. 2308.07661v1 null
2023-08-15 A Survey on Model Compression for Large Language Models Xunyu Zhu et.al. 2308.07633v1 null
2023-08-15 A User-Centered Evaluation of Spanish Text Simplification Adrian de Wynter et.al. 2308.07556v1 link
2023-08-14 Cross-Attribute Matrix Factorization Model with Shared User Embedding Wen Liang et.al. 2308.07284v1 null
2023-08-14 Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt Optimization for Few-shot Learning Chengzhengxu Li et.al. 2308.07272v1 link
2023-08-14 Human-centered NLP Fact-checking: Co-Designing with Fact-checkers using Matchmaking for AI Houjiang Liu et.al. 2308.07213v1 null
2023-08-14 Natural Language is All a Graph Needs Ruosong Ye et.al. 2308.07134v1 link
2023-08-15 Large Language Models for Information Retrieval: A Survey Yutao Zhu et.al. 2308.07107v2 link
2023-08-14 EcomGPT: Instruction-tuning Large Language Model with Chain-of-Task Tasks for E-commerce Yangning Li et.al. 2308.06966v1 link
2023-08-14 GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text Pengfei Liu et.al. 2308.06911v1 link
2023-08-13 An Ensemble Approach to Question Classification: Integrating Electra Transformer, GloVe, and LSTM Sanad Aburass et.al. 2308.06828v1 null
2023-08-13 AerialVLN: Vision-and-Language Navigation for UAVs Shubo Liu et.al. 2308.06735v1 link
2023-08-12 Copilot Security: A User Study Owura Asare et.al. 2308.06587v1 link
2023-08-11 KETM:A Knowledge-Enhanced Text Matching method Kexin Jiang et.al. 2308.06235v1 link
2023-08-11 Large Language Models for Telecom: Forthcoming Impact on the Industry Ali Maatouk et.al. 2308.06013v1 null
2023-08-10 LASIGE and UNICAGE solution to the NASA LitCoin NLP Competition Pedro Ruas et.al. 2308.05609v1 null
2023-08-10 Bringing order into the realm of Transformer-based language models for artificial intelligence and law Candida M. Greco et.al. 2308.05502v1 null
2023-08-11 Exploring Machine Learning and Transformer-based Approaches for Deceptive Text Classification: A Comparative Analysis Anusuya Krishnan et.al. 2308.05476v2 null
2023-08-10 From CNN to Transformer: A Review of Medical Image Segmentation Models Wenjian Yao et.al. 2308.05305v1 null
2023-08-09 A Novel Method for improving accuracy in neural network by reinstating traditional back propagation technique Gokulprasath R et.al. 2308.05059v1 null
2023-08-09 Performance Analysis of Transformer Based Models (BERT, ALBERT and RoBERTa) in Fake News Detection Shafna Fitria Nur Azizah et.al. 2308.04950v1 link
2023-08-09 An Empirical Study on Using Large Language Models to Analyze Software Supply Chain Security Failures Tanmay Singla et.al. 2308.04898v1 null
2023-08-09 No Need to Lift a Finger Anymore? Assessing the Quality of Code Generation by ChatGPT Zhijie Liu et.al. 2308.04838v1 null
2023-08-09 TSSR: A Truncated and Signed Square Root Activation Function for Neural Networks Yuanhao Gong et.al. 2308.04832v1 null
2023-08-09 Optimizing a Transformer-based network for a deep learning seismic processing workflow Randy Harsuko et.al. 2308.04739v1 null
2023-08-09 A Comparative Study of Open-Source Large Language Models, GPT-4 and Claude 2: Multiple-Choice Test Taking in Nephrology Sean Wu et.al. 2308.04709v1 null
2023-08-09 Cross-Lingual Constituency Parsing for Middle High German: A Delexicalized Approach Ercong Nie et.al. 2308.04645v1 null
2023-08-08 Unmasking Nationality Bias: A Study of Human Perception of Nationalities in AI-Generated Articles Pranav Narayanan Venkit et.al. 2308.04346v1 null
2023-08-08 Deep Learning-Based Knowledge Injection for Metaphor Detection: A Comprehensive Review Cheng Yang et.al. 2308.04306v1 null
2023-08-08 CLASSLA-Stanza: The Next Step for Linguistic Processing of South Slavic Languages Luka Terčon et.al. 2308.04255v1 link
2023-08-08 Assistive Chatbots for healthcare: a succinct review Basabdatta Sen Bhattacharya et.al. 2308.04178v1 null
2023-08-08 I-WAS: a Data Augmentation Method with GPT-2 for Simile Detection Yongzhu Chang et.al. 2308.04109v1 null
2023-08-08 Portrayal: Leveraging NLP and Visualization for Analyzing Fictional Characters Md Naimul Hoque et.al. 2308.04056v1 null
2023-08-08 A Comparative Study on TF-IDF feature Weighting Method and its Analysis using Unstructured Dataset Mamata Das et.al. 2308.04037v1 null
2023-08-08 AI Chatbots as Multi-Role Pedagogical Agents: Transforming Engagement in CS Education Cassie Chen Cao et.al. 2308.03992v1 null
2023-08-07 Extracting detailed oncologic history and treatment plan from medical oncology notes with large language models Madhumita Sushil et.al. 2308.03853v1 link
2023-08-07 "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models Xinyue Shen et.al. 2308.03825v1 link
2023-08-07 RCMHA: Relative Convolutional Multi-Head Attention for Natural Language Modelling Herman Sugiharto et.al. 2308.03429v1 link
2023-08-07 TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents Jingqing Ruan et.al. 2308.03427v1 null
2023-08-07 Symmetry-Preserving Program Representations for Learning Code Semantics Kexin Pei et.al. 2308.03312v1 null
2023-08-07 From Ambiguity to Explicitness: NLP-Assisted 5G Specification Abstraction for Formal Analysis Shiyu Yuan et.al. 2308.03277v1 null
2023-08-07 Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion Mining Nour Eddine Zekaoui et.al. 2308.03235v1 link
2023-08-06 Average-Hard Attention Transformers are Constant-Depth Uniform Threshold Circuits Lena Strobl et.al. 2308.03212v1 null
2023-08-04 Meta-Tsallis-Entropy Minimization: A New Self-Training Approach for Domain Adaptation on Text Classification Menglong Lu et.al. 2308.02746v1 null
2023-08-04 Universal Approximation of Linear Time-Invariant (LTI) Systems through RNNs: Power of Randomness in Reservoir Computing Shashank Jere et.al. 2308.02464v1 null
2023-08-04 Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from Text Nandana Mihindukulasooriya et.al. 2308.02357v1 link
2023-08-04 Sinhala-English Parallel Word Dictionary Dataset Kasun Wickramasinghe et.al. 2308.02234v1 link
2023-08-04 Explaining Relation Classification Models with Semantic Extents Lars Klöser et.al. 2308.02193v1 link
2023-08-04 From Fake to Hyperpartisan News Detection Using Domain Adaptation Răzvan-Alexandru Smădu et.al. 2308.02185v1 null
2023-08-04 ParaFuzz: An Interpretability-Driven Technique for Detecting Poisoned Samples in NLP Lu Yan et.al. 2308.02122v1 null
2023-08-04 Model Provenance via Model DNA Xin Mu et.al. 2308.02121v1 null
2023-08-03 Causality Guided Disentanglement for Cross-Platform Hate Speech Detection Paras Sheth et.al. 2308.02080v1 link
2023-08-03 Accurate Neural Network Pruning Requires Rethinking Sparse Optimization Denis Kuznedelev et.al. 2308.02060v1 null
2023-08-03 Seasonality Based Reranking of E-commerce Autocomplete Using Natural Language Queries Prateek Verma et.al. 2308.02055v1 null
2023-08-03 Tag Prediction of Competitive Programming Problems using Deep Learning Techniques Taha Lokat et.al. 2308.01863v1 null
2023-08-03 XNLP: An Interactive Demonstration System for Universal Structured NLP Hao Fei et.al. 2308.01846v1 null
2023-08-03 Lexicon and Rule-based Word Lemmatization Approach for the Somali Language Shafie Abdi Mohamed et.al. 2308.01785v1 link
2023-08-03 Does Correction Remain An Problem For Large Language Models? Xiaowu Zhang et.al. 2308.01776v1 null
2023-08-03 NBIAS: A Natural Language Processing Framework for Bias Identification in Text Shaina Razaa et.al. 2308.01681v1 null
2023-08-03 Holy Grail 2.0: From Natural Language to Constraint Models Dimos Tsouros et.al. 2308.01589v1 null
2023-08-03 Large Language Model Displays Emergent Ability to Interpret Novel Literary Metaphors Nicholas Ichien et.al. 2308.01497v1 null
2023-08-02 Manual Tests Do Smell! Cataloging and Identifying Natural Language Test Smells Elvys Soares et.al. 2308.01386v1 link
2023-08-02 Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification Laurin Wagner et.al. 2308.01327v1 null
2023-08-02 ADS-Cap: A Framework for Accurate and Diverse Stylized Captioning with Unpaired Stylistic Corpora Kanzhi Cheng et.al. 2308.01143v1 link
2023-08-02 Feature-aware conditional GAN for category text generation Xinze Li et.al. 2308.00939v1 null
2023-07-31 Predicting masked tokens in stochastic locations improves masked image modeling Amir Bar et.al. 2308.00566v1 null
2023-08-01 Discourse-Aware Text Simplification: From Complex Sentences to Linked Propositions Christina Niklaus et.al. 2308.00425v1 null
2023-08-01 LimeAttack: Local Explainable Method for Textual Hard-Label Adversarial Attack Hai Zhu et.al. 2308.00319v1 link
2023-08-01 LGViT: Dynamic Early Exiting for Accelerating Vision Transformer Guanyu Xu et.al. 2308.00255v1 null
2023-07-31 Adversarially Robust Neural Legal Judgement Systems Rohit Raj et.al. 2308.00165v1 null
2023-07-31 Structural Transfer Learning in NL-to-Bash Semantic Parsers Kyle Duffy et.al. 2307.16795v1 null
2023-08-02 LLMs4OL: Large Language Models for Ontology Learning Hamed Babaei Giglou et.al. 2307.16648v2 link
2023-07-31 Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks Xinyu Zhang et.al. 2307.16630v1 null
2023-07-31 Toward Quantum Machine Translation of Syntactically Distinct Languages Mina Abbaszade et.al. 2307.16576v1 null
2023-07-31 AMOE: a Tool to Automatically Extract and Assess Organizational Evidence for Continuous Cloud Audit Franz Deimling et.al. 2307.16541v1 null
2023-07-31 A Benchmark for Understanding Dialogue Safety in Mental Health Support Huachuan Qiu et.al. 2307.16457v1 link
2023-07-31 Camoscio: an Italian Instruction-tuned LLaMA Andrea Santilli et.al. 2307.16456v1 link
2023-07-31 LP-MusicCaps: LLM-Based Pseudo Music Captioning SeungHeon Doh et.al. 2307.16372v1 link
2023-07-30 Self-Supervised Learning of Gait-Based Biomarkers R. James Cotton et.al. 2307.16321v1 null
2023-07-30 Text Analysis Using Deep Neural Networks in Digital Humanities and Information Science Omri Suissa et.al. 2307.16217v1 null
2023-07-28 Universal Recurrent Event Memories for Streaming Data Ran Dou et.al. 2307.15694v1 null
2023-07-28 BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering Khiem Vinh Tran et.al. 2307.15335v1 null
2023-07-28 TrafficSafetyGPT: Tuning a Pre-trained Large Language Model to a Domain-Specific Expert in Transportation Safety Ou Zheng et.al. 2307.15311v1 link
2023-07-27 f-Divergence Minimization for Sequence-Level Knowledge Distillation Yuqiao Wen et.al. 2307.15190v1 link
2023-07-27 Text-guided Foundation Model Adaptation for Pathological Image Classification Yunkun Zhang et.al. 2307.14901v1 link
2023-07-27 Improving Natural Language Inference in Arabic using Transformer Models and Linguistically Informed Pre-Training Mohammad Majd Saad Al Deen et.al. 2307.14666v1 link
2023-07-27 Metric-Based In-context Learning: A Case Study in Text Simplification Subha Vadlamannati et.al. 2307.14632v1 link
2023-07-27 Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models Yuchi Qiu et.al. 2307.14587v1 null
2023-07-26 Words That Stick: Predicting Decision Making and Synonym Engagement Using Cognitive Biases and Computational Linguistics Nimrod Dvir et.al. 2307.14511v1 null
2023-07-26 A Predictive Model of Digital Information Engagement: Forecasting User Engagement With English Words by Incorporating Cognitive Biases, Computational Linguistics and Natural Language Processing Nimrod Dvir et.al. 2307.14500v1 null
2023-07-26 TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning Yury Gorishniy et.al. 2307.14338v1 link
2023-07-26 Comparative Analysis of Libraries for the Sentimental Analysis Wendy Ccoya et.al. 2307.14311v1 null
2023-07-26 Mining Reddit Data to Elicit Students' Requirements During COVID-19 Pandemic Shadikur Rahman et.al. 2307.14212v1 null
2023-07-26 A semantics-driven methodology for high-quality image annotation Fausto Giunchiglia et.al. 2307.14119v1 null
2023-07-26 Decoding ChatGPT: A Taxonomy of Existing Research, Current Challenges, and Possible Future Directions Shahab Saquib Sohail et.al. 2307.14107v1 null
2023-07-25 Evaluating Large Language Models for Radiology Natural Language Processing Zhengliang Liu et.al. 2307.13693v1 link
2023-07-25 Multilevel Large Language Models for Everyone Yuanhao Gong et.al. 2307.13221v1 null
2023-07-24 Explaining Math Word Problem Solvers Abby Newcomb et.al. 2307.13128v1 null
2023-07-24 Making Metadata More FAIR Using Large Language Models Sowmya S. Sundaram et.al. 2307.13085v1 null
2023-07-24 A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models Jindong Gu et.al. 2307.12980v1 link
2023-07-24 Aligning Large Language Models with Human: A Survey Yufei Wang et.al. 2307.12966v1 link
2023-07-24 Concept-based explainability for an EEG transformer model Anders Gjølbye Madsen et.al. 2307.12745v1 link
2023-07-23 Transformer-based Joint Source Channel Coding for Textual Semantic Communication Shicong Liu et.al. 2307.12266v1 null
2023-07-22 A Zero-shot and Few-shot Study of Instruction-Finetuned Large Language Models Applied to Clinical and Biomedical Tasks Yanis Labrak et.al. 2307.12114v1 null
2023-07-22 Sparse then Prune: Toward Efficient Vision Transformers Yogi Prasetyo et.al. 2307.11988v1 link
2023-07-22 HIQL: Offline Goal-Conditioned RL with Latent States as Actions Seohong Park et.al. 2307.11949v1 link
2023-07-21 Multimodal Document Analytics for Banking Process Automation Christopher Gerling et.al. 2307.11845v1 null
2023-07-21 Advancing Visual Grounding with Scene Knowledge: Benchmark and Method Zhihong Chen et.al. 2307.11558v1 link
2023-07-21 YOLOPose V2: Understanding and Improving Transformer-based 6D Pose Estimation Arul Selvam Periyasamy et.al. 2307.11550v1 null
2023-07-21 Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation Zunnan Xu et.al. 2307.11545v1 link
2023-07-20 A Systematic Evaluation of Federated Learning on Biomedical Natural Language Processing Le Peng et.al. 2307.11254v1 link
2023-07-20 Extreme Multi-Label Skill Extraction Training using Large Language Models Jens-Joris Decorte et.al. 2307.10778v1 null
2023-07-20 A Dataset and Strong Baselines for Classification of Czech News Texts Hynek Kydlíček et.al. 2307.10666v1 link
2023-07-20 Exploring the Landscape of Natural Language Processing Research Tim Schopf et.al. 2307.10652v1 link
2023-07-20 Instruction-following Evaluation through Verbalizer Manipulation Shiyang Li et.al. 2307.10558v1 null
2023-07-19 Mood Classification of Bangla Songs Based on Lyrics Maliha Mahajebin et.al. 2307.10314v1 null
2023-07-19 Alzheimer's Disease Detection from Spontaneous Speech and Text: A review Vrindha M. K. et.al. 2307.10005v1 null
2023-07-19 Large Language Models can accomplish Business Process Management Tasks Michael Grohs et.al. 2307.09923v1 null
2023-07-19 Chit-Chat or Deep Talk: Prompt Engineering for Process Mining Urszula Jessen et.al. 2307.09909v1 null
2023-07-19 Test-takers have a say: understanding the implications of the use of AI in language tests Dawen Zhang et.al. 2307.09885v1 null
2023-07-19 Enhancing conversational quality in language learning chatbots: An evaluation of GPT4 for ASR error correction Long Mai et.al. 2307.09744v1 null
2023-07-19 Improving Domain Generalization for Sound Classification with Sparse Frequency-Regularized Transformer Honglin Mu et.al. 2307.09723v1 link
2023-07-19 Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation Hao Peng et.al. 2307.09701v1 null
2023-07-18 Can Model Fusing Help Transformers in Long Document Classification? An Empirical Study Damith Premasiri et.al. 2307.09532v1 link
2023-07-18 Scaling Laws for Imitation Learning in NetHack Jens Tuyls et.al. 2307.09423v1 null
2023-07-18 UniTabE: Pretraining a Unified Tabular Encoder for Heterogeneous Tabular Data Yazheng Yang et.al. 2307.09249v1 null
2023-07-18 Mitigating masked pixels in climate-critical datasets Angelina Agabin et.al. 2307.09227v1 null
2023-07-18 Automated Ableism: An Exploration of Explicit Disability Biases in Sentiment and Toxicity Analysis Models Pranav Narayanan Venkit et.al. 2307.09209v1 null
2023-07-18 Unveiling Gender Bias in Terms of Profession Across LLMs: Analyzing and Addressing Sociological Implications Vishesh Thakur et.al. 2307.09162v1 null
2023-07-18 R-Cut: Enhancing Explainability in Vision Transformers with Relationship Weighted Out and Cut Yingjie Niu et.al. 2307.09050v1 null
2023-07-18 On the (In)Effectiveness of Large Language Models for Chinese Text Correction Yinghui Li et.al. 2307.09007v1 null
2023-07-18 NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning Tianxin Wei et.al. 2307.08941v1 link
2023-07-18 Teach model to answer questions after comprehending the document Ruiqing Sun et.al. 2307.08931v1 null
2023-07-17 Harnessing the Power of AI based Image Generation Model DALLE 2 in Agricultural Settings Ranjan Sapkota et.al. 2307.08789v1 null
2023-07-17 COLLIE: Systematic Construction of Constrained Text Generation Tasks Shunyu Yao et.al. 2307.08689v1 link
2023-07-17 Utilization of Pre-trained Language Model for Adapter-based Knowledge Transfer in Software Engineering Iman Saberi et.al. 2307.08540v1 null
2023-07-17 BUS:Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization Chaoya Jiang et.al. 2307.08504v1 null
2023-07-17 On the application of Large Language Models for language teaching and assessment technology Andrew Caines et.al. 2307.08393v1 null
2023-07-16 Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling Longyue Wang et.al. 2307.08074v1 null
2023-07-16 Fast Quantum Algorithm for Attention Computation Yeqi Gao et.al. 2307.08045v1 null
2023-07-16 A Survey of Techniques for Optimizing Transformer Inference Krishna Teja Chitty-Venkata et.al. 2307.07982v1 null
2023-07-15 AspectCSE: Sentence Embeddings for Aspect-based Semantic Textual Similarity using Contrastive Learning and Structured Knowledge Tim Schopf et.al. 2307.07851v1 null
2023-07-15 Improving Trace Link Recommendation by Using Non-Isotropic Distances and Combinations Christof Tinnes et.al. 2307.07781v1 null
2023-07-15 Leveraging Large Language Models to Generate Answer Set Programs Adam Ishay et.al. 2307.07699v1 link
2023-07-14 Investigating ChatGPT's Potential to Assist in Requirements Elicitation Processes Krishna Ronanki et.al. 2307.07381v1 null
2023-07-14 AIC-AB NET: A Neural Network for Image Captioning with Spatial Attention and Text Attributes Guoyun Tu et.al. 2307.07370v1 null
2023-07-14 A scoping review on multimodal deep learning in biomedical images and texts Zhaoyi Sun et.al. 2307.07362v1 null
2023-07-14 MaxSR: Image Super-Resolution Using Improved MaxViT Bincheng Yang et.al. 2307.07240v1 null
2023-07-14 Software Testing with Large Language Model: Survey, Landscape, and Vision Junjie Wang et.al. 2307.07221v1 null
2023-07-13 Making the Most Out of the Limited Context Length: Predictive Power Varies with Clinical Note Type and Note Section Hongyi Zheng et.al. 2307.07051v1 null
2023-07-13 Parmesan: mathematical concept extraction for education Jacob Collard et.al. 2307.06699v1 null
2023-07-13 Going Beyond Local: Global Graph-Enhanced Personalized News Recommendations Boming Yang et.al. 2307.06576v1 link
2023-07-13 Convolutional Neural Networks for Sentiment Analysis on Weibo Data: A Natural Language Processing Approach Yufei Xie et.al. 2307.06540v1 null
2023-07-13 Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study Zeping Min et.al. 2307.06530v1 null
2023-07-12 Transformers in Reinforcement Learning: A Survey Pranav Agarwal et.al. 2307.05979v1 null
2023-07-11 Machine Learning Study of the Extended Drug-target Interaction Network informed by Pain Related Voltage-Gated Sodium Channels Long Chen et.al. 2307.05794v1 link
2023-07-10 Exploring Large Language Model for Graph Data Understanding in Online Job Recommendations Likang Wu et.al. 2307.05722v1 link
2023-07-11 Objaverse-XL: A Universe of 10M+ 3D Objects Matt Deitke et.al. 2307.05663v1 null
2023-07-10 Hate Speech Detection via Dual Contrastive Learning Junyu Lu et.al. 2307.05578v1 null
2023-07-11 GujiBERT and GujiGPT: Construction of Intelligent Information Processing Foundation Language Models for Ancient Texts Dongbo Wang et.al. 2307.05354v1 null
2023-07-11 On the Effectiveness of Speech Self-supervised Learning for Music Yinghao Ma et.al. 2307.05161v1 null
2023-07-11 Hybrid hidden Markov LSTM for short-term traffic flow prediction Agnimitra Sengupta et.al. 2307.04954v1 null
2023-07-10 Entity Identifier: A Natural Text Parsing-based Framework For Entity Relation Extraction El Mehdi Chouham et.al. 2307.04892v1 null
2023-07-10 COMEX: A Tool for Generating Customized Source Code Representations Debeshee Das et.al. 2307.04693v1 link
2023-07-10 Search-time Efficient Device Constraints-Aware Neural Architecture Search Oshin Dutta et.al. 2307.04443v1 null
2023-07-10 Privacy-Preserving Graph Machine Learning from Data to Computation: A Survey Dongqi Fu et.al. 2307.04338v1 null
2023-07-10 CT-BERT: Learning Better Tabular Representations Through Cross-Table Pre-training Chao Ye et.al. 2307.04308v1 link
2023-07-09 ChatGPT in the Age of Generative AI and Large Language Models: A Concise Survey Salman Mohamadi et.al. 2307.04251v1 link
2023-07-09 A Novel Pipeline for Improving Optical Character Recognition through Post-processing Using Natural Language Processing Aishik Rakshit et.al. 2307.04245v1 null
2023-07-09 Can Generative Large Language Models Perform ASR Error Correction? Rao Ma et.al. 2307.04172v1 null
2023-07-09 Dream Content Discovery from Reddit with an Unsupervised Mixed-Method Approach Anubhab Das et.al. 2307.04167v1 null
2023-07-09 DebateKG: Automatic Policy Debate Case Creation with Semantic Knowledge Graphs Allen Roush et.al. 2307.04090v1 link
2023-07-08 Evaluating the Capability of Large-scale Language Models on Chinese Grammatical Error Correction Task Fanyi Qu et.al. 2307.03972v1 null
2023-07-07 ITA: An Energy-Efficient Attention and Softmax Accelerator for Quantized Transformers Gamze İslamoğlu et.al. 2307.03493v1 null
2023-07-06 Vision Language Transformers: A Survey Clayton Fields et.al. 2307.03254v1 null
2023-07-06 BrickPal: Augmented Reality-based Assembly Instructions for Brick Models Yao Shi et.al. 2307.03162v1 null
2023-07-06 A Survey on Evaluation of Large Language Models Yupeng Chang et.al. 2307.03109v1 link
2023-07-06 Efficient Domain Adaptation of Sentence Embeddings using Adapters Tim Schopf et.al. 2307.03104v1 link
2023-07-06 Efficient Semiring-Weighted Earley Parsing Andreas Opedal et.al. 2307.02982v1 link
2023-07-06 UIT-Saviors at MEDVQA-GI 2023: Improving Multimodal Learning with Image Enhancement for Gastrointestinal Visual Question Answering Triet M. Thai et.al. 2307.02783v1 null
2023-07-05 Unsupervised Sentiment Analysis of Plastic Surgery Social Media Posts Alexandrea K. Ramnarine et.al. 2307.02640v1 null
2023-07-05 ODD: A Benchmark Dataset for the NLP-based Opioid Related Aberrant Behavior Detection Sunjae Kwon et.al. 2307.02591v1 link
2023-07-05 Sumformer: Universal Approximation for Efficient Transformers Silas Alberti et.al. 2307.02301v1 null
2023-07-05 Make A Long Image Short: Adaptive Token Length for Vision Transformers Qiqi Zhou et.al. 2307.02092v1 null
2023-07-05 Emoji Prediction using Transformer Models Muhammad Osama Nusrat et.al. 2307.02054v1 link
2023-07-05 Recommender Systems in the Era of Large Language Models (LLMs) Wenqi Fan et.al. 2307.02046v1 null
2023-07-04 RRCNN: A novel signal decomposition approach based on recurrent residue convolutional neural network Feng Zhou et.al. 2307.01725v1 link
2023-07-04 A Language Model for Grammatical Error Correction in L2 Russian Nikita Remnev et.al. 2307.01609v1 null
2023-07-04 Learning to Prompt in the Classroom to Understand AI Limits: A pilot study Emily Theophilou et.al. 2307.01540v1 null
2023-07-04 All in One: Multi-task Prompting for Graph Neural Networks Xiangguo Sun et.al. 2307.01504v1 link
2023-07-04 On Evaluating and Mitigating Gender Biases in Multilingual Settings Aniket Vashishtha et.al. 2307.01503v1 null
2023-07-04 SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification Junjie Wu et.al. 2307.01488v1 null
2023-07-03 Improving Language Plasticity via Pretraining with Active Forgetting Yihong Chen et.al. 2307.01163v1 null
2023-07-03 Exploring the In-context Learning Ability of Large Language Model for Biomedical Concept Linking Qinyong Wang et.al. 2307.01137v1 null
2023-07-03 Challenges in Domain-Specific Abstractive Summarization and How to Overcome them Anum Afzal et.al. 2307.00963v1 null
2023-07-03 Automatic Design of Semantic Similarity Ensembles Using Grammatical Evolution Jorge Martinez-Gil et.al. 2307.00925v1 link
2023-07-03 Contextual Prompt Learning for Vision-Language Understanding Koustava Goswami et.al. 2307.00910v1 null
2023-07-03 Element similarity in high-dimensional materials representations Anthony Onwuli et.al. 2307.00784v1 null
2023-07-02 Neuro-Symbolic Sudoku Solver Ashutosh Hathidara et.al. 2307.00653v1 link
2023-07-02 Text based Large Language Model for Recommendation Jianchao Ji et.al. 2307.00457v1 link
2023-07-02 Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal Data Xinzhe Li et.al. 2307.00456v1 link
2023-07-01 SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency Yan Wang et.al. 2307.00280v1 null
2023-06-30 Towards Improving the Performance of Pre-Trained Speech Models for Low-Resource Languages Through Lateral Inhibition Andrei-Marius Avram et.al. 2306.17792v1 null
2023-06-30 Augmenting Holistic Review in University Admission using Natural Language Processing for Essays and Recommendation Letters Jinsook Lee et.al. 2306.17575v1 null
2023-06-30 A Cost-aware Study of Depression Language on Social Media using Topic and Affect Contextualization Andrea Laguna et.al. 2306.17564v1 null
2023-06-30 GPT-FinRE: In-context Learning for Financial Relation Extraction using Large Language Models Pawan Kumar Rajpoot et.al. 2306.17519v1 link
2023-06-29 Prediction of COVID-19 Patients' Emergency Room Revisit using Multi-Source Transfer Learning Yuelyu Ji et.al. 2306.17257v1 null
2023-06-29 Towards Grammatical Tagging for the Legal Language of Cybersecurity Gianpietro Castiglione et.al. 2306.17042v1 null
2023-06-29 Benchmarking Large Language Model Capabilities for Conditional Generation Joshua Maynez et.al. 2306.16793v1 null
2023-06-29 Principles and Guidelines for Evaluating Social Robot Navigation Algorithms Anthony Francis et.al. 2306.16740v1 null
2023-06-29 Beyond CO2 Emissions: The Overlooked Impact of Water Consumption of Information Retrieval Models Guido Zuccon et.al. 2306.16668v1 link
2023-06-28 An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs Haihao Shen et.al. 2306.16601v1 link
2023-06-28 Multi-Site Clinical Federated Learning using Recursive and Attentive Models and NVFlare Won Joon Yun et.al. 2306.16367v1 null
2023-06-28 cuSLINK: Single-linkage Agglomerative Clustering on the GPU Corey J. Nolet et.al. 2306.16354v1 link
2023-06-28 Generative User-Experience Research for Developing Domain-specific Natural Language Processing Applications Anastasia Zhukova et.al. 2306.16143v1 null
2023-06-28 ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases Jiaxi Cui et.al. 2306.16092v1 link
2023-06-28 Sentence-to-Label Generation Framework for Multi-task Learning of Japanese Sentence Classification and Named Entity Recognition Chengguang Gan et.al. 2306.15978v1 link
2023-06-28 Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias Yue Yu et.al. 2306.15895v1 link
2023-06-27 MAT: Mixed-Strategy Game of Adversarial Training in Fine-tuning Zhehua Zhong et.al. 2306.15826v1 null
2023-06-27 To Spike or Not To Spike: A Digital Hardware Perspective on Deep Learning Acceleration Fabrizio Ottati et.al. 2306.15749v1 link
2023-06-27 Exploring Durham University Physics exams with Large Language Models Will Yeadon et.al. 2306.15609v1 link
2023-06-27 Using Large Language Models to Provide Explanatory Feedback to Human Tutors Jionghao Lin et.al. 2306.15498v1 null
2023-06-27 Gender Bias in BERT -- Measuring and Analysing Biases through Sentiment Rating in a Realistic Downstream Classification Task Sophie Jentzsch et.al. 2306.15298v1 null
2023-06-28 Investigating Cross-Domain Behaviors of BERT in Review Understanding Albert Lu et.al. 2306.15123v2 null
2023-06-26 FeedbackMap: a tool for making sense of open-ended survey responses Doug Beeferman et.al. 2306.15112v1 link
2023-06-26 LM4HPC: Towards Effective Language Model Application in High-Performance Computing Le Chen et.al. 2306.14979v1 null
2023-06-26 The Art of Embedding Fusion: Optimizing Hate Speech Detection Mohammad Aflah Khan et.al. 2306.14939v1 link
2023-06-26 Learning to Modulate pre-trained Models in RL Thomas Schmied et.al. 2306.14884v1 link
2023-06-26 Enriching the NArabizi Treebank: A Multifaceted Approach to Supporting an Under-Resourced Language Riabi Arij et.al. 2306.14866v1 null
2023-06-26 Inter-Annotator Agreement in the Wild: Uncovering Its Emerging Roles and Considerations in Real-World Scenarios NamHyeok Kim et.al. 2306.14373v1 null
2023-06-25 Revolutionizing Cyber Threat Detection with Large Language Models Mohamed Amine Ferrag et.al. 2306.14263v1 null
2023-06-25 Towards Trustworthy Explanation: On Causal Rationalization Wenbo Zhang et.al. 2306.14115v1 link
2023-06-25 Chinese Fine-Grained Financial Sentiment Analysis with Large Language Models Yinyu Lan et.al. 2306.14096v1 link
2023-06-24 On the Uses of Large Language Models to Interpret Ambiguous Cyberattack Descriptions Reza Fayyazi et.al. 2306.14062v1 null
2023-06-24 Comparison of Pre-trained Language Models for Turkish Address Parsing Muhammed Cihat Ünal et.al. 2306.13947v1 null
2023-06-24 Large Sequence Models for Sequential Decision-Making: A Survey Muning Wen et.al. 2306.13945v1 null
2023-06-24 Spatio-temporal Storytelling? Leveraging Generative Models for Semantic Trajectory Analysis Shreya Ghosh et.al. 2306.13905v1 null
2023-06-23 Knowledge-Infused Self Attention Transformers Kaushik Roy et.al. 2306.13501v1 null
2023-06-23 Abstractive Text Summarization for Resumes With Cutting Edge NLP Transformers and LSTM Öykü Berfin Mercan et.al. 2306.13315v1 null
2023-06-22 Prompt to GPT-3: Step-by-Step Thinking Instructions for Humor Generation Yuetian Chen et.al. 2306.13195v1 link
2023-06-22 On Hate Scaling Laws For Data-Swamps Abeba Birhane et.al. 2306.13141v1 link
2023-06-22 Named entity recognition in resumes Ege Kesim et.al. 2306.13062v1 null
2023-06-22 Tracking public attitudes toward ChatGPT on Twitter using sentiment analysis and topic modeling Ratanond Koonchanok et.al. 2306.12951v1 link
2023-06-22 Cross-lingual Cross-temporal Summarization: Dataset, Models, Evaluation Ran Zhang et.al. 2306.12916v1 link
2023-06-22 Natural Language Processing in Electronic Health Records in Relation to Healthcare Decision-making: A Systematic Review Elias Hossain et.al. 2306.12834v1 null
2023-06-22 Instruct-FinGPT: Financial Sentiment Analysis by Instruction Tuning of General-Purpose Large Language Models Boyu Zhang et.al. 2306.12659v1 null
2023-06-22 Identifying and Extracting Rare Disease Phenotypes with Large Language Models Cathy Shyr et.al. 2306.12656v1 link
2023-06-21 SIFTER: A Task-specific Alignment Strategy for Enhancing Sentence Embeddings Chao Yu et.al. 2306.12280v1 null
2023-06-21 What Constitutes Good Contrastive Learning in Time-Series Forecasting? Chiyu Zhang et.al. 2306.12086v1 null
2023-06-21 Task-Robust Pre-Training for Worst-Case Downstream Adaptation Jianghui Wang et.al. 2306.12070v1 null
2023-06-21 Sample Attackability in Natural Language Adversarial Attacks Vyas Raina et.al. 2306.12043v1 link
2023-06-21 Multimodality Fusion for Smart Healthcare: a Journey from Data, Information, Knowledge to Wisdom Thanveer Shaik et.al. 2306.11963v1 null
2023-06-20 Deep Fusion: Efficient Network Training via Pre-trained Initializations Hanna Mazzawi et.al. 2306.11903v1 null
2023-06-20 Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications Saed Rezayi et.al. 2306.11892v1 null
2023-06-21 Event Stream GPT: A Data Pre-processing and Modeling Library for Generative, Pre-trained Transformers over Continuous-time Sequences of Complex Events Matthew B. A. McDermott et.al. 2306.11547v2 link
2023-06-20 One model to rule them all: ranking Slovene summarizers Aleš Žagar et.al. 2306.11518v1 null
2023-06-20 TrustGPT: A Benchmark for Trustworthy and Responsible Large Language Models Yue Huang et.al. 2306.11507v1 null
2023-06-20 Transforming Graphs for Enhanced Attribute-Based Clustering: An Innovative Graph Transformer Method Shuo Han et.al. 2306.11307v1 null
2023-06-20 UVSCAN: Detecting Third-Party Component Usage Violations in IoT Firmware Binbin Zhao et.al. 2306.11206v1 null
2023-06-19 BioREx: Improving Biomedical Relation Extraction by Leveraging Heterogeneous Datasets Po-Ting Lai et.al. 2306.11189v1 link
2023-06-18 Understanding and Characterizing Cryptocurrency Free Giveaway and Arbitrage Bot Scams In the Wild Kai Li et.al. 2306.10634v1 link
2023-06-17 Multilingual Multiword Expression Identification Using Lateral Inhibition and Domain Adaptation Andrei-Marius Avram et.al. 2306.10419v1 null
2023-06-16 SSE: A Metric for Evaluating Search System Explainability Catherine Chen et.al. 2306.10175v1 link
2023-06-16 Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects Kexin Zhang et.al. 2306.10125v1 link
2023-06-16 Rewriting the Script: Adapting Text Instructions for Voice Interaction Alyssa Hwang et.al. 2306.09992v1 null
2023-06-16 ClinicalGPT: Large Language Models Finetuned with Diverse Medical Data and Comprehensive Evaluation Guangyu Wang et.al. 2306.09968v1 null
2023-06-16 Revealing the impact of social circumstances on the selection of cancer therapy through natural language processing of social work notes Shenghuan Sun et.al. 2306.09877v1 null
2023-06-16 Full Parameter Fine-tuning for Large Language Models with Limited Resources Kai Lv et.al. 2306.09782v1 link
2023-06-16 Using Natural Language Processing and Networks to Automate Structured Literature Reviews: An Application to Farmers Climate Change Adaptation Sofia Gil-Clavel et.al. 2306.09737v1 null
2023-06-16 Reducing Computational Costs in Sentiment Analysis: Tensorized Recurrent Networks vs. Recurrent Networks Gabriel Lopez et.al. 2306.09705v1 null
2023-06-15 Building blocks for complex tasks: Robust generative event extraction for radiology reports under domain shifts Sitong Zhou et.al. 2306.09544v1 null
2023-06-15 FedMultimodal: A Benchmark For Multimodal Federated Learning Tiantian Feng et.al. 2306.09486v1 null
2023-06-15 From BERT to GPT-3 Codex: Harnessing the Potential of Very Large Language Models for Data Management Immanuel Trummer et.al. 2306.09339v1 null
2023-06-15 Opportunities for Large Language Models and Discourse in Engineering Design Jan Göpfert et.al. 2306.09169v1 null
2023-06-15 Mapping Researcher Activity based on Publication Data by means of Transformers Zineddine Bettouche et.al. 2306.09049v1 null
2023-06-15 Voting Booklet Bias: Stance Detection in Swiss Federal Communication Eric Egli et.al. 2306.08999v1 link
2023-06-15 Multilingual End to End Entity Linking Mikhail Plekhanov et.al. 2306.08896v1 link
2023-06-15 Description-Enhanced Label Embedding Contrastive Learning for Text Classification Kun Zhang et.al. 2306.08817v1 link
2023-06-14 Explore In-Context Learning for 3D Point Cloud Understanding Zhongbin Fang et.al. 2306.08659v1 link
2023-06-14 Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models Lingxi Xie et.al. 2306.08641v1 null
2023-06-14 SQL2Circuits: Estimating Metrics for SQL Queries with A Quantum Natural Language Processing Method Valter Uotila et.al. 2306.08529v1 link
2023-06-14 AlbMoRe: A Corpus of Movie Reviews for Sentiment Analysis in Albanian Erion Çano et.al. 2306.08526v1 link
2023-06-13 Adversarial Capsule Networks for Romanian Satire Detection and Sentiment Analysis Sebastian-Vasile Echim et.al. 2306.07845v1 null
2023-06-13 A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews Robert Lakatos et.al. 2306.07786v1 null
2023-06-13 Rethink the Effectiveness of Text Data Augmentation: An Empirical Analysis Zhengxiang Shi et.al. 2306.07664v1 link
2023-06-12 Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and Siswati Andani Madodonga et.al. 2306.07426v1 link
2023-06-12 EriBERTa: A Bilingual Pre-Trained Language Model for Clinical Natural Language Processing Iker de la Iglesia et.al. 2306.07373v1 null
2023-06-11 A Comprehensive Survey on Applications of Transformers for Deep Learning Tasks Saidul Islam et.al. 2306.07303v1 null
2023-06-12 A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation Jeremy Gwinnup et.al. 2306.07198v1 null
2023-06-12 A language-inspired machine learning approach for solving strongly correlated problems with dynamical mean-field theory Zelong Zhao et.al. 2306.06975v1 link
2023-06-12 A Brief Review of Hypernetworks in Deep Learning Vinod Kumar Chauhan et.al. 2306.06955v1 link
2023-06-11 AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural Language Processing Asaad Alghamdi et.al. 2306.06800v1 null
2023-06-11 Adapting to the Impact of AI in Scientific Writing: Balancing Benefits and Drawbacks while Developing Policies and Regulations Ahmed S. BaHammam et.al. 2306.06699v1 null
2023-06-11 Computational Language Assessment: Open Brain AI Charalambos Themistocleous et.al. 2306.06693v1 null
2023-06-11 EaSyGuide : ESG Issue Identification Framework leveraging Abilities of Generative Large Language Models Hanwool Lee et.al. 2306.06662v1 link
2023-06-11 RoBERTweet: A BERT Language Model for Romanian Tweets Iulian-Marius Tăiatu et.al. 2306.06598v1 null
2023-06-10 Universal Language Modelling agent Anees Aslam et.al. 2306.06521v1 null
2023-06-10 A Comprehensive Review of State-of-The-Art Methods for Java Code Generation from Natural Language Text Jessica López Espejel et.al. 2306.06371v1 null
2023-06-09 FinGPT: Open-Source Financial Large Language Models Hongyang Yang et.al. 2306.06031v1 link
2023-06-09 HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine Rodrigo Agerri et.al. 2306.06029v1 null
2023-06-09 Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect? Wissam Antoun et.al. 2306.05871v1 null
2023-06-09 Towards the Exploitation of LLM-based Chatbot for Providing Legal Support to Palestinian Cooperatives Rabee Qasem et.al. 2306.05827v1 null
2023-06-09 How Can Recommender Systems Benefit from Large Language Models: A Survey Jianghao Lin et.al. 2306.05817v1 link
2023-06-09 Detecting Phishing Sites Using ChatGPT Takashi Koide et.al. 2306.05816v1 null
2023-06-09 Exploring Effective Mask Sampling Modeling for Neural Image Compression Lin Liu et.al. 2306.05704v1 null
2023-06-09 Customizing General-Purpose Foundation Models for Medical Report Generation Bang Yang et.al. 2306.05642v1 null
2023-06-09 Word sense extension Lei Yu et.al. 2306.05609v1 link
2023-06-08 Emotion and Sentiment Guided Paraphrasing Justin J. Xie et.al. 2306.05556v1 null
2023-06-08 Advancing Italian Biomedical Information Extraction with Large Language Models: Methodological Insights and Multicenter Practical Application Claudio Crema et.al. 2306.05323v1 null
2023-06-08 Are fairness metric scores enough to assess discrimination biases in machine learning? Fanny Jourdan et.al. 2306.05307v1 null
2023-06-08 Extensive Evaluation of Transformer-based Architectures for Adverse Drug Events Extraction Simone Scaboro et.al. 2306.05276v1 link
2023-06-09 Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models Tianzhe Chu et.al. 2306.05272v2 link
2023-06-08 M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models Wenxuan Zhang et.al. 2306.05179v1 link
2023-06-09 RRWKV: Capturing Long-range Dependencies in RWKV Leilei Wang et.al. 2306.05176v2 null
2023-06-08 Learning A Foundation Language Model for Geoscience Knowledge Understanding and Utilization Cheng Deng et.al. 2306.05064v1 link
2023-06-08 Knowledge Detection by Relevant Question and Image Attributes in Visual Question Answering Param Ahir et.al. 2306.04938v1 null
2023-06-08 covLLM: Large Language Models for COVID-19 Biomedical Literature Yousuf A. Khan et.al. 2306.04926v1 null
2023-06-08 Flow-based Network Intrusion Detection Based on BERT Masked Language Model Loc Gia Nguyen et.al. 2306.04920v1 null
2023-06-07 Cross-attention learning enables real-time nonuniform rotational distortion correction in OCT Haoran Zhang et.al. 2306.04512v1 null
2023-06-07 How to Find Opinion Leader on the Online Social Network? Bailu Jin et.al. 2306.04452v1 null
2023-06-07 Multilingual Clinical NER: Translation or Cross-lingual Transfer? Xavier Fontaine et.al. 2306.04384v1 null
2023-06-07 IUTEAM1 at MEDIQA-Chat 2023: Is simple fine tuning effective for multilayer summarization of clinical conversations? Dhananjay Srivastava et.al. 2306.04328v1 link
2023-06-07 Leveraging Knowledge Graph Embeddings to Enhance Contextual Representations for Relation Extraction Fréjus A. A. Laleye et.al. 2306.04203v1 null
2023-06-07 A Survey on Generative Diffusion Models for Structured Data Heejoon Koo et.al. 2306.04139v1 null
2023-06-06 GEO-Bench: Toward Foundation Models for Earth Monitoring Alexandre Lacoste et.al. 2306.03831v1 link
2023-06-06 Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models Fobo Shi et.al. 2306.03799v1 link
2023-06-06 On the Difference of BERT-style and CLIP-style Text Encoders Zhihong Chen et.al. 2306.03678v1 link
2023-06-06 Take the Hint: Improving Arabic Diacritization with Partially-Diacritized Text Parnia Bahar et.al. 2306.03557v1 link
2023-06-06 SciLit: A Platform for Joint Scientific Literature Discovery, Summarization and Citation Generation Nianlong Gu et.al. 2306.03535v1 link
2023-06-06 Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning Peggy Tang et.al. 2306.03415v1 link
2023-06-06 Stabilizing Contrastive RL: Techniques for Offline Goal Reaching Chongyi Zheng et.al. 2306.03346v1 link
2023-06-05 A Scalable and Adaptive System to Infer the Industry Sectors of Companies: Prompt + Model Tuning of Generative Language Models Lele Cao et.al. 2306.03313v1 null
2023-06-05 Easy-to-Read in Germany: A Survey on its Current State and Available Resources Margot Madina et.al. 2306.03189v1 null
2023-06-05 Machine Learning and Statistical Approaches to Measuring Similarity of Political Parties Daria Boratyn et.al. 2306.03079v1 null
2023-06-05 Using Sequences of Life-events to Predict Human Lives Germans Savcisens et.al. 2306.03009v1 link
2023-06-05 Gen-IR @ SIGIR 2023: The First Workshop on Generative Information Retrieval Gabriel Bénédict et.al. 2306.02887v1 null
2023-06-05 COMET: Learning Cardinality Constrained Mixture of Experts with Trees and Local Search Shibal Ibrahim et.al. 2306.02824v1 link
2023-06-05 Enhancing Language Representation with Constructional Information for Natural Language Understanding Lvxiaowei Xu et.al. 2306.02819v1 link
2023-06-05 Cheap-fake Detection with LLM using Prompt Engineering Guangyang Wu et.al. 2306.02776v1 null
2023-06-05 Colexifications for Bootstrapping Cross-lingual Datasets: The Case of Phonology, Concreteness, and Affectiveness Yiyi Chen et.al. 2306.02646v1 null
2023-06-04 Adversary for Social Good: Leveraging Adversarial Attacks to Protect Personal Attribute Privacy Xiaoting Li et.al. 2306.02488v1 null
2023-06-04 Modeling Cross-Cultural Pragmatic Inference with Codenames Duet Omar Shaikh et.al. 2306.02475v1 link
2023-06-04 Taught by the Internet, Exploring Bias in OpenAIs GPT3 Ali Ayaz et.al. 2306.02428v1 null
2023-06-02 Towards In-context Scene Understanding Ivana Balažević et.al. 2306.01667v1 null
2023-06-02 Analyzing Credit Risk Model Problems through NLP-Based Clustering and Machine Learning: Insights from Validation Reports Szymon Lis et.al. 2306.01618v1 null
2023-06-02 Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today Zhuo Wang et.al. 2306.01499v1 null
2023-06-02 Syntax-aware Hybrid prompt model for Few-shot multi-modal sentiment analysis Zikai Zhou et.al. 2306.01312v1 null
2023-06-02 Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation Hanbyul Kim et.al. 2306.01296v1 null
2023-06-02 Egocentric Planning for Scalable Embodied Task Achievement Xiaotian Liu et.al. 2306.01295v1 null
2023-06-02 Active Code Learning: Benchmarking Sample-Efficient Training of Code Models Qiang Hu et.al. 2306.01250v1 null
2023-06-02 Transforming ECG Diagnosis:An In-depth Review of Transformer-based DeepLearning Models in Cardiovascular Disease Detection Zibin Zhao et.al. 2306.01249v1 null
2023-06-01 Hybrid Long Document Summarization using C2F-FAR and ChatGPT: A Practical Study Guang Lu et.al. 2306.01169v1 null
2023-06-01 Leveraging Natural Language Processing For Public Health Screening On YouTube: A COVID-19 Case Study Ahrar Bin Aslam et.al. 2306.01164v1 null
2023-06-01 Effective Structured Prompting by Meta-Learning and Representative Verbalizer Weisen Jiang et.al. 2306.00618v1 link
2023-06-01 Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior Shashank Subramanian et.al. 2306.00258v1 null
2023-05-31 Measuring the Robustness of Natural Language Processing Models to Domain Shifts Nitay Calderon et.al. 2306.00168v1 link
2023-05-31 Multilingual Multi-Figurative Language Detection Huiyuan Lai et.al. 2306.00121v1 link
2023-05-31 Findings of the VarDial Evaluation Campaign 2023 Noëmi Aepli et.al. 2305.20080v1 null
2023-05-31 Computational Language Assessment in patients with speech, language, and communication impairments Charalambos Themistocleous et.al. 2305.20046v1 null
2023-05-31 ActiveAED: A Human in the Loop Improves Annotation Error Detection Leon Weber et.al. 2305.20045v1 link
2023-06-01 A Survey on Large Language Models for Recommendation Likang Wu et.al. 2305.19860v2 link
2023-05-31 UKP-SQuARE: An Interactive Tool for Teaching Question Answering Haishuo Fang et.al. 2305.19748v1 link
2023-05-31 Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation Joonhyuk Yang et.al. 2305.19666v1 link
2023-05-31 Large Language Models Are Not Abstract Reasoners Gaël Gendron et.al. 2305.19555v1 link
2023-05-31 Ethical Considerations for Machine Translation of Indigenous Languages: Giving a Voice to the Speakers Manuel Mager et.al. 2305.19474v1 null
2023-05-30 Examining risks of racial biases in NLP tools for child protective services Anjalie Field et.al. 2305.19409v1 null
2023-05-30 Quantum Natural Language Processing based Sentiment Analysis using lambeq Toolkit Srinjoy Ganguly et.al. 2305.19383v1 null
2023-05-30 Jointly Reparametrized Multi-Layer Adaptation for Efficient and Private Tuning Umang Gupta et.al. 2305.19264v1 link
2023-05-30 Grokking of Hierarchical Structure in Vanilla Transformers Shikhar Murty et.al. 2305.18741v1 link
2023-05-30 LonXplain: Lonesomeness as a Consequence of Mental Disturbance in Reddit Posts Muskan Garg et.al. 2305.18736v1 null
2023-05-30 An Annotated Dataset for Explainable Interpersonal Risk Factors of Mental Disturbance in Social Media Posts Muskan Garg et.al. 2305.18727v1 link
2023-05-31 Beyond One-Model-Fits-All: A Survey of Domain Specialization for Large Language Models Chen Ling et.al. 2305.18703v2 null
2023-05-30 Approximation and Estimation Ability of Transformers for Sequence-to-Sequence Functions with Infinite Dimensional Input Shokichi Takakura et.al. 2305.18699v1 null
2023-05-29 SlimFit: Memory-Efficient Fine-Tuning of Transformer-based Models Using Training Dynamics Arash Ardakani et.al. 2305.18513v1 null
2023-05-29 Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning Zhanming Jie et.al. 2305.18170v1 link
2023-05-29 Exploring Effectiveness of GPT-3 in Grammatical Error Correction: A Study on Performance and Controllability in Prompt-Based Methods Mengsay Loem et.al. 2305.18156v1 null
2023-05-30 Do Large Language Models Know What They Don't Know? Zhangyue Yin et.al. 2305.18153v2 link
2023-05-29 The Utility of Large Language Models and Generative AI for Education Research Andrew Katz et.al. 2305.18125v1 null
2023-05-29 Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning Xuankai Chang et.al. 2305.18108v1 null
2023-05-29 Semantic Role Labeling Guided Out-of-distribution Detection Jinan Zou et.al. 2305.18026v1 link
2023-05-29 Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition Xiaoliang Wu et.al. 2305.18011v1 null
2023-05-28 Transfer Learning for Power Outage Detection Task with Limited Training Data Olukunle Owolabi et.al. 2305.17817v1 null
2023-05-28 Tab-CoT: Zero-shot Tabular Chain of Thought Ziqi Jin et.al. 2305.17812v1 link
2023-05-28 ConvGenVisMo: Evaluation of Conversational Generative Vision Models Narjes Nikzad Khasmakhi et.al. 2305.17784v1 link
2023-05-26 Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model David Soong et.al. 2305.17116v1 null
2023-05-26 NeuroX Library for Neuron Analysis of Deep NLP Models Fahim Dalvi et.al. 2305.17073v1 link
2023-05-26 Counterfactuals of Counterfactuals: a back-translation-inspired approach to analyse counterfactual editors Giorgos Filandrianos et.al. 2305.17055v1 link
2023-05-26 Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation David Brandfonbrener et.al. 2305.16985v1 link
2023-05-26 Theoretical and Practical Perspectives on what Influence Functions Do Andrea Schioppa et.al. 2305.16971v1 null
2023-05-26 RankCSE: Unsupervised Sentence Representations Learning via Learning to Rank Jiduan Liu et.al. 2305.16726v1 null
2023-05-26 TADA: Task-Agnostic Dialect Adapters for English Will Held et.al. 2305.16651v1 link
2023-05-26 Are Fairy Tales Fair? Analyzing Gender Bias in Temporal Narrative Event Chains of Children's Fairy Tales Paulina Toro Isaza et.al. 2305.16641v1 null
2023-05-26 Zero is Not Hero Yet: Benchmarking Zero-Shot Performance of LLMs for Financial Tasks Agam Shah et.al. 2305.16633v1 link
2023-05-26 ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-Translation Kuan-Hao Huang et.al. 2305.16585v1 link
2023-05-25 Landmark Attention: Random-Access Infinite Context Length for Transformers Amirkeivan Mohtashami et.al. 2305.16300v1 link
2023-05-25 Understanding Idea Creation in Collaborative Discourse through Networks: The Joint Attention-Interaction-Creation (AIC) Framework Xinran Zhu et.al. 2305.16262v1 null
2023-05-25 Neural Natural Language Processing for Long Texts: A Survey of the State-of-the-Art Dimitrios Tsirmpas et.al. 2305.16259v1 null
2023-05-25 Persistent Laplacian-enhanced Algorithm for Scarcely Labeled Data Classification Gokul Bhusal et.al. 2305.16239v1 null
2023-05-25 More than Words: Twitter Chatter and Financial Market Sentiment Travis Adams et.al. 2305.16164v1 null
2023-05-25 Training Data Extraction From Pre-trained Language Models: A Survey Shotaro Ishihara et.al. 2305.16157v1 null
2023-05-25 On Influence Functions, Classification Influence, Relative Influence, Memorization and Generalization Michael Kounavis et.al. 2305.16094v1 null
2023-05-25 Efficient Document Embeddings via Self-Contrastive Bregman Divergence Learning Daniel Saggau et.al. 2305.16031v1 null
2023-05-25 SING: A Plug-and-Play DNN Learning Technique Adrien Courtois et.al. 2305.15997v1 link
2023-05-25 Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data Aryan Patil et.al. 2305.15722v1 null
2023-05-24 READ: Recurrent Adaptation of Large Transformers Sid Wang et.al. 2305.15348v1 null
2023-05-24 EvEval: A Comprehensive Evaluation of Event Semantics for Large Language Models Zhengwei Tao et.al. 2305.15268v1 null
2023-05-24 SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation Tetsu Kasanishi et.al. 2305.15186v1 link
2023-05-24 A Mini Review on the utilization of Reinforcement Learning with OPC UA Simon Schindler et.al. 2305.15113v1 null
2023-05-24 GPT4Graph: Can Large Language Models Understand Graph Structured Data ? An Empirical Evaluation and Benchmarking Jiayan Guo et.al. 2305.15066v1 null
2023-05-24 Exploring Adapter-based Transfer Learning for Recommender Systems: Empirical Studies and Practical Insights Junchen Fu et.al. 2305.15036v1 link
2023-05-24 Unlocking Temporal Question Answering for Large Language Models Using Code Execution Xingxuan Li et.al. 2305.15014v1 link
2023-05-24 Bactrian-X : A Multilingual Replicable Instruction-Following Model with Low-Rank Adaptation Haonan Li et.al. 2305.15011v1 link
2023-05-24 Sentiment Analysis in the Era of Large Language Models: A Reality Check Wenxuan Zhang et.al. 2305.15005v1 link
2023-05-24 Frugal Prompting for Dialog Models Bishal Santra et.al. 2305.14919v1 link
2023-05-23 RET-LLM: Towards a General Read-Write Memory for Large Language Models Ali Modarressi et.al. 2305.14322v1 link
2023-05-23 VIP5: Towards Multimodal Foundation Models for Recommendation Shijie Geng et.al. 2305.14302v1 link
2023-05-23 LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages Milind Agarwal et.al. 2305.14263v1 link
2023-05-23 TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale Ziyun Zeng et.al. 2305.14173v1 link
2023-05-23 Out-of-Distribution Generalization in Text Classification: Past, Present, and Future Linyi Yang et.al. 2305.14104v1 null
2023-05-23 Predicting Survey Response with Quotation-based Modeling: A Case Study on Favorability towards the United States Alireza Amirshahi et.al. 2305.14086v1 null
2023-05-23 Robust Instruction Optimization for Large Language Models with Distribution Shifts Moxin Li et.al. 2305.13954v1 null
2023-05-23 Parameterized Complexity Classification for Interval Constraints Konrad K. Dabrowski et.al. 2305.13889v1 null
2023-05-23 PaD: Program-aided Distillation Specializes Large Models in Reasoning Xuekai Zhu et.al. 2305.13888v1 link
2023-05-23 A Trip Towards Fairness: Bias and De-Biasing in Large Language Models Leonardo Ranaldi et.al. 2305.13862v1 null
2023-05-22 Parallel Attention and Feed-Forward Net Design for Pre-training and Inference on Transformers Shashank Sonkar et.al. 2305.13297v1 null
2023-05-22 VideoLLM: Modeling Video Sequence with Large Language Models Guo Chen et.al. 2305.13292v1 link
2023-05-22 Watermarking Text Data on Large Language Models for Dataset Copyright Protection Yixin Liu et.al. 2305.13257v1 null
2023-05-22 Interactive Natural Language Processing Zekun Wang et.al. 2305.13246v1 null
2023-05-22 Should We Attend More or Less? Modulating Attention for Fairness Abdelrahman Zayed et.al. 2305.13088v1 null
2023-05-22 Biomedical Named Entity Recognition via Dictionary-based Synonym Generalization Zihao Fu et.al. 2305.13066v1 link
2023-05-22 RWKV: Reinventing RNNs for the Transformer Era Bo Peng et.al. 2305.13048v1 link
2023-05-22 Rethinking Semi-supervised Learning with Language Models Zhengxiang Shi et.al. 2305.13002v1 link
2023-05-22 VanillaNet: the Power of Minimalism in Deep Learning Hanting Chen et.al. 2305.12972v1 link
2023-05-22 A Diachronic Analysis of the NLP Research Paradigm Shift: When, How, and Why? Aniket Pramanick et.al. 2305.12920v1 null
2023-05-19 Recent progress in the JARVIS infrastructure for next-generation data-driven materials design Daniel Wines et.al. 2305.11842v1 null
2023-05-19 Marginalized Beam Search Algorithms for Hierarchical HMMs Xuechun Xu et.al. 2305.11752v1 link
2023-05-19 Introspective Tips: Large Language Model for In-Context Decision Making Liting Chen et.al. 2305.11598v1 null
2023-05-19 Diving into the Inter-Consistency of Large Language Models: An Insightful Analysis through Debate Kai Xiong et.al. 2305.11595v1 link
2023-05-19 Speech-Text Dialog Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment Tianshu Yu et.al. 2305.11579v1 link
2023-05-19 Constructing Word-Context-Coupled Space Aligned with Associative Knowledge Relations for Interpretable Language Modeling Fanyu Wang et.al. 2305.11543v1 link
2023-05-19 A Sequence-to-Sequence Approach for Arabic Pronoun Resolution Hanan S. Murayshid et.al. 2305.11529v1 null
2023-05-19 AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation Sara Papi et.al. 2305.11408v1 link
2023-05-18 Comparing Biases and the Impact of Multilingual Training across Multiple Languages Sharon Levy et.al. 2305.11242v1 null
2023-05-18 Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model Siyuan Huang et.al. 2305.11176v1 link
2023-05-18 Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature Ana Cláudia Akemi Matsuki de Faria et.al. 2305.11033v1 null
2023-05-18 How Deep Learning Sees the World: A Survey on Adversarial Attacks & Defenses Joana C. Costa et.al. 2305.10862v1 null
2023-05-18 Deep Learning Methods for Extracting Metaphorical Names of Flowers and Plants Amal Haddad Haddad et.al. 2305.10833v1 null
2023-05-18 Expanding the Role of Affective Phenomena in Multimodal Interaction Research Leena Mathur et.al. 2305.10827v1 null
2023-05-18 A Survey on Time-Series Pre-Trained Models Qianli Ma et.al. 2305.10716v1 link
2023-05-18 Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding Taolin Zhang et.al. 2305.10714v1 null
2023-05-18 NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in Natural Language Processing Tingting Wu et.al. 2305.10709v1 link
2023-05-18 MolXPT: Wrapping Molecules with Text for Generative Pre-training Zequn Liu et.al. 2305.10688v1 null
2023-05-17 Incorporating Attribution Importance for Improving Faithfulness Metrics Zhixue Zhao et.al. 2305.10496v1 link
2023-05-17 G-Adapter: Towards Structure-Aware Parameter-Efficient Transfer Learning for Graph Transformer Networks Anchun Gui et.al. 2305.10329v1 null
2023-05-17 Towards More Robust NLP System Evaluation: Handling Missing Scores in Benchmarks Anas Himmi et.al. 2305.10284v1 null
2023-05-17 A quantitative study of NLP approaches to question difficulty estimation Luca Benedetto et.al. 2305.10236v1 link
2023-05-17 Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection Shadi Iskander et.al. 2305.10204v1 link
2023-05-17 Qualifying Chinese Medical Licensing Examination with Knowledge Enhanced Generative Pre-training Model Jiageng Wu et.al. 2305.10163v1 null
2023-05-17 Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark Wenjun Peng et.al. 2305.10036v1 link
2023-05-17 When Gradient Descent Meets Derivative-Free Optimization: A Match Made in Black-Box Scenario Chengcheng Han et.al. 2305.10013v1 null
2023-05-17 Semantic Similarity Measure of Natural Language Text through Machine Learning and a Keyword-Aware Cross-Encoder-Ranking Summarizer -- A Case Study Using UCGIS GIS&T Body of Knowledge Yuanyuan Tian et.al. 2305.09877v1 null
2023-05-17 Knowledge Graph Completion Models are Few-shot Learners: An Empirical Study of Relation Labeling in E-commerce with LLMs Jiao Chen et.al. 2305.09858v1 null
2023-05-16 Mirages: On Anthropomorphism in Dialogue Systems Gavin Abercrombie et.al. 2305.09800v1 null
2023-05-16 Adapting Sentence Transformers for the Aviation Domain Liya Wang et.al. 2305.09556v1 null
2023-05-16 Life of PII -- A PII Obfuscation Transformer Ajinkya Deshmukh et.al. 2305.09550v1 null
2023-05-16 MetaSRL++: A Uniform Scheme for Modelling Deeper Semantics Fritz Hohl et.al. 2305.09534v1 null
2023-05-16 On the Origins of Bias in NLP through the Lens of the Jim Code Fatma Elsafoury et.al. 2305.09281v1 null
2023-05-16 Progressive Translation: Improving Domain Robustness of Neural Machine Translation with Intermediate Sequences Chaojun Wang et.al. 2305.09154v1 link
2023-05-15 An assessment of measuring local levels of homelessness through proxy social media signals Yoshi Meke Bird et.al. 2305.08978v1 null
2023-05-15 Sentence Level Curriculum Learning for Improved Neural Conversational Models Sean Paulsen et.al. 2305.08818v1 null
2023-05-15 Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text Hanieh Khorashadizadeh et.al. 2305.08804v1 null
2023-05-15 Question-Answering System Extracts Information on Injection Drug Use from Clinical Progress Notes Maria Mahbub et.al. 2305.08777v1 link
2023-05-15 Measuring Consistency in Text-based Financial Forecasting Models Linyi Yang et.al. 2305.08524v1 link
2023-05-15 Beqi: Revitalize the Senegalese Wolof Language with a Robust Spelling Corrector Derguene Mbaye et.al. 2305.08518v1 null
2023-05-15 Taxi1500: A Multilingual Dataset for Text Classification in 1500 Languages Chunlan Ma et.al. 2305.08487v1 null
2023-05-15 What's the Meaning of Superhuman Performance in Today's NLU? Simone Tedeschi et.al. 2305.08414v1 null
2023-05-14 MatSci-NLP: Evaluating Scientific Language Models on Materials Science Language Tasks Using Text-to-Schema Modeling Yu Song et.al. 2305.08264v1 link
2023-05-14 Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity Raman Dutt et.al. 2305.08252v1 null
2023-05-14 Learning to Generalize for Cross-domain QA Yingjie Niu et.al. 2305.08208v1 link
2023-05-12 PALR: Personalization Aware LLMs for Recommendation Zheng Chen et.al. 2305.07622v1 null
2023-05-12 Retrospective End-User Walkthrough: A Method for Assessing How People Combine Multiple AI Models in Decision-Making Systems Vagner Figueredo de Santana et.al. 2305.07530v1 null
2023-05-12 ArtGPT-4: Artistic Vision-Language Understanding with Adapter-enhanced MiniGPT-4 Zhengqing Yuan et.al. 2305.07490v1 link
2023-05-12 Implications of Deep Circuits in Improving Quality of Quantum Question Answering Pragya Katyayan et.al. 2305.07374v1 null
2023-05-12 Gaussian Prior Reinforcement Learning for Nested Named Entity Recognition Yawen Yang et.al. 2305.07266v1 null
2023-05-12 T-former: An Efficient Transformer for Image Inpainting Ye Deng et.al. 2305.07239v1 link
2023-05-12 When Giant Language Brains Just Aren't Enough! Domain Pizzazz with Knowledge Sparkle Dust Minh-Tien Nguyen et.al. 2305.07230v1 null
2023-05-12 Asymmetric feature interaction for interpreting model predictions Xiaolei Lu et.al. 2305.07224v1 link
2023-05-11 Automated Smell Detection and Recommendation in Natural Language Requirements Alvaro Veizaga et.al. 2305.07097v1 null
2023-05-11 Cost-efficient Crowdsourcing for Span-based Sequence Labeling: Worker Selection and Data Augmentation Yujie Wang et.al. 2305.06683v1 null
2023-05-11 When the Majority is Wrong: Leveraging Annotator Disagreement for Subjective Tasks Eve Fleisig et.al. 2305.06626v1 null
2023-05-11 GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark Dongyang Li et.al. 2305.06545v1 null
2023-05-11 How Good are Commercial Large Language Models on African Languages? Jessica Ojo et.al. 2305.06530v1 null
2023-05-10 Exploring the Landscape of Machine Unlearning: A Survey and Taxonomy Thanveer Shaik et.al. 2305.06360v1 null
2023-05-10 CADGE: Context-Aware Dialogue Generation Enhanced with Graph-Structured Knowledge Aggregation Hongbo Zhanga et.al. 2305.06294v1 link
2023-05-09 Alleviating Over-smoothing for Unsupervised Sentence Representation Nuo Chen et.al. 2305.06154v1 link
2023-05-10 CrudeBERT: Applying Economic Theory towards fine-tuning Transformer-based Sentiment Analysis Models to the Crude Oil Market Himmet Kaplan et.al. 2305.06140v1 null
2023-05-10 Transformer-based model for monocular visual odometry: a video understanding approach André O. Françani et.al. 2305.06121v1 link
2023-05-10 XTab: Cross-table Pretraining for Tabular Transformers Bingzhao Zhu et.al. 2305.06090v1 link
2023-05-10 FedSOV: Federated Model Secure Ownership Verification with Unforgeable Signature Wenyuan Yang et.al. 2305.06085v1 null
2023-05-09 Learning to Parallelize with OpenMP by Augmented Heterogeneous AST Representation Le Chen et.al. 2305.05779v1 null
2023-05-09 Beyond Good Intentions: Reporting the Research Landscape of NLP for Social Good Fernando Gonzalez et.al. 2305.05471v1 link
2023-05-09 Estimating related words computationally using language model from the Mahabharata -- an Indian epic Vrunda Gadesha et.al. 2305.05420v1 null
2023-05-08 Knowledge-enhanced Agents for Interactive Text Games Prateek Chhikara et.al. 2305.05091v1 null
2023-05-08 A Unified Evaluation Framework for Novelty Detection and Accommodation in NLP with an Instantiation in Authorship Attribution Neeraj Varshney et.al. 2305.05079v1 null
2023-05-08 Dreams Are More "Predictable'' Than You Think Lorenzo Bertolini et.al. 2305.05054v1 link
2023-05-08 Knowledge Graph Guided Semantic Evaluation of Language Models For User Trust Kaushik Roy et.al. 2305.04989v1 null
2023-05-08 Towards Understanding Machine Learning Testing in Practise Arumoy Shome et.al. 2305.04988v1 null
2023-05-08 The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification Anastasiia Grishina et.al. 2305.04940v1 link
2023-05-08 Augmented Large Language Models with Parametric Knowledge Guiding Ziyang Luo et.al. 2305.04757v1 null
2023-05-08 Toeplitz Neural Network for Sequence Modeling Zhen Qin et.al. 2305.04749v1 link
2023-05-08 Differentially Private Attention Computation Yeqi Gao et.al. 2305.04701v1 null
2023-05-08 Putting Natural in Natural Language Processing Grzegorz Chrupała et.al. 2305.04572v1 null
2023-05-08 Multi-source Education Knowledge Graph Construction and Fusion for College Curricula Zeju Li et.al. 2305.04567v1 null
2023-05-08 Flex-SFU: Accelerating DNN Activation Functions by Non-Uniform Piecewise Approximation Enrico Reggiani et.al. 2305.04546v1 null
2023-05-08 A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues Yunxin Li et.al. 2305.04530v1 link
2023-05-08 Token-level Fitting Issues of Seq2seq Models Guangsheng Bao et.al. 2305.04493v1 null
2023-05-08 SmartState: A Protocol-Driven Human Interface Samuel E. Armstrong et.al. 2305.04411v1 link
2023-05-07 LatinCy: Synthetic Trained Pipelines for Latin NLP Patrick J. Burns et.al. 2305.04365v1 null
2023-05-05 How Segment Anything Model (SAM) Boost Medical Image Segmentation? Yichi Zhang et.al. 2305.03678v1 link
2023-05-05 Now It Sounds Like You: Learning Personalized Vocabulary On Device Sid Wang et.al. 2305.03584v1 null
2023-05-05 Context-Aware Semantic Similarity Measurement for Unsupervised Word Sense Disambiguation Jorge Martinez-Gil et.al. 2305.03520v1 link
2023-05-05 T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Large Language Model Signals for Science Question Answering Lei Wang et.al. 2305.03453v1 link
2023-05-05 Online Gesture Recognition using Transformer and Natural Language Processing G. C. M. Silvestre et.al. 2305.03407v1 null
2023-05-05 Visualization in the Era of Artificial Intelligence: Experiments for Creating Structural Visualizations by Prompting Large Language Models Hans-Georg Fill et.al. 2305.03380v1 null
2023-05-05 The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions, Cross-Cultural Humour, and Personalisation Lukas Christ et.al. 2305.03369v1 link
2023-05-05 MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic Damien Sileo et.al. 2305.03353v1 link
2023-05-05 HiPool: Modeling Long Documents Using Graph Neural Networks Irene Li et.al. 2305.03319v1 link
2023-05-05 A Survey on Out-of-Distribution Detection in NLP Hao Lang et.al. 2305.03236v1 null
2023-05-04 Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Recover the Whole Sentence Haoran Li et.al. 2305.03010v1 link
2023-05-04 Simple Noisy Environment Augmentation for Reinforcement Learning Raad Khraishi et.al. 2305.02882v1 link
2023-05-04 Interpretable Sentence Representation with Variational Autoencoders and Attention Ghazi Felhi et.al. 2305.02810v1 null
2023-05-04 The Elephant in the Room: Analyzing the Presence of Big Tech in Natural Language Processing Research Mohamed Abdalla et.al. 2305.02797v1 link
2023-05-04 DN at SemEval-2023 Task 12: Low-Resource Language Text Classification via Multilingual Pretrained Language Model Fine-tuning Daniil Homskiy et.al. 2305.02607v1 null
2023-05-04 AutoML-GPT: Automatic Machine Learning with GPT Shujian Zhang et.al. 2305.02499v1 null
2023-05-03 Quantifying the Dissimilarity of Texts Benjamin Shade et.al. 2305.02457v1 link
2023-05-03 Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs Deepak Narayanan et.al. 2305.02440v1 null
2023-05-03 Uncovering ChatGPT's Capabilities in Recommender Systems Sunhao Dai et.al. 2305.02182v1 link
2023-05-03 Natural language processing on customer note data Andrew Hilditch et.al. 2305.02029v1 null
2023-05-03 Exploring the Protein Sequence Space with Global Generative Models Sergio Romero-Romero et.al. 2305.01941v1 null
2023-05-03 Can Large Language Models Be an Alternative to Human Evaluations? Cheng-Han Chiang et.al. 2305.01937v1 null
2023-05-03 Improving Contrastive Learning of Sentence Embeddings from AI Feedback Qinyuan Cheng et.al. 2305.01918v1 link
2023-05-02 Post-Abstention: Towards Reliably Re-Attempting the Abstained Instances in QA Neeraj Varshney et.al. 2305.01812v1 null
2023-05-02 Don't Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner Zhengxiang Shi et.al. 2305.01711v1 link
2023-05-02 BrainNPT: Pre-training of Transformer networks for brain network classification Jinlong Hu et.al. 2305.01666v1 null
2023-05-02 The Benefits of Bad Advice: Autocontrastive Decoding across Model Layers Ariel Gera et.al. 2305.01628v1 link
2023-05-02 MultiLegalSBD: A Multilingual Legal Sentence Boundary Detection Dataset Tobias Brugger et.al. 2305.01211v1 link
2023-05-02 Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding Juan Zuluaga-Gomez et.al. 2305.01155v1 null
2023-05-02 RadAdapt: Radiology Report Summarization via Lightweight Domain Adaptation of Large Language Models Dave Van Veen et.al. 2305.01146v1 link
2023-05-01 Company classification using zero-shot learning Maryan Rizinski et.al. 2305.01028v1 null
2023-05-01 Attack-SAM: Towards Evaluating Adversarial Robustness of Segment Anything Model Chenshuang Zhang et.al. 2305.00866v1 null
2023-05-01 Performance and Energy Consumption of Parallel Machine Learning Algorithms Xidong Wu et.al. 2305.00798v1 null
2023-05-01 An Iterative Algorithm for Rescaled Hyperbolic Functions Regression Yeqi Gao et.al. 2305.00660v1 null
2023-05-01 Low-Resourced Machine Translation for Senegalese Wolof Language Derguene Mbaye et.al. 2305.00606v1 null
2023-04-30 Graph Global Attention Network with Memory for Fake News Detection Qian Chang et.al. 2305.00456v1 null
2023-04-29 Patent Mining by Extracting Functional Analysis Information Modelled As Graph Structure: A Patent Knowledge-base Collaborative Building Approach Manal E. Helal et.al. 2305.00309v1 null
2023-04-29 When Deep Learning Meets Polyhedral Theory: A Survey Joey Huchette et.al. 2305.00241v1 null
2023-04-29 Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework David Alonso del Barrio et.al. 2305.00182v1 null
2023-04-28 NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis Mingyang Wang et.al. 2305.00090v1 null
2023-04-28 Prompt Engineering for Healthcare: Methodologies and Applications Jiaqi Wang et.al. 2304.14670v1 null
2023-04-27 pyBibX -- A Python Library for Bibliometric and Scientometric Analysis Powered with Artificial Intelligence Tools Valdecy Pereira et.al. 2304.14516v1 link
2023-04-27 Framing the News:From Human Perception to Large Language Model Inferences David Alonso del Barrio et.al. 2304.14456v1 null
2023-04-27 string2string: A Modern Python Library for String-to-String Algorithms Mirac Suzgun et.al. 2304.14395v1 link
2023-04-26 Fine Tuning with Abnormal Examples Will Rieger et.al. 2304.13783v1 null
2023-04-27 Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond Jingfeng Yang et.al. 2304.13712v2 link
2023-04-26 FVP: Fourier Visual Prompting for Source-Free Unsupervised Domain Adaptation of Medical Image Segmentation Yan Wang et.al. 2304.13672v1 null
2023-04-26 Using Implicit Feedback to Improve Question Generation Hugo Rodrigues et.al. 2304.13664v1 null
2023-04-26 Impact of Position Bias on Language Models in Token Classification Mehdi Ben Amor et.al. 2304.13567v1 link
2023-04-26 Tensor Decomposition for Model Reduction in Neural Networks: A Review Xingyi Liu et.al. 2304.13539v1 null
2023-04-26 The Closeness of In-Context Learning and Weight Shifting for Softmax Regression Shuai Li et.al. 2304.13276v1 null
2023-04-25 Representing and extracting knowledge from single cell data Ionut Sebastian Mihai et.al. 2304.13084v1 null
2023-04-25 Optimizing Deep Learning Models For Raspberry Pi Salem Ameen et.al. 2304.13039v1 link
2023-04-25 The Potential of Visual ChatGPT For Remote Sensing Lucas Prado Osco et.al. 2304.13009v1 null
2023-04-24 Topological properties and organizing principles of semantic networks Gabriel Budel et.al. 2304.12940v1 null
2023-04-25 Lessons Learned from a Citizen Science Project for Natural Language Processing Jan-Christoph Klie et.al. 2304.12836v1 link
2023-04-25 What does BERT learn about prosody? Sofoklis Kakouros et.al. 2304.12706v1 null
2023-04-25 A Preliminary Evaluation of ChatGPT in Requirements Information Retrieval Jianzhang Zhang et.al. 2304.12562v1 link
2023-04-24 Understanding and Predicting Human Label Variation in Natural Language Inference through Explanation Nan-Jiang Jiang et.al. 2304.12443v1 null
2023-04-24 Semantic Tokenizer for Enhanced Natural Language Processing Sandeep Mehta et.al. 2304.12404v1 null
2023-04-24 ThreatCrawl: A BERT-based Focused Crawler for the Cybersecurity Domain Philipp Kuehn et.al. 2304.11960v1 null
2023-04-23 Graph Neural Networks for Text Classification: A Survey Kunze Wang et.al. 2304.11534v1 null
2023-04-22 Understanding Lexical Biases when Identifying Gang-related Social Media Communications Dhiraj Murthy et.al. 2304.11485v1 null
2023-04-22 A Review of Deep Learning for Video Captioning Moloud Abdar et.al. 2304.11431v1 null
2023-04-22 Romanian Multiword Expression Detection Using Multilingual Adversarial Training and Lateral Inhibition Andrei-Marius Avram et.al. 2304.11350v1 null
2023-04-21 The Role of AI in Human-AI Creative Writing for Hong Kong Secondary Students Hengky Susanto et.al. 2304.11276v1 null
2023-04-20 Backpropagation-free Training of Deep Physical Neural Networks Ali Momeni et.al. 2304.11042v1 null
2023-04-21 BERT Based Clinical Knowledge Extraction for Biomedical Knowledge Graph Construction and Analysis Ayoub Harnoune et.al. 2304.10996v1 null
2023-04-21 Information Extraction from Documents: Question Answering vs Token Classification in real-world setups Laurent Lam et.al. 2304.10994v1 null
2023-04-24 Text2Time: Transformer-based Article Time Period Prediction Karthick Prasad Gunasekaran et.al. 2304.10859v2 null
2023-04-21 Hyperbolic Geometry in Computer Vision: A Survey Pengfei Fang et.al. 2304.10764v1 null
2023-04-21 Improving Grounded Language Understanding in a Collaborative Environment by Interacting with Agents Through Help Feedback Nikhil Mehta et.al. 2304.10750v1 null
2023-04-20 IXA/Cogcomp at SemEval-2023 Task 2: Context-enriched Multilingual Named Entity Recognition using Knowledge Bases Iker García-Ferrero et.al. 2304.10637v1 link
2023-04-20 An Introduction to Transformers Richard E. Turner et.al. 2304.10557v1 null
2023-04-20 Multidimensional Uncertainty Quantification for Deep Neural Networks Xujiang Zhao et.al. 2304.10527v1 null
2023-04-20 Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health Shaoxiong Ji et.al. 2304.10447v1 null
2023-04-20 OptoGPT: A Foundation Model for Inverse Design in Optical Multilayer Thin Film Structures Taigao Ma et.al. 2304.10294v1 null
2023-04-20 Is augmentation effective to improve prediction in imbalanced text datasets? Gabriel O. Assunção et.al. 2304.10283v1 null
2023-04-20 Replication and Verifiability in Requirements Engineering: the NLP for RE Case Sallam Abualhaija et.al. 2304.10265v1 null
2023-04-20 Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning? Haoyang Peng et.al. 2304.10224v1 null
2023-04-19 Radar de Parité: An NLP system to measure gender representation in French news stories Valentin-Gabriel Soumah et.al. 2304.09982v1 link
2023-04-19 SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery Lalithkumar Seenivasan et.al. 2304.09974v1 link
2023-04-19 Catch Me If You Can: Identifying Fraudulent Physician Reviews with Large Language Models Using Generative Pre-Trained Transformers Aishwarya Deep Shukla et.al. 2304.09948v1 null
2023-04-19 Transformer-Based Visual Segmentation: A Survey Xiangtai Li et.al. 2304.09854v1 link
2023-04-19 Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models Pan Lu et.al. 2304.09842v1 link
2023-04-19 A Survey of Corpora for Germanic Low-Resource Languages and Dialects Verena Blaschke et.al. 2304.09805v1 link
2023-04-19 Bridging Natural Language Processing and Psycholinguistics: computationally grounded semantic similarity and relatedness datasets for Basque and Spanish J. Goikoetxea et.al. 2304.09616v1 null
2023-04-19 NetGPT: Generative Pretrained Transformer for Network Traffic Xuying Meng et.al. 2304.09513v1 null
2023-04-18 Revisiting k-NN for Pre-trained Language Models Lei Li et.al. 2304.09058v1 link
2023-04-18 From Words to Music: A Study of Subword Tokenization Techniques in Symbolic Music Generation Adarsh Kumar et.al. 2304.08953v1 null
2023-04-18 Along the Margins: Marginalized Communities' Ethical Concerns about Social Platforms Lauren Olson et.al. 2304.08882v1 null
2023-04-18 A Survey on Biomedical Text Summarization with Pre-trained Language Model Qianqian Xie et.al. 2304.08763v1 null
2023-04-17 Classification of US Supreme Court Cases using BERT-Based Techniques Shubham Vatsal et.al. 2304.08649v1 link
2023-04-17 Improving Autoregressive NLP Tasks via Modular Linearized Attention Victor Agostinelli et.al. 2304.08453v1 null
2023-04-17 Physics-inspired Neuroacoustic Computing Based on Tunable Nonlinear Multiple-scattering Ali Momeni et.al. 2304.08380v1 null
2023-04-17 Use of social media and Natural Language Processing (NLP) in natural hazard research José Augusto Proença Maia Devienne et.al. 2304.08341v1 null
2023-04-17 Thorny Roses: Investigating the Dual Use Dilemma in Natural Language Processing Lucie-Aimée Kaffee et.al. 2304.08315v1 link
2023-04-17 Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca Yiming Cui et.al. 2304.08177v1 link
2023-04-17 A Survey on Few-Shot Class-Incremental Learning Songsong Tian et.al. 2304.08130v1 null
2023-04-18 A Comparative Study between Full-Parameter and LoRA-based Fine-Tuning on Chinese Instruction Data for Instruction Following Large Language Model Xianghui Sun et.al. 2304.08109v2 link
2023-04-16 Chain of Thought Prompt Tuning in Vision Language Models Jiaxin Ge et.al. 2304.07919v1 null
2023-04-16 It's All in the Embedding! Fake News Detection Using Document Embeddings Ciprian-Octavian Truică et.al. 2304.07781v1 link
2023-04-16 Syntactic Complexity Identification, Measurement, and Reduction Through Controlled Syntactic Simplification Muhammad Salman et.al. 2304.07774v1 link
2023-04-14 Optimal inference of a generalised Potts model by single-layer transformers with factored attention Riccardo Rende et.al. 2304.07235v1 null
2023-04-14 DINOv2: Learning Robust Visual Features without Supervision Maxime Oquab et.al. 2304.07193v1 link
2023-04-14 Just Tell Me: Prompt Engineering in Business Process Management Kiran Busch et.al. 2304.07183v1 null
2023-04-14 Radio Galaxy Zoo EMU: Towards a Semantic Radio Galaxy Morphology Taxonomy Micah Bowles et.al. 2304.07171v1 link
2023-04-14 HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge Haochun Wang et.al. 2304.06975v1 link
2023-04-14 Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding Yu-Qi Yang et.al. 2304.06906v1 link
2023-04-14 Tempo vs. Pitch: understanding self-supervised tempo estimation Giovana Morais et.al. 2304.06868v1 link
2023-04-14 Exploring the State of the Art in Legal QA Systems Abdelrahman Abdallah et.al. 2304.06623v2 link
2023-04-13 Solving Tensor Low Cycle Rank Approximation Yichuan Deng et.al. 2304.06594v1 null
2023-04-13 Efficient Multimodal Fusion via Interactive Prompting Yaowei Li et.al. 2304.06306v1 null
2023-04-12 AGI for Agriculture Guoyu Lu et.al. 2304.06136v1 null
2023-04-12 ReDWINE: A Clinical Datamart with Text Analytical Capabilities to Facilitate Rehabilitation Research David Oniani et.al. 2304.05929v1 null
2023-04-12 ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning Viet Dac Lai et.al. 2304.05613v1 null
2023-04-11 A Survey of Resources and Methods for Natural Language Processing of Serbian Language Ulfeta A. Marovac et.al. 2304.05468v1 null
2023-04-10 SAM.MD: Zero-shot medical image segmentation capabilities of the Segment Anything Model Saikat Roy et.al. 2304.05396v1 null
2023-04-10 The Wall Street Neophyte: A Zero-Shot Analysis of ChatGPT Over MultiModal Stock Movement Prediction Challenges Qianqian Xie et.al. 2304.05351v1 null
2023-04-11 Toxicity in ChatGPT: Analyzing Persona-assigned Language Models Ameet Deshpande et.al. 2304.05335v1 null
2023-04-11 Prompt Learning for News Recommendation Zizhuo Zhang et.al. 2304.05263v1 link
2023-04-12 r-softmax: Generalized Softmax with Controllable Sparsity Rate Klaudia Bałazy et.al. 2304.05243v2 link
2023-04-11 What Food Do We Tweet about on a Rainy Day? Maija Kāle et.al. 2304.05041v1 null
2023-04-10 SELFormer: Molecular Representation Learning via SELFIES Language Models Atakan Yüksel et.al. 2304.04662v1 link
2023-04-10 On Evaluation of Bangla Word Analogies Mousumi Akter et.al. 2304.04613v1 null
2023-04-10 Two Steps Forward and One Behind: Rethinking Time Series Forecasting with Deep Learning Riccardo Ughi et.al. 2304.04553v1 null
2023-04-09 Extractive Summarization via ChatGPT for Faithful Summary Generation Haopeng Zhang et.al. 2304.04193v1 null
2023-04-08 MphayaNER: Named Entity Recognition for Tshivenda Rendani Mbuvha et.al. 2304.03952v1 link
2023-04-08 GPT4Rec: A Generative Framework for Personalized Recommendation and User Interests Interpretation Jinming Li et.al. 2304.03879v1 null
2023-04-07 On Efficient Training of Large-Scale Deep Learning Models: A Literature Review Li Shen et.al. 2304.03589v1 null
2023-04-07 HyperTab: Hypernetwork Approach for Deep Learning on Small Tabular Datasets Witold Wydmański et.al. 2304.03543v1 link
2023-04-06 Using LSTM and GRU With a New Dataset for Named Entity Recognition in the Arabic Language Alaa Shaker et.al. 2304.03399v1 null
2023-04-06 Deep Learning for Opinion Mining and Topic Classification of Course Reviews Anna Koufakou et.al. 2304.03394v1 null
2023-04-06 Entity Graphs for Exploring Online Discourse Nicholas Botzer et.al. 2304.03351v1 null
2023-04-06 On the Evaluations of ChatGPT and Emotion-enhanced Prompting for Mental Health Analysis Kailai Yang et.al. 2304.03347v1 link
2023-04-06 Locate: Low-Power Viterbi Decoder Exploration using Approximate Adders Rajat Bhattacharjya et.al. 2304.03257v1 null
2023-04-06 Bridging the Language Gap: Knowledge Injected Multilingual Question Answering Zhichao Duan et.al. 2304.03159v1 null
2023-04-06 Zero-Shot Next-Item Recommendation using Large Pretrained Language Models Lei Wang et.al. 2304.03153v1 null
2023-04-06 BotTriNet: A Unified and Efficient Embedding for Social Bots Detection via Metric Learning Jun Wu et.al. 2304.03144v1 null
2023-04-06 Static Fuzzy Bag-of-Words: a lightweight sentence embedding algorithm Matteo Muffo et.al. 2304.03098v1 null
2023-04-06 PointCAT: Cross-Attention Transformer for point cloud Xincheng Yang et.al. 2304.03012v1 link
2023-04-06 Can Large Language Models Play Text Games Well? Current State-of-the-Art and Open Questions Chen Feng Tsai et.al. 2304.02868v1 null
2023-04-06 Opportunities and challenges of ChatGPT for design knowledge management Xin Hu et.al. 2304.02796v1 null
2023-04-05 Application of Transformers based methods in Electronic Medical Records: A Systematic Literature Review Vitor Alcantara Batista et.al. 2304.02768v1 link
2023-04-05 The Saudi Privacy Policy Dataset Hend Al-Khalifa et.al. 2304.02757v1 link
2023-04-06 ParroT: Translating During Chat Using Large Language Models Wenxiang Jiao et.al. 2304.02426v2 link
2023-04-05 Machine Learning of Public Sentiments toward Wind Energy in Norway Oskar Vågerö et.al. 2304.02388v1 null
2023-04-05 Document-Level Machine Translation with Large Language Models Longyue Wang et.al. 2304.02210v1 link
2023-04-05 Unleashing the Power of ChatGPT for Translation: An Empirical Study Yuan Gao et.al. 2304.02182v1 null
2023-04-04 PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models Aditi Mishra et.al. 2304.01964v1 null
2023-04-04 Summary of ChatGPT/GPT-4 Research and Perspective Towards the Future of Large Language Models Yiheng Liu et.al. 2304.01852v1 null
2023-04-04 Is ChatGPT a Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation Tao Fang et.al. 2304.01746v1 null
2023-04-04 Rumour Detection and Analysis on Twitter Yaohou Fan et.al. 2304.01712v1 null
2023-04-04 A Survey on Contextualised Semantic Shift Detection Stefano Montanelli et.al. 2304.01666v1 null
2023-04-04 Neural Comprehension: Language Models with Compiled Neural Networks Yixuan Weng et.al. 2304.01665v1 link
2023-04-04 EDeR: A Dataset for Exploring Dependency Relations Between Events Ruiqi Li et.al. 2304.01612v1 link
2023-04-04 G2PTL: A Pre-trained Model for Delivery Address and its Applications in Logistics System Lixia Wu et.al. 2304.01559v1 null
2023-04-04 RARE: Robust Masked Graph Autoencoder Wenxuan Tu et.al. 2304.01507v1 null
2023-04-04 Unsupervised Brain Tumor Segmentation with Image-based Prompts Xinru Zhang et.al. 2304.01472v1 null
2023-04-03 Changes to Captions: An Attentive Network for Remote Sensing Change Captioning Shizhen Chang et.al. 2304.01091v1 link
2023-04-03 DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains Yanis Labrak et.al. 2304.00958v1 null
2023-04-03 ScandEval: A Benchmark for Scandinavian Natural Language Processing Dan Saattrup Nielsen et.al. 2304.00906v1 link
2023-04-03 GreekBART: The First Pretrained Greek Sequence-to-Sequence Model Iakovos Evdaimon et.al. 2304.00869v1 link
2023-04-03 Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: A Preliminary Empirical Study Yi Chen et.al. 2304.00723v1 null
2023-04-03 MiniRBT: A Two-stage Distilled Small Chinese Pre-trained Model Xin Yao et.al. 2304.00717v1 link
2023-04-03 DiffuRec: A Diffusion Model for Sequential Recommendation Zihao Li et.al. 2304.00686v1 link
2023-04-02 Classifying COVID-19 Related Tweets for Fake News Detection and Sentiment Analysis with BERT-based Models Rabia Bounaama et.al. 2304.00636v1 null
2023-04-02 MMT: A Multilingual and Multi-Topic Indian Social Media Dataset Dwip Dalal et.al. 2304.00634v1 null
2023-04-02 Sequence-aware item recommendations for multiply repeated user-item interactions Juan Pablo Equihua et.al. 2304.00578v1 null
2023-03-31 A Closer Look at Parameter-Efficient Tuning in Diffusion Models Chendong Xiang et.al. 2303.18181v1 link
2023-03-31 BERTino: an Italian DistilBERT model Matteo Muffo et.al. 2303.18121v1 link
2023-03-31 Dataset and Baseline System for Multi-lingual Extraction and Normalization of Temporal and Numerical Expressions Sanxing Chen et.al. 2303.18103v1 link
2023-03-31 ConceptEVA: Concept-Based Interactive Exploration and Customization of Document Summaries Xiaoyu Zhang et.al. 2303.17826v1 null
2023-03-31 Attention is Not Always What You Need: Towards Efficient Classification of Domain-Specific Text Yasmen Wahba et.al. 2303.17786v1 null
2023-03-30 A CI-based Auditing Framework for Data Collection Practices Athina Markopoulou et.al. 2303.17740v1 null
2023-03-30 Evaluation of GPT and BERT-based models on identifying protein-protein interactions in biomedical text Hasin Rehana et.al. 2303.17728v1 null
2023-03-30 BOLT: An Automated Deep Learning Framework for Training and Deploying Large-Scale Neural Networks on Commodity CPU Hardware Nicholas Meisburger et.al. 2303.17727v1 link
2023-03-30 Whether and When does Endoscopy Domain Pretraining Make Sense? Dominik Batić et.al. 2303.17636v1 null
2023-03-30 A BERT-based Unsupervised Grammatical Error Correction Framework Nankai Lin et.al. 2303.17367v1 null
2023-03-30 Topics in the Haystack: Extracting and Evaluating Topics beyond Coherence Anton Thielmann et.al. 2303.17324v1 null
2023-03-29 Adapting to the Low-Resource Double-Bind: Investigating Low-Compute Methods on Low-Resource African Languages Colin Leong et.al. 2303.16985v1 null
2023-03-29 AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators Xingwei He et.al. 2303.16854v1 link
2023-03-27 ACO-tagger: A Novel Method for Part-of-Speech Tagging using Ant Colony Optimization Amirhossein Mohammadi et.al. 2303.16760v1 null
2023-03-28 How can Deep Learning Retrieve the Write-Missing Additional Diagnosis from Chinese Electronic Medical Record For DRG Shaohui Liu et.al. 2303.16757v1 null
2023-03-29 LMExplainer: a Knowledge-Enhanced Explainer for Language Models Zichen Chen et.al. 2303.16537v1 null
2023-03-28 Exploring Natural Language Processing Methods for Interactive Behaviour Modelling Guanhua Zhang et.al. 2303.16039v1 null
2023-03-28 Soft-prompt tuning to predict lung cancer using primary care free-text Dutch medical notes Auke Elfrink et.al. 2303.15846v1 link
2023-03-28 Evaluation of ChatGPT for NLP-based Mental Health Applications Bishal Lamichhane et.al. 2303.15727v1 null
2023-03-28 Explicit Planning Helps Language Models in Logical Reasoning Hongyu Zhao et.al. 2303.15714v1 link
2023-03-27 Typhoon: Towards an Effective Task-Specific Masking Strategy for Pre-trained Language Models Muhammed Shahir Abdurrahman et.al. 2303.15619v1 null
2023-03-27 Evaluating self-attention interpretability through human-grounded experimental protocol Milan Bhan et.al. 2303.15190v1 null
2023-03-27 unarXive 2022: All arXiv Publications Pre-Processed for NLP, Including Structured Full-Text and Citation Network Tarek Saier et.al. 2303.14957v1 link
2023-03-27 Unified Text Structuralization with Instruction-tuned Language Models Xuanfan Ni et.al. 2303.14956v1 null
2023-03-27 Improving Contextualized Topic Models with Negative Sampling Suman Adhya et.al. 2303.14951v1 link
2023-03-27 Coupling Artificial Neurons in BERT and Biological Neurons in the Human Brain Xu Liu et.al. 2303.14871v1 null
2023-03-26 MGTBench: Benchmarking Machine-Generated Text Detection Xinlei He et.al. 2303.14822v1 link
2023-03-26 Nature Language Reasoning, A Survey Fei Yu et.al. 2303.14725v1 link
2023-03-25 Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities Atnafu Lambebo Tonja et.al. 2303.14406v1 link
2023-03-25 An Analysis of GPT-3's Performance in Grammatical Error Correction Steven Coyne et.al. 2303.14342v1 null
2023-03-25 Backdoor Attacks with Input-unique Triggers in NLP Xukun Zhou et.al. 2303.14325v1 null
2023-03-24 The crime of being poor Georgina Curto et.al. 2303.14128v1 null
2023-03-24 Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods Thilo Hagendorff et.al. 2303.13988v1 null
2023-03-24 Unleasing ChatGPT on the Metaverse: Savior or Destroyer? Pengyuan Zhou et.al. 2303.13856v1 null
2023-03-24 Where to Go Next for Recommender Systems? ID- vs. Modality-based recommender models revisited Zheng Yuan et.al. 2303.13835v1 link
2023-03-24 Natural language processing to automatically extract the presence and severity of esophagitis in notes of patients undergoing radiotherapy Shan Chen et.al. 2303.13722v1 link
2023-03-23 Primer: Fast Private Transformer Inference on Encrypted Data Mengxin Zheng et.al. 2303.13679v1 null
2023-03-23 Prompting Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages Zheng-Xin Yong et.al. 2303.13592v1 null
2023-03-23 Return of the RNN: Residual Recurrent Networks for Invertible Sentence Embeddings Jeremy Wilkerson et.al. 2303.13570v1 null
2023-03-22 Extracting Physical Rehabilitation Exercise Information from Clinical Notes: a Comparison of Rule-Based and Machine Learning Natural Language Processing Techniques Stephen W. Shaffran et.al. 2303.13466v1 null
2023-03-23 Human Behavior in the Time of COVID-19: Learning from Big Data Hanjia Lyu et.al. 2303.13452v1 null
2023-03-23 Development and validation of a natural language processing algorithm to pseudonymize documents in the context of a clinical data warehouse Xavier Tannier et.al. 2303.13451v1 null
2023-03-23 Parameter-Efficient Sparse Retrievers and Rerankers using Adapters Vaishali Pal et.al. 2303.13220v1 link
2023-03-22 Analyzing the Generalizability of Deep Contextualized Language Representations For Text Classification Berfu Buyukoz et.al. 2303.12936v1 null
2023-03-22 TRON: Transformer Neural Network Acceleration with Non-Coherent Silicon Photonics Salma Afifi et.al. 2303.12914v1 null
2023-03-22 A Small-Scale Switch Transformer and NLP-based Model for Clinical Narratives Classification Thanh-Dung Le et.al. 2303.12892v1 null
2023-03-22 MEGA: Multilingual Evaluation of Generative AI Kabir Ahuja et.al. 2303.12528v1 null
2023-03-22 System and Design Technology Co-optimization of SOT-MRAM for High-Performance AI Accelerator Memory System Kaniz Mishty et.al. 2303.12310v1 null
2023-03-21 Machine Learning for Brain Disorders: Transformers and Visual Transformers Robin Courant et.al. 2303.12068v1 null
2023-03-21 Transformers in Speech Processing: A Survey Siddique Latif et.al. 2303.11607v1 null
2023-03-21 Difficulty in learning chirality for Transformer fed with SMILES Yasuhiro Yoshikai et.al. 2303.11593v1 link
2023-03-21 SIFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency Shreyas Saxena et.al. 2303.11525v1 link
2023-03-20 Investigating Topological Order using Recurrent Neural Networks Mohamed Hibat-Allah et.al. 2303.11207v1 null
2023-03-20 On the Educational Impact of ChatGPT: Is Artificial Intelligence Ready to Obtain a University Degree? Kamil Malinka et.al. 2303.11146v1 null
2023-03-20 Controllable Ancient Chinese Lyrics Generation Based on Phrase Prototype Retrieving Li Yi et.al. 2303.11005v1 null
2023-03-20 Translate your gibberish: black-box adversarial attack on machine translation systems Andrei Chertkov et.al. 2303.10974v1 link
2023-03-20 Self-Improving-Leaderboard(SIL): A Call for Real-World Centric Natural Language Processing Leaderboards Chanjun Park et.al. 2303.10888v1 null
2023-03-20 NASA Science Mission Directorate Knowledge Graph Discovery Roelien C. Timmer et.al. 2303.10871v1 null
2023-03-20 Multi-task Transformer with Relation-attention and Type-attention for Named Entity Recognition Ying Mo et.al. 2303.10870v1 null
2023-03-18 Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning Qingru Zhang et.al. 2303.10512v1 link
2023-03-18 A Deep Learning System for Domain-specific speech Recognition Yanan Jia et.al. 2303.10510v1 null
2023-03-18 Is Prompt All You Need? No. A Comprehensive and Broader View of Instruction Learning Renze Lou et.al. 2303.10475v1 link
2023-03-17 IRGen: Generative Modeling for Image Retrieval Yidan Zhang et.al. 2303.10126v1 link
2023-03-17 STIXnet: A Novel and Modular Solution for Extracting All STIX Objects in CTI Reports Francesco Marchiori et.al. 2303.09999v1 link
2023-03-17 CoLT5: Faster Long-Range Transformers with Conditional Computation Joshua Ainslie et.al. 2303.09752v1 null
2023-03-16 Measuring Improvement of F $_1$ -Scores in Detection of Self-Admitted Technical Debt William Aiken et.al. 2303.09617v1 null
2023-03-17 BanglaCoNER: Towards Robust Bangla Complex Named Entity Recognition HAZ Sameen Shahgir et.al. 2303.09306v2 link
2023-03-16 Block-wise Bit-Compression of Transformer-based Models Gaochen Dong et.al. 2303.09184v1 null
2023-03-16 A Short Survey of Viewing Large Language Models in Legal Aspect Zhongxiang Sun et.al. 2303.09136v1 link
2023-03-15 Cross-domain Sentiment Classification in Spanish Lautaro Estienne et.al. 2303.08985v1 null
2023-03-17 Automated Interactive Domain-Specific Conversational Agents that Understand Human Dialogs Yankai Zeng et.al. 2303.08941v2 null
2023-03-15 Applying unsupervised keyphrase methods on concepts extracted from discharge sheets Hoda Memarzadeh et.al. 2303.08928v1 null
2023-03-15 ROSE: A Neurocomputational Architecture for Syntax Elliot Murphy et.al. 2303.08877v1 null
2023-03-15 Building an Effective Email Spam Classification Model with spaCy Kazem Taghandiki et.al. 2303.08792v1 null
2023-03-14 Clinical Concept and Relation Extraction Using Prompt-based Machine Reading Comprehension Cheng Peng et.al. 2303.08262v1 null
2023-03-14 Contextualized Medication Information Extraction Using Transformer-based Deep Learning Architectures Aokun Chen et.al. 2303.08259v1 null
2023-03-14 Progress Note Understanding -- Assessment and Plan Reasoning: Overview of the 2022 N2C2 Track 3 Shared Task Yanjun Gao et.al. 2303.08038v1 null
2023-03-14 Geolocation Predicting of Tweets Using BERT-Based Models Kateryna Lutsai et.al. 2303.07865v1 link
2023-03-14 Input-length-shortening and text generation via attention values Neşet Özkan Tan et.al. 2303.07585v1 null
2023-03-14 Diffusion Models in NLP: A Survey Yuansong Zhu et.al. 2303.07576v1 null
2023-03-13 Automated Vulnerability Detection in Source Code Using Quantum Natural Language Processing Mst Shapna Akter et.al. 2303.07525v1 null
2023-03-13 X-Former: In-Memory Acceleration of Transformers Shrihari Sridharan et.al. 2303.07470v1 null
2023-03-13 Learning the language of QCD jets with transformers Thorben Finke et.al. 2303.07364v1 null
2023-03-13 Scaling Vision-Language Models with Sparse Mixture of Experts Sheng Shen et.al. 2303.07226v1 null
2023-03-13 A Comprehensive Empirical Evaluation of Existing Word Embedding Approaches Obaidullah Zaland et.al. 2303.07196v1 null
2023-03-13 $\nabla$ SD: Differentiable Programming for Sparse Tensors Amir Shaikhha et.al. 2303.07030v1 null
2023-03-13 Roadmap towards Meta-being Tianyi Huang et.al. 2303.06795v1 null
2023-03-12 AidUI: Toward Automated Recognition of Dark Patterns in User Interfaces SM Hasan Mansur et.al. 2303.06782v1 link
2023-03-12 Diffusion Models for Non-autoregressive Text Generation: A Survey Yifan Li et.al. 2303.06574v1 link
2023-03-11 Graph Neural Network contextual embedding for Deep Learning on Tabular Data Mario Villaizán-Vallelado et.al. 2303.06455v1 link
2023-03-11 Explainable AI for Time Series via Virtual Inspection Layers Johanna Vielhaben et.al. 2303.06365v1 null
2023-03-10 Generating Query Focused Summaries without Fine-tuning the Transformer-based Pre-trained Models Deen Abdullah et.al. 2303.06230v1 null
2023-03-10 Towards MoE Deployment: Mitigating Inefficiencies in Mixture-of-Expert (MoE) Inference Haiyang Huang et.al. 2303.06182v1 null
2023-03-10 Distributionally Robust Optimization with Probabilistic Group Soumya Suvra Ghosal et.al. 2303.05809v1 link
2023-03-10 An Overview on Language Models: Recent Developments and Outlook Chengwei Wei et.al. 2303.05759v1 null
2023-03-10 Research on CPI Prediction Based on Natural Language Processing Xiaobin Tang et.al. 2303.05666v1 null
2023-03-09 Open World Classification with Adaptive Negative Samples Ke Bai et.al. 2303.05581v1 null
2023-03-08 Automatic Detection of Industry Sectors in Legal Articles Using Machine Learning Approaches Hui Yang et.al. 2303.05387v1 null
2023-03-09 Dynamic Stashing Quantization for Efficient Transformer Training Guo Yang et.al. 2303.05295v1 null
2023-03-09 Can a Frozen Pretrained Language Model be used for Zero-shot Neural Retrieval on Entity-centric Questions? Yasuto Hoshi et.al. 2303.05153v1 null
2023-03-09 ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction Jiabang He et.al. 2303.05063v1 link
2023-03-09 Rethinking Visual Prompt Learning as Masked Visual Token Modeling Ning Liao et.al. 2303.04998v1 null
2023-03-08 DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks Zohreh Aghababaeyan et.al. 2303.04878v1 link
2023-03-08 Non-Binary Gender Expression in Online Interactions Rebecca Dorn et.al. 2303.04837v1 null
2023-03-08 Comprehensive Event Representations using Event Knowledge Graphs and Natural Language Processing Tin Kuculo et.al. 2303.04794v1 null
2023-03-08 Student's t-Distribution: On Measuring the Inter-Rater Reliability When the Observations are Scarce Serge Gladkoff et.al. 2303.04526v1 null
2023-03-08 An Annexure to the Paper "Driving the Technology Value Stream by Analyzing App Reviews" Souvick Das et.al. 2303.04519v1 null
2023-03-07 A Challenging Benchmark for Low-Resource Learning Yudong Wang et.al. 2303.03840v1 link
2023-03-07 Exploring the Feasibility of ChatGPT for Event Extraction Jun Gao et.al. 2303.03836v1 null
2023-03-06 Multi-resolution Interpretation and Diagnostics Tool for Natural Language Classifiers Peyman Jalali et.al. 2303.03542v1 null
2023-03-06 Guilt Detection in Text: A Step Towards Understanding Complex Emotions Abdul Gafar Manuel Meque et.al. 2303.03510v1 null
2023-03-06 On the Visualisation of Argumentation Graphs to Support Text Interpretation Hanadi Mardah et.al. 2303.03235v1 null
2023-03-03 Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT Mostafa M. Amin et.al. 2303.03186v1 null
2023-03-03 Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis Vikramjit Mitra et.al. 2303.03177v1 null
2023-03-06 GlobalNER: Incorporating Non-local Information into Named Entity Recognition Chiao-Wei Hsu et.al. 2303.02915v1 null
2023-03-06 Artificial Intelligence: 70 Years Down the Road Lin Zhang et.al. 2303.02819v1 null
2023-03-05 Swim: A General-Purpose, High-Performing, and Efficient Activation Function for Locomotion Control Tasks Maryam Abdool et.al. 2303.02640v1 link
2023-03-04 Variational Quantum Classifiers for Natural-Language Text Daniel T. Chang et.al. 2303.02469v1 null
2023-03-04 Calibrating Transformers via Sparse Gaussian Processes Wenlong Chen et.al. 2303.02444v1 link
2023-03-03 TrojText: Test-time Invisible Textual Trojan Insertion Yepeng Liu et.al. 2303.02242v1 link
2023-03-03 Exploring Data Augmentation Methods on Social Media Corpora Isabel Garcia Pietri et.al. 2303.02198v1 null
2023-03-02 DeepLens: Interactive Out-of-distribution Data Detection in NLP Models Da Song et.al. 2303.01577v1 link
2023-03-02 DeepSeer: Interactive RNN Explanation and Debugging via State Abstraction Zhijie Wang et.al. 2303.01576v1 link
2023-03-02 Local data structures J. F. Jardine et.al. 2303.01415v1 null
2023-03-02 Letz Translate: Low-Resource Machine Translation for Luxembourgish Yewei Song et.al. 2303.01347v1 null
2023-03-01 Frauds Bargain Attack: Generating Adversarial Text Samples via Word Manipulation Process Mingze Ni et.al. 2303.01234v1 link
2023-03-02 Can BERT Refrain from Forgetting on Sequential Tasks? A Probing Study Mingxu Tao et.al. 2303.01081v1 link
2023-03-01 SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks Kai-Wei Chang et.al. 2303.00733v1 null
2023-03-01 Uzbek text summarization based on TF-IDF Khabibulla Madatov et.al. 2303.00461v1 null
2023-03-01 How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks Xuanting Chen et.al. 2303.00293v1 null
2023-03-01 Machine-learning Repurposing of DrugBank Compounds for Opioid Use Disorder Hongsong Feng et.al. 2303.00240v1 link
2023-02-28 Automatic Scoring of Dream Reports' Emotional Content with Large Language Models Lorenzo Bertolini et.al. 2302.14828v1 link
2023-02-28 AccelTran: A Sparsity-Aware Accelerator for Dynamic Inference with Transformers Shikhar Tuli et.al. 2302.14705v1 link
2023-02-28 Improving Expert Specialization in Mixture of Experts Yamuna Krishnamurthy et.al. 2302.14703v1 link
2023-02-27 SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic Speech Processing Weidong Chen et.al. 2302.14638v1 link
2023-02-28 H-AES: Towards Automated Essay Scoring for Hindi Shubhankar Singh et.al. 2302.14635v1 link
2023-02-28 A Survey on Long Text Modeling with Transformers Zican Dong et.al. 2302.14502v1 null
2023-02-28 Text classification dataset and analysis for Uzbek language Elmurod Kuriyozov et.al. 2302.14494v1 link
2023-02-28 Efficient Masked Autoencoders with Self-Consistency Zhaowen Li et.al. 2302.14431v1 null
2023-02-28 HugNLP: A Unified and Comprehensive Library for Natural Language Processing Jianing Wang et.al. 2302.14286v1 link
2023-02-27 Inseq: An Interpretability Toolkit for Sequence Generation Models Gabriele Sarti et.al. 2302.13942v1 link
2023-02-24 Adapting Pre-trained Language Models for Quantum Natural Language Processing Qiuchi Li et.al. 2302.13812v1 null
2023-02-26 A Survey on Uncertainty Quantification Methods for Deep Neural Networks: An Uncertainty Source Perspective Wenchong He et.al. 2302.13425v1 null
2023-02-26 From Audio to Symbolic Encoding Shenli Yuan et.al. 2302.13401v1 null
2023-02-26 The blame game: Understanding blame assignment in social media Ruijie Xi et.al. 2302.13352v1 null
2023-02-26 Bayesian Networks for Named Entity Prediction in Programming Community Question Answering Alexey Gorbatovski et.al. 2302.13253v1 null
2023-02-25 ChatAug: Leveraging ChatGPT for Text Data Augmentation Haixing Dai et.al. 2302.13007v1 null
2023-02-24 STA: Self-controlled Text Augmentation for Improving Text Classifications Congcong Wang et.al. 2302.12784v1 link
2023-02-24 Time-aware Multiway Adaptive Fusion Network for Temporal Knowledge Graph Question Answering Yonghao Liu et.al. 2302.12529v1 null
2023-02-24 SGL-PT: A Strong Graph Learner with Graph Prompt Tuning Yun Zhu et.al. 2302.12449v1 null
2023-02-23 What makes a language easy to deep-learn? Lukas Galke et.al. 2302.12239v1 link
2023-02-23 Deep learning model for Mongolian Citizens Feedback Analysis using Word Vector Embeddings Zolzaya Dashdorj et.al. 2302.12069v1 null
2023-02-23 Natural Language Processing in the Legal Domain Daniel Martin Katz et.al. 2302.12039v1 null
2023-02-23 Sentence Simplification via Large Language Models Yutao Feng et.al. 2302.11957v1 link
2023-02-23 Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers Minsoo Kim et.al. 2302.11812v1 link
2023-02-23 MUTANT: A Multi-sentential Code-mixed Hinglish Dataset Rahul Gupta et.al. 2302.11766v1 null
2023-02-24 VLSP2022 EVJVQA Challenge: Multilingual Visual Question Answering Ngan Luu-Thuy Nguyen et.al. 2302.11752v2 null
2023-02-22 Scaling Robot Learning with Semantically Imagined Experience Tianhe Yu et.al. 2302.11550v1 null
2023-02-22 Data Augmentation for Neural NLP Domagoj Pluščec et.al. 2302.11412v1 null
2023-02-22 Learning from Predictions: Fusing Training and Autoregressive Inference for Long-Term Spatiotemporal Forecasts Pantelis R. Vlachas et.al. 2302.11101v1 null
2023-02-22 Preventing Catastrophic Forgetting in Continual Learning of New Natural Language Tasks Sudipta Kar et.al. 2302.11074v1 null
2023-02-21 Device Tuning for Multi-Task Large Model Penghao Jiang et.al. 2302.10820v1 null
2023-02-21 ChatGPT: Jack of all trades, master of none Jan Kocoń et.al. 2302.10724v1 link
2023-02-21 NLPLego: Assembling Test Generation for Natural Language Processing Applications Pin Ji et.al. 2302.10499v1 null
2023-02-21 Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines Min Cen et.al. 2302.10406v1 null
2023-02-20 Exploring the Limits of Transfer Learning with Unified Model in the Cybersecurity Domain Kuntal Kumar Pal et.al. 2302.10346v1 null
2023-02-20 Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey Xiao Wang et.al. 2302.10035v1 link
2023-02-20 Boosting classification reliability of NLP transformer models in the long run Zoltán Kmetty et.al. 2302.10016v1 null
2023-02-19 mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization Kayhan Behdin et.al. 2302.09693v1 null
2023-02-19 Optimization Methods in Deep Learning: A Comprehensive Overview David Shulman et.al. 2302.09566v1 null
2023-02-19 SanskritShala: A Neural Sanskrit NLP Toolkit with Web-Based Interface for Pedagogical and Annotation Purposes Jivnesh Sandhan et.al. 2302.09527v1 link
2023-02-18 BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark Dakuan Lu et.al. 2302.09432v1 link
2023-02-18 A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT Ce Zhou et.al. 2302.09419v1 null
2023-02-18 Redes Generativas Adversarias (GAN) Fundamentos Teóricos y Aplicaciones Jordi de la Torre et.al. 2302.09346v1 null
2023-02-18 Transformadores: Fundamentos teoricos y Aplicaciones Jordi de la Torre et.al. 2302.09327v1 null
2023-02-17 Extraction of Constituent Factors of Digestion Efficiency in Information Transfer by Media Composed of Texts and Images Koike Hiroaki et.al. 2302.09189v1 null
2023-02-17 Massively Multilingual Shallow Fusion with Large Language Models Ke Hu et.al. 2302.08917v1 null
2023-02-16 Role of Bias Terms in Dot-Product Attention Mahdi Namazifar et.al. 2302.08626v1 null
2023-02-16 What A Situated Language-Using Agent Must be Able to Do: A Top-Down Analysis David Schlangen et.al. 2302.08590v1 null
2023-02-16 Foundation Models for Natural Language Processing -- Pre-trained Language Models Integrating Media Gerhard Paaß et.al. 2302.08575v1 null
2023-02-16 THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic Compression Minghao Li et.al. 2302.08545v1 link
2023-02-16 Counting Carbon: A Survey of Factors Influencing the Emissions of Machine Learning Alexandra Sasha Luccioni et.al. 2302.08476v1 link
2023-02-17 Efficiency 360: Efficient Vision Transformers Badri N. Patro et.al. 2302.08374v2 link
2023-02-16 A Survey on Event-based News Narrative Extraction Brian Keith Norambuena et.al. 2302.08351v1 null
2023-02-16 Tuning computer vision models with task rewards André Susano Pinto et.al. 2302.08242v1 link
2023-02-16 Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition Minsu Kim et.al. 2302.08102v1 null
2023-02-16 Exploring the Limits of ChatGPT for Query or Aspect-based Text Summarization Xianjun Yang et.al. 2302.08081v1 null
2023-02-16 LabelPrompt: Effective Prompt-based Learning for Relation Classification Wenjie Zhang et.al. 2302.08068v1 null
2023-02-16 GraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural Networks Zemin Liu et.al. 2302.08043v1 null
2023-02-15 Commonsense Reasoning for Conversational AI: A Survey of the State of the Art Christopher Richardson et.al. 2302.07926v1 null
2023-02-15 Big Little Transformer Decoder Sehoon Kim et.al. 2302.07863v1 link
2023-02-15 Word class representations spontaneously emerge in a deep neural network trained on next word prediction Kishore Surendra et.al. 2302.07588v1 null
2023-02-14 Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models Shrimai Prabhumoye et.al. 2302.07388v1 null
2023-02-14 Few-shot learning approaches for classifying low resource domain specific software requirements Anmol Nayak et.al. 2302.06951v1 null
2023-02-14 SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains Koustava Goswami et.al. 2302.06868v1 link
2023-02-14 Language Model Analysis for Ontology Subsumption Inference Yuan He et.al. 2302.06761v1 link
2023-02-13 Large Scale Multi-Lingual Multi-Modal Summarization Dataset Yash Verma et.al. 2302.06560v1 link
2023-02-13 Visualizing Topic Uncertainty in Topic Modelling Peter Winker et.al. 2302.06482v1 null
2023-02-13 Linguistic ambiguity analysis in ChatGPT Miguel Ortega-Martín et.al. 2302.06426v1 null
2023-02-13 Dataset of Natural Language Queries for E-Commerce Andrea Papenmeier et.al. 2302.06355v1 null
2023-02-12 TextDefense: Adversarial Text Detection based on Word Importance Entropy Lujia Shen et.al. 2302.05892v1 null
2023-02-12 "Why is this misleading?": Detecting News Headline Hallucinations with Explanations Jiaming Shen et.al. 2302.05852v1 null
2023-02-11 Sequential Embedding-based Attentive (SEA) classifier for malware classification Muhammad Ahmed et.al. 2302.05728v1 link
2023-02-11 Synthesizing Human Gaze Feedback for Improved NLP Performance Varun Khurana et.al. 2302.05721v1 null
2023-02-11 MatKB: Semantic Search for Polycrystalline Materials Synthesis Procedures Xianjun Yang et.al. 2302.05597v1 link
2023-02-10 A Practical Mixed Precision Algorithm for Post-Training Quantization Nilesh Prasad Pandey et.al. 2302.05397v1 null
2023-02-10 Translating Natural Language to Planning Goals with Large-Language Models Yaqi Xie et.al. 2302.05128v1 link
2023-02-10 Step by Step Loss Goes Very Far: Multi-Step Quantization for Adversarial Text Attacks Piotr Gaiński et.al. 2302.05120v1 link
2023-02-09 Flexible, Model-Agnostic Method for Materials Data Extraction from Text Using General Purpose Language Models Maciej P. Polak et.al. 2302.04914v1 null
2023-02-09 AI-based Question Answering Assistance for Analyzing Natural-language Requirements Saad Ezzini et.al. 2302.04793v1 null
2023-02-09 Massively Multilingual Language Models for Cross Lingual Fact Extraction from Low Resource Indian Languages Bhavyajeet Singh et.al. 2302.04790v1 link
2023-02-09 Lightweight Transformers for Clinical Natural Language Processing Omid Rohanian et.al. 2302.04725v1 link
2023-02-09 Mixed-order self-paced curriculum learning for universal lesion detection Han Li et.al. 2302.04677v1 null
2023-02-09 NLP-based Decision Support System for Examination of Eligibility Criteria from Securities Prospectuses at the German Central Bank Christian Hänig et.al. 2302.04562v1 null
2023-02-09 Enhancing E-Commerce Recommendation using Pre-Trained Language Model and Fine-Tuning Nuofan Xu et.al. 2302.04443v1 null
2023-02-08 Sentiment analysis and opinion mining on educational data: A survey Thanveer Shaik et.al. 2302.04359v1 null
2023-02-08 CRL+: A Novel Semi-Supervised Deep Active Contrastive Representation Learning-Based Text Classification Model for Insurance Data Amir Namavar Jahromi et.al. 2302.04343v1 null
2023-02-08 Efficient Joint Learning for Clinical Named Entity Recognition and Relation Extraction Using Fourier Networks: A Use Case in Adverse Drug Events Anthony Yazdani et.al. 2302.04185v1 link
2023-02-08 Training-free Lexical Backdoor Attacks on Language Models Yujin Huang et.al. 2302.04116v1 link
2023-02-08 An Empirical Study of Uniform-Architecture Knowledge Distillation in Document Ranking Xubo Qin et.al. 2302.04112v1 null
2023-02-08 Automating Code-Related Tasks Through Transformers: The Impact of Pre-training Rosalia Tufano et.al. 2302.04048v1 link
2023-02-08 Revisiting Offline Compression: Going Beyond Factorization-based Methods for Transformer Language Models Mohammadreza Banaei et.al. 2302.04045v1 link
2023-02-08 On the Applicability of Language Models to Block-Based Programs Elisabeth Griebl et.al. 2302.03927v1 null
2023-02-08 CRAFT: Criticality-Aware Fault-Tolerance Enhancement Techniques for Emerging Memories-Based Deep Neural Networks Thai-Hoang Nguyen et.al. 2302.03862v1 null
2023-02-07 Pre-train, Prompt and Recommendation: A Comprehensive Survey of Language Modelling Paradigm Adaptations in Recommender Systems Peng Liu et.al. 2302.03735v1 link
2023-02-07 Characterizing Financial Market Coverage using Artificial Intelligence Jean Marie Tshimula et.al. 2302.03694v1 null
2023-02-08 A Survey on Arabic Named Entity Recognition: Past, Recent Advances, and Future Trends Xiaoye Qu et.al. 2302.03512v2 null
2023-02-07 Natural Language Processing for Policymaking Zhijing Jin et.al. 2302.03490v1 null
2023-02-06 APAM: Adaptive Pre-training and Adaptive Meta Learning in Language Model for Noisy Labels and Long-tailed Learning Sunyi Chi et.al. 2302.03488v1 null
2023-02-07 What do Language Models know about word senses? Zero-Shot WSD with Language Models and Domain Inventories Oscar Sainz et.al. 2302.03353v1 null
2023-02-07 Continual Learning of Language Models Zixuan Ke et.al. 2302.03241v1 link
2023-02-07 Bringing the State-of-the-Art to Customers: A Neural Agent Assistant Framework for Customer Service Support Stephen Obadinma et.al. 2302.03222v1 link
2023-02-06 Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design Lyle Regenwetter et.al. 2302.02913v1 null
2023-02-06 Findings of the TSAR-2022 Shared Task on Multilingual Lexical Simplification Horacio Saggion et.al. 2302.02888v1 null
2023-02-07 Less is More: Understanding Word-level Textual Adversarial Attack via n-gram Frequency Descend Ning Lu et.al. 2302.02568v2 null
2023-02-06 Deep Learning for Time Series Classification and Extrinsic Regression: A Current Survey Navid Mohammadi Foumani et.al. 2302.02515v1 link
2023-02-05 VuLASTE: Long Sequence Model with Abstract Syntax Tree Embedding for vulnerability Detection Botong Zhu et.al. 2302.02345v1 null
2023-02-05 A Semantic Approach to Negation Detection and Word Disambiguation with Natural Language Processing Izunna Okpala et.al. 2302.02291v1 null
2023-02-04 Knowledge Distillation in Vision Transformers: A Critical Review Gousia Habib et.al. 2302.02108v1 null
2023-02-03 Witscript: A System for Generating Improvised Jokes in a Conversation Joe Toplyn et.al. 2302.02008v1 null
2023-02-06 Analyzing the impact of climate change on critical infrastructure from the scientific literature: A weakly supervised NLP approach Tanwi Mallick et.al. 2302.01887v2 null
2023-02-03 Lexical Simplification using multi level and modular approach Nikita Katyal et.al. 2302.01823v1 null
2023-02-03 Mitigating Data Scarcity for Large Language Models Hoang Van et.al. 2302.01806v1 link
2023-02-03 Bioformer: an efficient transformer language model for biomedical text mining Li Fang et.al. 2302.01588v1 link
2023-02-03 ResMem: Learn what you can and memorize the rest Zitong Yang et.al. 2302.01576v1 null
2023-02-03 Witgenstein's influence on artificial intelligence Piero Molino et.al. 2302.01570v1 null
2023-02-03 Using natural language processing and structured medical data to phenotype patients hospitalized due to COVID-19 Feier Chang et.al. 2302.01536v1 null
2023-02-03 SPADE: Self-supervised Pretraining for Acoustic DisEntanglement John Harvill et.al. 2302.01483v1 null
2023-02-02 Commonsense-Aware Prompting for Controllable Empathetic Dialogue Generation Yiren Liu et.al. 2302.01441v1 null
2023-02-02 Mixed Precision Post Training Quantization of Neural Networks with Sensitivity Guided Search Clemens JS Schaefer et.al. 2302.01382v1 null
2023-02-02 Modeling opinion polarization on social media: application to Covid-19 vaccination hesitancy in Italy Jonathan Franceschi et.al. 2302.01028v1 null
2023-02-02 Resilient Binary Neural Network Sheng Xu et.al. 2302.00956v1 link
2023-02-02 How to choose "Good" Samples for Text Data Augmentation Xiaotian Lin et.al. 2302.00894v1 null
2023-02-02 idT5: Indonesian Version of Multilingual T5 Transformer Mukhlish Fuadi et.al. 2302.00856v1 null
2023-02-01 FAVOR#: Sharp Attention Kernel Approximations via New Classes of Positive Random Features Valerii Likhosherstov et.al. 2302.00787v1 null
2023-02-01 User Study for Improving Tools for Bible Translation Joel Mathew et.al. 2302.00778v1 null
2023-02-01 Developing Hands-on Labs for Source Code Vulnerability Detection with AI Maryam Taeb et.al. 2302.00750v1 null
2023-02-01 Versatile Energy-Based Models for High Energy Physics Taoli Cheng et.al. 2302.00695v1 link
2023-02-01 Energy-Based Survival Models for Predictive Maintenance Olov Holmer et.al. 2302.00629v1 null
2023-02-01 Feed-Forward Blocks Control Contextualization in Masked Language Models Goro Kobayashi et.al. 2302.00456v1 link
2023-02-01 On the Role of Morphological Information for Contextual Lemmatization Olia Toporkov et.al. 2302.00407v1 null
2023-01-31 Large Language Models Can Be Easily Distracted by Irrelevant Context Freda Shi et.al. 2302.00093v1 link
2023-01-31 PADL: Language-Directed Physics-Based Character Control Jordan Juravsky et.al. 2301.13868v1 link
2023-01-31 Partitioning Distributed Compute Jobs with Reinforcement Learning and Graph Neural Networks Christopher W. F. Parsonson et.al. 2301.13799v1 null
2023-01-31 Zero-shot cross-lingual transfer language selection using linguistic similarity Juuso Eronen et.al. 2301.13720v1 null
2023-01-31 Friend-training: Learning from Models of Different but Related Tasks Mian Zhang et.al. 2301.13683v1 null
2023-02-01 What Makes Good Examples for Visual In-Context Learning? Yuanhan Zhang et.al. 2301.13670v2 link
2023-01-30 Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation Minglun Han et.al. 2301.13003v1 link
2023-01-30 Exploring AI Ethics of ChatGPT: A Diagnostic Analysis Terry Yue Zhuo et.al. 2301.12867v1 null
2023-01-30 Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features Sishuo Chen et.al. 2301.12715v1 link
2023-01-30 UzbekTagger: The rule-based POS tagger for Uzbek language Maksud Sharipov et.al. 2301.12711v1 null
2023-01-29 Large Language Models for Biomedical Causal Graph Construction Vahan Arsenyan et.al. 2301.12473v1 null
2023-01-29 DocILE 2023 Teaser: Document Information Localization and Extraction Štěpán Šimsa et.al. 2301.12394v1 null
2023-01-28 HAT-GAE: Self-Supervised Graph Auto-encoders with Hierarchical Adaptive Masking and Trainable Corruption Chengyu Sun et.al. 2301.12063v1 null
2023-01-27 Improved knowledge distillation by utilizing backward pass knowledge in neural networks Aref Jafari et.al. 2301.12006v1 null
2023-01-27 Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation Jessica Huynh et.al. 2301.12004v1 null
2023-01-27 Gender and Prestige Bias in Coronavirus News Reporting Rebecca Dorn et.al. 2301.11994v1 null
2023-01-27 A Comparative Study of Pretrained Language Models for Long Clinical Text Yikuan Li et.al. 2301.11847v1 link
2023-01-27 Incorporating Knowledge into Document Summarization: an Application of Prefix-Tuning on GPT-2 Chen Chen et.al. 2301.11719v1 null
2023-01-27 SLCNN: Sentence-Level Convolutional Neural Network for Text Classification Ali Jarrahi et.al. 2301.11696v1 null
2023-01-27 A rule-free workflow for the automated generation of databases from scientific literature Luke P. J. Gilligan et.al. 2301.11689v1 link
2023-01-27 Down the Rabbit Hole: Detecting Online Extremism, Radicalisation, and Politicised Hate Speech Jarod Govers et.al. 2301.11579v1 null
2023-01-26 Beyond Arabic: Software for Perso-Arabic Script Manipulation Alexander Gutkin et.al. 2301.11406v1 link
2023-01-24 Semi-Automated Construction of Food Composition Knowledge Base Jason Youn et.al. 2301.11322v1 link
2023-01-26 LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization Laura Nguyen et.al. 2301.11312v1 link
2023-01-26 Box $^2$ EL: Concept and Role Box Embeddings for the Description Logic EL++ Mathias Jackermeier et.al. 2301.11118v1 link
2023-01-26 NLP as a Lens for Causal Analysis and Perception Mining to Infer Mental Health on Social Media Muskan Garg et.al. 2301.11004v1 null
2023-01-25 Qualitative Analysis of a Graph Transformer Approach to Addressing Hate Speech: Adapting to Dynamically Changing Content Liam Hebert et.al. 2301.10871v1 null
2023-01-25 Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement Gavin Abercrombie et.al. 2301.10684v1 null
2023-01-25 Understanding and Improving Deep Graph Neural Networks: A Probabilistic Graphical Model Perspective Jiayuan Chen et.al. 2301.10536v1 null
2023-01-25 Cross-lingual Argument Mining in the Medical Domain Anar Yeginbergenova et.al. 2301.10527v1 link
2023-01-25 Knowledge-augmented Graph Neural Networks with Concept-aware Attention for Adverse Drug Event Detection Shaoxiong Ji et.al. 2301.10451v1 null
2023-01-25 BDMMT: Backdoor Sample Detection for Language Models through Model Mutation Testing Jiali Wei et.al. 2301.10412v1 null
2023-01-24 A Framework To Improve User Story Sets Through Collaboration Salih Göktuğ Köse et.al. 2301.10070v1 null
2023-01-24 Multitask Instruction-based Prompting for Fallacy Recognition Tariq Alhindi et.al. 2301.09992v1 null
2023-01-24 Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression Jaeyong Song et.al. 2301.09830v1 null
2023-01-24 Transformer-Patcher: One Mistake worth One Neuron Zeyu Huang et.al. 2301.09785v1 link
2023-01-23 Noisy Parallel Data Alignment Ruoyu Xie et.al. 2301.09685v1 link
2023-01-22 Face Generation from Textual Features using Conditionally Trained Inputs to Generative Adversarial Networks Sandeep Shinde et.al. 2301.09123v1 null
2023-01-22 Differentially Private Natural Language Models: Recent Advances and Future Directions Lijie Hu et.al. 2301.09112v1 null
2023-01-22 Learning to Reject with a Fixed Predictor: Application to Decontextualization Christopher Mohri et.al. 2301.09044v1 null
2023-01-21 A Semantic Modular Framework for Events Topic Modeling in Social Media Arya Hadizadeh Moghaddam et.al. 2301.09009v1 null
2023-01-21 Blacks is to Anger as Whites is to Joy? Understanding Latent Affective Bias in Large Pre-trained Neural Language Models Anoop Kadan et.al. 2301.09003v1 link
2023-01-21 Exploring Methods for Building Dialects-Mandarin Code-Mixing Corpora: A Case Study in Taiwanese Hokkien Sin-En Lu et.al. 2301.08937v1 link
2023-01-21 Rationalization for Explainable NLP: A Survey Sai Gurrapu et.al. 2301.08912v1 null
2023-01-20 A Review of the Trends and Challenges in Adopting Natural Language Processing Methods for Education Feedback Analysis Thanveer Shaik et.al. 2301.08826v1 null
2023-01-19 Reversing The Twenty Questions Game Parth Parikh et.al. 2301.08718v1 null
2023-01-20 Which Features are Learned by CodeBert: An Empirical Study of the BERT-based Source Code Representation Learning Lan Zhang et.al. 2301.08427v1 null
2023-01-23 A Survey of research in Deep Learning for Robotics for Undergraduate research interns Narayanan PP et.al. 2301.08283v2 null
2023-01-19 Language Embeddings Sometimes Contain Typological Generalizations Robert Östling et.al. 2301.08115v1 link
2023-01-18 Automatically Reproducing Android Bug Reports Using Natural Language Processing and Reinforcement Learning Zhaoxu Zhang et.al. 2301.07775v1 null
2023-01-18 A Quantitative Exploration of Natural Language Processing Applications for Electricity Demand Analysis Yun Bai et.al. 2301.07535v1 null
2023-01-18 Discrete Latent Structure in Neural Networks Vlad Niculae et.al. 2301.07473v1 null
2023-01-17 On the State of German (Abstractive) Text Summarization Dennis Aumiller et.al. 2301.07095v1 link
2023-01-17 Transformer Based Implementation for Automatic Book Summarization Siddhant Porwal et.al. 2301.07057v1 null
2023-01-17 SECOMlint: A linter for Security Commit Messages Sofia Reis et.al. 2301.06959v1 null
2022-12-30 TA-DA: Topic-Aware Domain Adaptation for Scientific Keyphrase Identification and Classification (Student Abstract) Răzvan-Alexandru Smădu et.al. 2301.06902v1 null
2023-01-17 The Recent Advances in Automatic Term Extraction: A survey Hanh Thi Hong Tran et.al. 2301.06767v1 null
2023-01-17 Word Embeddings as Statistical Estimators Neil Dey et.al. 2301.06710v1 link
2023-01-16 XNLI 2.0: Improving XNLI dataset and performance on Cross Lingual Understanding (XLU) Ankit Kumar Upadhyay et.al. 2301.06527v1 null
2023-01-13 A Survey of Self-Supervised Learning from Multiple Perspectives: Algorithms, Theory, Applications and Future Trends Jie Gui et.al. 2301.05712v1 link
2023-01-13 Natural Language Processing of Aviation Occurrence Reports for Safety Management Patrick Jonk et.al. 2301.05663v1 link
2023-01-13 The 2022 n2c2/UW Shared Task on Extracting Social Determinants of Health Kevin Lybarger et.al. 2301.05571v1 null
2023-01-12 Rock Guitar Tablature Generation via Natural Language Processing Josue Casco-Rodriguez et.al. 2301.05295v1 link
2023-01-12 Counterfactual Explanations for Concepts in $\mathcal{ELH}$ Leonie Nora Sieger et.al. 2301.05109v1 null
2023-01-12 Improving Inference Performance of Machine Learning with the Divide-and-Conquer Principle Alex Kogan et.al. 2301.05099v1 null
2023-01-12 A Dataset of Kurdish (Sorani) Named Entities -- An Amendment to Kurdish-BLARK Named Entities Sazan Salar et.al. 2301.04962v1 link
2023-01-12 Machine-learning Analysis of Opioid Use Disorder Informed by MOR, DOR, KOR, NOR and ZOR-Based Interactome Networks Hongsong Feng et.al. 2301.04815v1 link
2023-01-13 Much Ado About Gender: Current Practices and Future Recommendations for Appropriate Gender-Aware Information Access Christine Pinney et.al. 2301.04780v2 null
2023-01-11 NarrowBERT: Accelerating Masked Language Model Pretraining and Inference Haoxin Li et.al. 2301.04761v1 link
2023-01-11 Semantic Web Enabled Geographic Question Answering Framework: GeoTR Ceren Ocal Tasar et.al. 2301.04752v1 null
2023-01-11 SensePOLAR: Word sense aware interpretability for pre-trained contextual word embeddings Jan Engler et.al. 2301.04704v1 link
2023-01-11 ML-FEED: Machine Learning Framework for Efficient Exploit Detection (Extended version) Tanujay Saha et.al. 2301.04314v1 null
2023-01-17 Word-Graph2vec: An efficient word embedding approach on word co-occurrence graph using random walk sampling Wenting Li et.al. 2301.04312v2 null
2023-01-11 A Multi-Modal Geographic Pre-Training Method Ruixue Ding et.al. 2301.04283v1 link
2023-01-10 User-Centered Security in Natural Language Processing Chris Emmery et.al. 2301.04230v1 null
2023-01-10 There is No Big Brother or Small Brother: Knowledge Infusion in Language Models for Link Prediction and Question Answering Ankush Agarwal et.al. 2301.04013v1 link
2023-01-10 Language Models sounds the Death Knell of Knowledge Graphs Kunal Suri et.al. 2301.03980v1 null
2023-01-10 AI based approach to Trailer Generation for Online Educational Courses Prakhar Mishra et.al. 2301.03957v1 null
2023-01-09 Transfer learning for conflict and duplicate detection in software requirement pairs Garima Malik et.al. 2301.03709v1 null
2023-01-10 Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review Reza Azad et.al. 2301.03505v2 link
2023-01-09 Mining Healthcare Procurement Data Using Text Mining and Natural Language Processing -- Reflection From An Industrial Project Ziqi Zhang et.al. 2301.03458v1 null
2023-01-09 Making Sense of Failure Logs in an Industrial DevOps Environment Muhammad Abbas et.al. 2301.03450v1 null
2023-01-09 Universal Multimodal Representation for Language Understanding Zhuosheng Zhang et.al. 2301.03344v1 null
2023-01-08 The State of Human-centered NLP Technology for Fact-checking Anubrata Das et.al. 2301.03056v1 null
2023-01-08 Topic Modelling of Swedish Newspaper Articles about Coronavirus: a Case Study using Latent Dirichlet Allocation Method Bernadeta Griciūtė et.al. 2301.03029v1 link
2023-01-08 Semantic rule Web-based Diagnosis and Treatment of Vector-Borne Diseases using SWRL rules Ritesh Chandra et.al. 2301.03013v1 null
2023-01-06 Systems for Parallel and Distributed Large-Model Deep Learning Training Kabir Nagrecha et.al. 2301.02691v1 null
2023-01-06 CHARM: Composing Heterogeneous Accelerators for Matrix Multiply on Versal ACAP Architecture Jinming Zhuang et.al. 2301.02359v1 link
2023-01-05 Sequentially Controlled Text Generation Alexander Spangher et.al. 2301.02299v1 null
2023-01-05 A Survey of Code-switching: Linguistic and Social Perspectives for Language Technologies A. Seza Doğruöz et.al. 2301.01967v1 null
2023-01-05 Corrupted by Algorithms? How AI-generated and Human-written Advice Shape (Dis)honesty Margarita Leib et.al. 2301.01954v1 null
2023-01-04 Parameter-Efficient Fine-Tuning Design Spaces Jiaao Chen et.al. 2301.01821v1 null
2023-01-04 MessageNet: Message Classification using Natural Language Processing and Meta-data Adar Kahana et.al. 2301.01808v1 null
2023-01-04 Infomaxformer: Maximum Entropy Transformer for Long Time-Series Forecasting Problem Peiwang Tang et.al. 2301.01772v1 null
2023-01-04 Anonymous Pattern Molecular Fingerprint and its Applications on Property Identification Xue Liu et.al. 2301.01620v1 null
2023-01-03 Linear chain conditional random fields, hidden Markov models, and related classifiers Elie Azeraf et.al. 2301.01293v1 null
2023-01-03 Introducing Variational Inference in Undergraduate Statistics and Data Science Curriculum Vojtech Kejzlar et.al. 2301.01251v1 link
2023-01-03 A Survey On Few-shot Knowledge Graph Completion with Structural and Commonsense Knowledge Haodi Ma et.al. 2301.01172v1 null
2023-01-03 Policy Pre-training for End-to-end Autonomous Driving via Self-supervised Geometric Modeling Penghao Wu et.al. 2301.01006v1 link
2023-01-03 Boosting Neural Networks to Decompile Optimized Binaries Ying Cao et.al. 2301.00969v1 null
2022-12-29 Ontology-based Context Aware Recommender System Application for Tourism Vitor T. Camacho et.al. 2301.00768v1 null
2023-01-02 Tsetlin Machine Embedding: Representing Words Using Logical Expressions Bimal Bhattarai et.al. 2301.00709v1 link
2022-12-30 Active Learning for Neural Machine Translation Neeraj Vashistha et.al. 2301.00688v1 link
2022-12-20 Addressing the Selection Bias in Voice Assistance: Training Voice Assistance Model in Python with Equal Data Selection Kashav Piya et.al. 2301.00646v1 null
2023-01-02 Statistical Machine Translation for Indic Languages Sudhansu Bala Das et.al. 2301.00539v1 null
2023-01-02 Adaptive Fine-tuning for Multiclass Classification over Software Requirement Data Savas Yildirim et.al. 2301.00495v1 null
2023-01-01 Integrating Semantic Information into Sketchy Reading Module of Retro-Reader for Vietnamese Machine Reading Comprehension Hang Thi-Thu Le et.al. 2301.00429v1 null
2023-01-01 CORGI-PM: A Chinese Corpus For Gender Bias Probing and Mitigation Ge Zhang et.al. 2301.00395v1 link
2023-01-01 A Functional approach for Two Way Dimension Reduction in Time Series Aniruddha Rajendra Rao et.al. 2301.00357v1 null
2022-12-31 Rethinking with Retrieval: Faithful Large Language Model Inference Hangfeng He et.al. 2301.00303v1 link
2022-12-31 RECOMMED: A Comprehensive Pharmaceutical Recommendation System Mariam Zomorodi et.al. 2301.00280v1 null
2022-12-31 A Survey for In-context Learning Qingxiu Dong et.al. 2301.00234v1 link
2022-12-31 Logic Mill -- A Knowledge Navigation System Sebastian Erhardt et.al. 2301.00200v1 null
2023-01-06 Examining Political Rhetoric with Epistemic Stance Detection Ankita Gupta et.al. 2212.14486v2 link
2022-12-29 On Learning the Structure of Clusters in Graphs Peter Macgregor et.al. 2212.14345v1 null
2022-12-29 On Transforming Reinforcement Learning by Transformer: The Development Trajectory Shengchao Hu et.al. 2212.14164v1 null
2022-12-28 Towards automating Codenames spymasters with deep reinforcement learning Sherman Siu et.al. 2212.14104v1 null
2022-12-28 Automatic Recognition and Classification of Future Work Sentences from Academic Articles in a Specific Domain Chengzhi Zhang et.al. 2212.13860v1 link
2022-12-30 Cyber Security and Online Safety Education for Schools in the UK: Looking through the Lens of Twitter Data Jamie Knott et.al. 2212.13742v2 null
2022-12-28 Part-guided Relational Transformers for Fine-grained Visual Recognition Yifan Zhao et.al. 2212.13685v1 link
2022-12-27 SVSBI: Sequence-based virtual screening of biomolecular interactions Li Shen et.al. 2212.13617v1 link
2022-12-27 Nanomaterials for Supercapacitors: Uncovering Research Themes with Unsupervised Machine Learning Mridhula Venkatanarayanan et.al. 2212.13550v1 null
2022-12-27 A Survey on Knowledge-Enhanced Pre-trained Language Models Chaoqi Zhen et.al. 2212.13428v1 null
2023-01-11 NEEDED: Introducing Hierarchical Transformer to Eye Diseases Diagnosis Xu Ye et.al. 2212.13408v3 link
2022-12-26 VQA and Visual Reasoning: An Overview of Recent Datasets, Methods and Challenges Rufai Yusuf Zakari et.al. 2212.13296v1 null
2022-12-24 On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective Ying Wen et.al. 2212.12669v1 link
2022-12-24 STRUDEL: Structured Dialogue Summarization for Dialogue Comprehension Borui Wang et.al. 2212.12652v1 null
2022-12-24 Utilizing Priming to Identify Optimal Class Ordering to Alleviate Catastrophic Forgetting Gabriel Mantione-Holmes et.al. 2212.12643v1 null
2022-12-23 Content Rating Classification for Fan Fiction Yu Qiao et.al. 2212.12496v1 null
2022-12-23 Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media Yuting Guo et.al. 2212.12454v1 null
2022-12-22 CAMeMBERT: Cascading Assistant-Mediated Multilingual BERT Dan DeGenaro et.al. 2212.11456v1 null
2022-12-21 Automatic Emotion Modelling in Written Stories Lukas Christ et.al. 2212.11382v1 link
2022-12-21 Training language models for deeper understanding improves brain alignment Khai Loong Aw et.al. 2212.10898v1 link
2022-12-21 A Survey of Mix-based Data Augmentation: Taxonomy, Methods, Applications, and Explainability Chengtai Cao et.al. 2212.10888v1 link
2022-12-21 A Portal Dedicated to Higgs Bosons for Experts and the General Public Andre Sopczak et.al. 2212.10857v1 null
2022-12-21 End-to-End Automatic Speech Recognition model for the Sudanese Dialect Ayman Mansour et.al. 2212.10826v1 null
2022-12-21 MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning Zhiyang Xu et.al. 2212.10773v1 link
2022-12-21 Investigation of Network Architecture for Multimodal Head-and-Neck Tumor Segmentation Ye Li et.al. 2212.10724v1 null
2022-12-20 KronA: Parameter Efficient Tuning with Kronecker Adapter Ali Edalati et.al. 2212.10650v1 null
2022-12-20 A Survey of Deep Learning for Mathematical Reasoning Pan Lu et.al. 2212.10535v1 link
2022-12-20 A Measure-Theoretic Characterization of Tight Language Models Li Du et.al. 2212.10502v1 null
2022-12-20 Is GPT-3 a Good Data Annotator? Bosheng Ding et.al. 2212.10450v1 null
2022-12-20 Towards Reasoning in Large Language Models: A Survey Jie Huang et.al. 2212.10403v1 link
2022-12-20 SeqDiffuSeq: Text Diffusion with Encoder-Decoder Transformers Hongyi Yuan et.al. 2212.10325v1 link
2022-12-20 CSMPQ:Class Separability Based Mixed-Precision Quantization Mingkai Wang et.al. 2212.10220v1 null
2022-12-20 GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator Jian Yang et.al. 2212.10218v1 link
2022-12-20 Human-Guided Fair Classification for Natural Language Processing Florian E. Dorner et.al. 2212.10154v1 link
2022-12-20 When Federated Learning Meets Pre-trained Language Models' Parameter-Efficient Tuning Methods Zhuo Zhang et.al. 2212.10025v1 link
2022-12-19 A Comparative Study on Textual Saliency of Styles from Eye Tracking, Annotations, and Language Models Karin de Langis et.al. 2212.09873v1 link
2022-12-19 Exploring Hybrid and Ensemble Models for Multiclass Prediction of Mental Health Status on Social Media Sourabh Zanwar et.al. 2212.09839v1 null
2022-12-19 Do CoNLL-2003 Named Entity Taggers Still Work Well in 2023? Shuheng Liu et.al. 2212.09747v1 link
2022-12-19 MANER: Mask Augmented Named Entity Recognition for Extreme Low-Resource Languages Shashank Sonkar et.al. 2212.09723v1 null

(back to top)

Legal NLP

Publish Date Title Authors PDF Code
2024-04-15 LegalPro-BERT: Classification of Legal Provisions by fine-tuning BERT Large Language Model Amit Tewari et.al. 2404.10097v1 link
2024-04-08 Text clustering applied to data augmentation in legal contexts Lucas José Gonçalves Freitas et.al. 2404.08683v1 null
2024-04-10 Leveraging open-source models for legal language modeling and analysis: a case study on the Indian constitution Vikhyath Gupta et.al. 2404.06751v1 null
2024-04-01 Exploring the Nexus of Large Language Models and Legal Systems: A Short Survey Weicong Qin et.al. 2404.00990v1 null
2024-04-01 Towards an In-Depth Comprehension of Case Relevance for Better Legal Retrieval Haitao Li et.al. 2404.00947v1 null
2024-03-30 Automatic explanation of the classification of Spanish legal judgments in jurisdiction-dependent law categories with tree estimators Jaime González-González et.al. 2404.00437v1 null
2024-04-16 Towards Explainability in Legal Outcome Prediction Models Josef Valvoda et.al. 2403.16852v2 link
2024-03-20 PARAMANU-AYN: An Efficient Novel Generative and Instruction-tuned Language Model for Indian Legal Case Documents Mitodru Niyogi et.al. 2403.13681v1 null
2024-03-19 Towards Unsupervised Question Answering System with Multi-level Summarization for Legal Text M Manvith Prabhu et.al. 2403.13107v1 null
2024-03-16 Human Centered AI for Indian Legal Text Analytics Sudipto Ghosh et.al. 2403.10944v1 null
2024-03-12 Generating Clarification Questions for Disambiguating Contracts Anmol Singhal et.al. 2403.08053v1 null
2024-03-11 Exploring Large Language Models and Hierarchical Frameworks for Classification of Large Unstructured Legal Documents Nishchal Prasad et.al. 2403.06872v1 link
2024-03-04 LLM vs. Lawyers: Identifying a Subset of Summary Judgments in a Large UK Case Law Dataset Ahmed Izzidien et.al. 2403.04791v1 link
2024-03-07 SaulLM-7B: A pioneering Large Language Model for Law Pierre Colombo et.al. 2403.03883v2 null
2024-03-04 Improving Legal Judgement Prediction in Romanian with Long Text Encoders Mihai Masala et.al. 2402.19170v2 null
2024-02-16 Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification Shanshan Xu et.al. 2402.07214v2 null
2024-01-31 Employing Label Models on ChatGPT Answers Improves Legal Text Entailment Performance Chau Nguyen et.al. 2401.17897v1 null
2024-03-04 Islamic Law, Western European Law and the Roots of Middle East's Long Divergence: a Comparative Empirical Investigation (800-1600) Hans-Bernd Schaefer et.al. 2401.14435v3 null
2024-01-07 CAPTAIN at COLIEE 2023: Efficient Methods for Legal Information Retrieval and Entailment Tasks Chau Nguyen et.al. 2401.03551v1 link
2023-12-19 CaseGNN: Graph Neural Networks for Legal Case Retrieval with Text-Attributed Graphs Yanran Tang et.al. 2312.11229v2 link
2023-12-15 Data and Approaches for German Text simplification -- towards an Accessibility-enhanced Communication Thorben Schomacker et.al. 2312.09966v1 null
2023-12-13 A Deep Learning-Based System for Automatic Case Summarization Minh Duong et.al. 2312.07824v1 null
2023-12-08 LaCour!: Enabling Research on Argumentation in Hearings of the European Court of Human Rights Lena Held et.al. 2312.05061v1 null
2023-12-03 Towards Mitigating Perceived Unfairness in Contracts from a Non-Legal Stakeholder's Perspective Anmol Singhal et.al. 2312.01398v1 null
2023-12-01 The Ethics of Automating Legal Actors Josef Valvoda et.al. 2312.00584v1 null
2023-11-24 Tracing Influence at Scale: A Contrastive Learning Approach to Linking Public Comments and Regulator Responses Linzi Xing et.al. 2311.14871v1 null
2023-11-20 On the Potential and Limitations of Few-Shot In-Context Learning to Generate Metamorphic Specifications for Tax Preparation Software Dananjay Srinivas et.al. 2311.11979v1 null
2023-11-16 BLT: Can Large Language Models Handle Basic Legal Text? Andrew Blair-Stanek et.al. 2311.09693v1 link
2023-11-15 LePaRD: A Large-Scale Dataset of Judges Citing Precedents Robert Mahari et.al. 2311.09356v1 link
2023-11-15 Large Language Models are legal but they are not: Making the case for a powerful LegalLLM Thanmay Jayakumar et.al. 2311.08890v1 null
2023-10-24 DALE: Generative Data Augmentation for Low-Resource Legal NLP Sreyan Ghosh et.al. 2310.15799v1 link
2023-10-22 The Law and NLP: Bridging Disciplinary Disconnects Robert Mahari et.al. 2310.14346v1 null
2023-10-19 Do Language Models Learn about Legal Entity Types during Pretraining? Claire Barale et.al. 2310.13092v1 link
2023-10-24 From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification Shanshan Xu et.al. 2310.11878v4 null
2023-10-17 Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation Shubham Kumar Nigam et.al. 2310.11049v1 link
2023-10-25 Legal NLP Meets MiCAR: Advancing the Analysis of Crypto White Papers Carolina Camassa et.al. 2310.10333v3 null
2023-10-15 Improving Access to Justice for the Indian Population: A Benchmark for Evaluating Translation of Legal Text to Indian Languages Sayan Mahapatra et.al. 2310.09765v1 null
2023-10-09 LAiW: A Chinese Legal Large Language Models Benchmark (A Technical Report) Yongfu Dai et.al. 2310.05620v1 link
2023-10-08 Enhancing Pre-Trained Language Models with Sentence Position Embeddings for Rhetorical Roles Recognition in Legal Opinions Anas Belfathi et.al. 2310.05276v1 null
2023-09-28 LawBench: Benchmarking Legal Knowledge of Large Language Models Zhiwei Fei et.al. 2309.16289v1 link
2023-09-25 A Hierarchical Neural Framework for Classification and its Explanation in Large Unstructured Legal Documents Nishchal Prasad et.al. 2309.10563v2 null
2023-09-15 Resolving Legalese: A Multilingual Exploration of Negation Scope Resolution in Legal Documents Ramona Christen et.al. 2309.08695v1 link
2023-09-08 NESTLE: a No-Code Tool for Statistical Analysis of Legal Corpus Kyoungyeon Cho et.al. 2309.04146v1 null
2023-09-06 Prompt-based Effective Input Reformulation for Legal Case Retrieval Yanran Tang et.al. 2309.02962v1 link
2023-08-22 The Software Heritage License Dataset (2022 Edition) Jesús M. González-Barahona et.al. 2308.11258v1 null
2023-08-02 UPB at IberLEF-2023 AuTexTification: Detection of Machine-Generated Text using Transformer Ensembles Andrei-Alexandru Preda et.al. 2308.01408v1 null
2023-07-26 Towards Establishing Systematic Classification Requirements for Automated Driving Ken T. Mori et.al. 2307.14058v1 null
2023-07-16 It's All Relative: Interpretable Models for Scoring Bias in Documents Aswin Suresh et.al. 2307.08139v1 null
2023-06-29 Towards Grammatical Tagging for the Legal Language of Cybersecurity Gianpietro Castiglione et.al. 2306.17042v1 null
2023-09-01 SCALE: Scaling up the Complexity for Advanced Language Model Evaluation Vishvaksenan Rasiah et.al. 2306.09237v2 link
2023-06-13 Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard Ehsan Kamalloo et.al. 2306.07471v1 link
2023-06-12 Large Language Models as Tax Attorneys: A Case Study in Legal Capabilities Emergence John J. Nay et.al. 2306.07075v1 null
2023-06-09 Towards the Exploitation of LLM-based Chatbot for Providing Legal Support to Palestinian Cooperatives Rabee Qasem et.al. 2306.05827v1 null
2023-06-03 FlairNLP at SemEval-2023 Task 6b: Extraction of Legal Named Entities from Legal Texts using Contextual String Embeddings Vinay N Ramesh et.al. 2306.02182v1 link
2023-05-24 CuRIAM: Corpus re Interpretation and Metalanguage in U.S. Supreme Court Opinions Michael Kranzlein et.al. 2305.14719v1 null
2023-05-20 Proceedings of the International Workshop on Methodologies for Translating Legal Norms into Formal Representations (LN2FR 2022) in association with 35th International Conference on Legal Knowledge and Information Systems (JURIX 2022) Georg Borges et.al. 2305.12203v1 null
2023-05-22 LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development Ilias Chalkidis et.al. 2305.07507v2 link
2023-05-11 THUIR@COLIEE 2023: Incorporating Structural Knowledge into Pre-trained Language Models for Legal Case Retrieval Haitao Li et.al. 2305.06812v1 link
2023-05-08 Unlocking Practical Applications in Legal Domain: Evaluation of GPT for Zero-Shot Semantic Annotation of Legal Texts Jaromir Savelka et.al. 2305.04417v1 null
2023-05-06 Rhetorical Role Labeling of Legal Documents using Transformers and Graph Neural Networks Anshika Gupta et.al. 2305.04100v1 null