2024-05-21 |
Code-mixed Sentiment and Hate-speech Prediction |
Anjali Yadav et.al. |
2405.12929v1 |
null |
2024-05-21 |
SmartFlow: Robotic Process Automation using LLMs |
Arushi Jain et.al. |
2405.12842v1 |
null |
2024-05-21 |
Large Language Models Meet NLP: A Survey |
Libo Qin et.al. |
2405.12819v1 |
null |
2024-05-21 |
Transformer in Touch: A Survey |
Jing Gao et.al. |
2405.12779v1 |
null |
2024-05-21 |
SYMPLEX: Controllable Symbolic Music Generation using Simplex Diffusion with Vocabulary Priors |
Nicolas Jonason et.al. |
2405.12666v1 |
null |
2024-05-21 |
Exploration of Masked and Causal Language Modelling for Text Generation |
Nicolo Micheletti et.al. |
2405.12630v1 |
null |
2024-05-21 |
Mamba in Speech: Towards an Alternative to Self-Attention |
Xiangyu Zhang et.al. |
2405.12609v1 |
null |
2024-05-21 |
Is Dataset Quality Still a Concern in Diagnosis Using Large Foundation Model? |
Ziqin Lin et.al. |
2405.12584v1 |
null |
2024-05-21 |
Phishing Email Detection Using Inputs From Artificial Intelligence |
Mithün Paul et.al. |
2405.12494v1 |
null |
2024-05-21 |
Resolving Word Vagueness with Scenario-guided Adapter for Natural Language Inference |
Yonghao Liu et.al. |
2405.12434v1 |
null |
2024-05-20 |
Developers' Perceptions on the Impact of ChatGPT in Software Development: A Survey |
Thiago S. Vaillant et.al. |
2405.12195v1 |
null |
2024-05-20 |
Unveiling factors influencing judgment variation in Sentiment Analysis with Natural Language Processing and Statistics |
Olga Kellert et.al. |
2405.12055v1 |
null |
2024-05-20 |
Continuous Sign Language Recognition with Adapted Conformer via Unsupervised Pretraining |
Neena Aloysius et.al. |
2405.12018v1 |
null |
2024-05-20 |
Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification |
Weilian Zhou et.al. |
2405.12003v1 |
link |
2024-05-20 |
A review on the use of large language models as virtual tutors |
Silvia García-Méndez et.al. |
2405.11983v1 |
null |
2024-05-20 |
Biomedical Entity Linking for Dutch: Fine-tuning a Self-alignment BERT Model on an Automatically Generated Wikipedia Corpus |
Fons Hartendorp et.al. |
2405.11941v1 |
link |
2024-05-20 |
Beyond MLE: Investigating SEARNN for Low-Resourced Neural Machine Translation |
Chris Emezue et.al. |
2405.11819v1 |
null |
2024-05-20 |
FedCAda: Adaptive Client-Side Optimization for Accelerated and Stable Federated Learning |
Liuzhi Zhou et.al. |
2405.11811v1 |
null |
2024-05-20 |
Inverse Design of Metal-Organic Frameworks Using Quantum Natural Language Processing |
Shinyoung Kang et.al. |
2405.11783v1 |
null |
2024-05-20 |
Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques |
Siva Rajesh Kasa et.al. |
2405.11775v1 |
null |
2024-05-17 |
A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers |
Kaiyu Huang et.al. |
2405.10936v1 |
link |
2024-05-17 |
High-dimensional multiple imputation (HDMI) for partially observed confounders including natural language processing-derived auxiliary covariates |
Janick Weberpals et.al. |
2405.10925v1 |
null |
2024-05-17 |
Prioritising GitHub Priority Labels |
James Caddy et.al. |
2405.10891v1 |
null |
2024-05-17 |
Natural Language Processing for Requirements Traceability |
Jin L. C. Guo et.al. |
2405.10845v1 |
null |
2024-05-17 |
INDUS: Effective and Efficient Language Models for Scientific Applications |
Bishwaranjan Bhattacharjee et.al. |
2405.10725v1 |
null |
2024-05-17 |
Empowering Prior to Court Legal Analysis: A Transparent and Accessible Dataset for Defensive Statement Classification and Interpretation |
Yannis Spyridis et.al. |
2405.10702v1 |
null |
2024-05-17 |
Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges |
Xiaoming Shi et.al. |
2405.10630v1 |
null |
2024-05-17 |
Dynamic data sampler for cross-language transfer learning in large language models |
Yudong Li et.al. |
2405.10626v1 |
link |
2024-05-17 |
Feature-based Low-Rank Compression of Large Language Models via Bayesian Optimization |
Yixin Ji et.al. |
2405.10616v1 |
link |
2024-05-17 |
Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset |
Jie Zhu et.al. |
2405.10542v1 |
link |
2024-05-16 |
Mitigating Text Toxicity with Counterfactual Generation |
Milan Bhan et.al. |
2405.09948v1 |
null |
2024-05-16 |
On the relevance of pre-neural approaches in natural language processing pedagogy |
Aditya Joshi et.al. |
2405.09854v1 |
null |
2024-05-16 |
Optimization Techniques for Sentiment Analysis Based on LLM (GPT-3) |
Tong Zhan et.al. |
2405.09770v1 |
null |
2024-05-15 |
SCI 3.0: A Web-based Schema Curation Interface for Graphical Event Representations |
Reece Suchocki et.al. |
2405.09733v1 |
null |
2024-05-15 |
Enhancing Maritime Trajectory Forecasting via H3 Index and Causal Language Modelling (CLM) |
Nicolas Drapier et.al. |
2405.09596v1 |
null |
2024-05-15 |
Facilitating Opinion Diversity through Hybrid NLP Approaches |
Michiel van der Meer et.al. |
2405.09439v1 |
null |
2024-05-15 |
Comparing the Efficacy of GPT-4 and Chat-GPT in Mental Health Care: A Blind Assessment of Large Language Models for Psychological Support |
Birger Moell et.al. |
2405.09300v1 |
null |
2024-05-15 |
Positional Knowledge is All You Need: Position-induced Transformer (PiT) for Operator Learning |
Junfeng Chen et.al. |
2405.09285v1 |
null |
2024-05-15 |
Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy |
Feng Wang et.al. |
2405.09014v1 |
link |
2024-05-14 |
Challenges and Opportunities in Text Generation Explainability |
Kenza Amara et.al. |
2405.08468v1 |
null |
2024-05-13 |
A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection |
Matthew Korban et.al. |
2405.08204v1 |
null |
2024-05-13 |
Benchmarking Retrieval-Augmented Large Language Models in Biomedical NLP: Application, Robustness, and Self-Awareness |
Mingchen Li et.al. |
2405.08151v1 |
null |
2024-05-14 |
PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition |
Ziyang Zhang et.al. |
2405.07932v2 |
link |
2024-05-13 |
A Comprehensive Analysis of Static Word Embeddings for Turkish |
Karahan Sarıtaş et.al. |
2405.07778v1 |
link |
2024-05-13 |
Challenges and Opportunities of NLP for HR Applications: A Discussion Paper |
Jochen L. Leidner et.al. |
2405.07766v1 |
null |
2024-05-13 |
Constructing a BPE Tokenization DFA |
Martin Berglund et.al. |
2405.07671v1 |
null |
2024-05-13 |
Backdoor Removal for Generative Large Language Models |
Haoran Li et.al. |
2405.07667v1 |
null |
2024-05-13 |
AIris: An AI-powered Wearable Assistive Device for the Visually Impaired |
Dionysia Danai Brilli et.al. |
2405.07606v1 |
null |
2024-05-13 |
Evaluation of Retrieval-Augmented Generation: A Survey |
Hao Yu et.al. |
2405.07437v1 |
link |
2024-05-11 |
Advanced Natural-based interaction for the ITAlian language: LLaMAntino-3-ANITA |
Marco Polignano et.al. |
2405.07101v1 |
null |
2024-05-11 |
TacoERE: Cluster-aware Compression for Event Relation Extraction |
Yong Guan et.al. |
2405.06890v1 |
null |
2024-05-10 |
PLeak: Prompt Leaking Attacks against Large Language Model Applications |
Bo Hui et.al. |
2405.06823v1 |
link |
2024-05-10 |
Explaining Text Similarity in Transformer Models |
Alexandros Vasileiou et.al. |
2405.06604v1 |
link |
2024-05-10 |
What Can Natural Language Processing Do for Peer Review? |
Ilia Kuznetsov et.al. |
2405.06563v1 |
link |
2024-05-10 |
Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification |
Yaoqin Ye et.al. |
2405.06468v1 |
null |
2024-05-10 |
LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play |
Li-Chun Lu et.al. |
2405.06373v1 |
null |
2024-05-10 |
A NLP Approach to "Review Bombing" in Metacritic PC Videogames User Ratings |
Javier Coronado-Blázquez et.al. |
2405.06306v1 |
null |
2024-05-09 |
Creating Geospatial Trajectories from Human Trafficking Text Corpora |
Saydeh N. Karabatis et.al. |
2405.06130v1 |
null |
2024-05-09 |
Narrative to Trajectory (N2T+): Extracting Routes of Life or Death from Human Trafficking Text Corpora |
Saydeh N. Karabatis et.al. |
2405.06129v1 |
null |
2024-05-09 |
Collaborative Design for Job-Seekers with Autism: A Conceptual Framework for Future Research |
Sungsoo Ray Hong et.al. |
2405.06078v1 |
null |
2024-05-09 |
Natural Language Processing RELIES on Linguistics |
Juri Opitz et.al. |
2405.05966v1 |
null |
2024-05-09 |
Revitalising Stagecraft: NLP-Driven Sentiment Analysis for Traditional Theater Revival |
Saikat Samanta et.al. |
2405.05813v1 |
null |
2024-05-09 |
Enhancing Suicide Risk Detection on Social Media through Semi-Supervised Deep Label Smoothing |
Matthew Squires et.al. |
2405.05795v1 |
null |
2024-05-09 |
Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language |
Ronny Paul et.al. |
2405.05777v1 |
null |
2024-05-09 |
Computational lexical analysis of Flamenco genres |
Pablo Rosillo-Rodes et.al. |
2405.05723v1 |
null |
2024-05-09 |
Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM |
Xikang Yang et.al. |
2405.05610v1 |
link |
2024-05-09 |
A Survey on Backbones for Deep Video Action Recognition |
Zixuan Tang et.al. |
2405.05584v1 |
null |
2024-05-08 |
Enhancing Holonic Architecture with Natural Language Processing for System of Systems |
Muhammad Ashfaq et.al. |
2405.05365v1 |
null |
2024-05-08 |
CARE-SD: Classifier-based analysis for recognizing and eliminating stigmatizing and doubt marker labels in electronic health records: model development and validation |
Drew Walker et.al. |
2405.05204v1 |
null |
2024-05-08 |
An Artificial Intelligence Approach for Interpreting Creative Combinational Designs |
Liuqing Chen et.al. |
2405.04985v1 |
null |
2024-05-08 |
Improving Long Text Understanding with Knowledge Distilled from Summarization Model |
Yan Liu et.al. |
2405.04955v1 |
null |
2024-05-08 |
Fine-tuning Pre-trained Named Entity Recognition Models For Indian Languages |
Sankalp Bahad et.al. |
2405.04829v1 |
null |
2024-05-08 |
Zero-shot LLM-guided Counterfactual Generation for Text |
Amrita Bhattacharjee et.al. |
2405.04793v1 |
null |
2024-05-08 |
CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization |
Zheyan Qu et.al. |
2405.04781v1 |
null |
2024-05-07 |
Bridging the Bosphorus: Advancing Turkish Large Language Models through Strategies for Low-Resource Language Adaptation and Benchmarking |
Emre Can Acikgoz et.al. |
2405.04685v1 |
null |
2024-05-07 |
Vision Mamba: A Comprehensive Survey and Taxonomy |
Xiao Liu et.al. |
2405.04404v1 |
link |
2024-05-07 |
Revisiting character-level adversarial attacks |
Elias Abad Rocamora et.al. |
2405.04346v1 |
link |
2024-05-07 |
NOVA: NoC-based Vector Unit for Mapping Attention Layers on a CNN Accelerator |
Mohit Upadhyay et.al. |
2405.04206v1 |
null |
2024-05-07 |
LingML: Linguistic-Informed Machine Learning for Enhanced Fake News Detection |
Jasraj Singh et.al. |
2405.04165v1 |
null |
2024-05-07 |
Sign2GPT: Leveraging Large Language Models for Gloss-Free Sign Language Translation |
Ryan Wong et.al. |
2405.04164v1 |
null |
2024-05-07 |
GPT-Enabled Cybersecurity Training: A Tailored Approach for Effective Awareness |
Nabil Al-Dhamari et.al. |
2405.04138v1 |
null |
2024-05-07 |
Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning |
Karim Galliamov et.al. |
2405.04126v1 |
link |
2024-05-07 |
Evaluating Text Summaries Generated by Large Language Models Using OpenAI's GPT |
Hassan Shakil et.al. |
2405.04053v1 |
null |
2024-05-07 |
Sketch Then Generate: Providing Incremental User Feedback and Guiding LLM Code Generation through Language-Oriented Code Sketches |
Chen Zhu-Tian et.al. |
2405.03998v1 |
null |
2024-05-07 |
A Roadmap for Multilingual, Multimodal Domain Independent Deception Detection |
Dainis Boumber et.al. |
2405.03920v1 |
null |
2024-05-06 |
Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment |
Abhinav Agarwalla et.al. |
2405.03594v1 |
null |
2024-05-06 |
Gaussian Stochastic Weight Averaging for Bayesian Low-Rank Adaptation of Large Language Models |
Emre Onal et.al. |
2405.03425v1 |
null |
2024-05-06 |
Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond |
Jiuxiang Gu et.al. |
2405.03251v1 |
null |
2024-05-06 |
Vietnamese AI Generated Text Detection |
Quang-Dan Tran et.al. |
2405.03206v1 |
null |
2024-05-06 |
CRAFT: Extracting and Tuning Cultural Instructions from the Wild |
Bin Wang et.al. |
2405.03138v1 |
link |
2024-05-06 |
WDMoE: Wireless Distributed Large Language Models with Mixture of Experts |
Nan Xue et.al. |
2405.03131v1 |
null |
2024-05-05 |
Unraveling the Dominance of Large Language Models Over Transformer Models for Bangla Natural Language Inference: A Comprehensive Study |
Fatema Tuj Johora Faria et.al. |
2405.02937v1 |
link |
2024-05-05 |
Exploring the Improvement of Evolutionary Computation via Large Language Models |
Jinyu Cai et.al. |
2405.02876v1 |
null |
2024-05-05 |
HuixiangDou-CR: Coreference Resolution in Group Chats |
Huanjun Kong et.al. |
2405.02817v1 |
link |
2024-05-05 |
Structural Balance in Real-World Social Networks: Incorporating Direction and Transitivity in Measuring Partial Balance |
Rezvaneh Rezapour et.al. |
2405.02798v1 |
null |
2024-05-03 |
Impact of emoji exclusion on the performance of Arabic sarcasm detection models |
Ghalyah H. Aleryani et.al. |
2405.02195v1 |
null |
2024-05-03 |
Single and Multi-Hop Question-Answering Datasets for Reticular Chemistry with GPT-4-Turbo |
Nakul Rampal et.al. |
2405.02128v1 |
null |
2024-05-03 |
Comparative Analysis of Retrieval Systems in the Real World |
Dmytro Mozolevskyi et.al. |
2405.02048v1 |
null |
2024-05-03 |
The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification |
Minh Duc Bui et.al. |
2405.02010v1 |
null |
2024-05-03 |
Conformal Prediction for Natural Language Processing: A Survey |
Margarida M. Campos et.al. |
2405.01976v1 |
null |
2024-05-03 |
Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders and Identifying Distinct Features |
Chuanbo Hu et.al. |
2405.01799v1 |
null |
2024-05-02 |
Question Suggestion for Conversational Shopping Assistants Using Product Metadata |
Nikhita Vedula et.al. |
2405.01738v1 |
null |
2024-05-02 |
Automatically Extracting Numerical Results from Randomized Controlled Trials with Large Language Models |
Hye Sun Yun et.al. |
2405.01686v1 |
link |
2024-05-02 |
Leveraging Prompt-Learning for Structured Information Extraction from Crohn's Disease Radiology Reports in a Low-Resource Language |
Liam Hazan et.al. |
2405.01682v1 |
null |
2024-05-02 |
1-Diffractor: Efficient and Utility-Preserving Text Obfuscation Leveraging Word-Level Metric Differential Privacy |
Stephen Meisenbacher et.al. |
2405.01678v1 |
link |
2024-05-02 |
Analyzing the Role of Semantic Representations in the Era of Large Language Models |
Zhijing Jin et.al. |
2405.01502v1 |
link |
2024-05-02 |
"In-Context Learning" or: How I learned to stop worrying and love "Applied Information Retrieval" |
Andrew Parry et.al. |
2405.01116v1 |
null |
2024-05-01 |
A Legal Framework for Natural Language Processing Model Training in Portugal |
Rúben Almeida et.al. |
2405.00536v1 |
null |
2024-05-01 |
DAM: A Universal Dual Attention Mechanism for Multimodal Timeseries Cryptocurrency Trend Forecasting |
Yihang Fu et.al. |
2405.00522v1 |
link |
2024-05-01 |
Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning |
Lucas-Andreï Thil et.al. |
2405.00516v1 |
null |
2024-05-01 |
Thread review sentimental analysis with tkinter GUI & tableau dashboard |
Robin Donal et.al. |
2405.00377v1 |
link |
2024-05-01 |
AdaMoLE: Fine-Tuning Large Language Models with Adaptive Mixture of Low-Rank Adaptation Experts |
Zefang Liu et.al. |
2405.00361v1 |
link |
2024-05-01 |
A Survey on Deep Active Learning: Recent Advances and New Frontiers |
Dongyuan Li et.al. |
2405.00334v1 |
null |
2024-05-01 |
Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition |
Dongyuan Li et.al. |
2405.00307v1 |
link |
2024-05-01 |
ASAM: Boosting Segment Anything Model with Adversarial Tuning |
Bo Li et.al. |
2405.00256v1 |
link |
2024-04-30 |
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing |
Yucheng Hu et.al. |
2404.19543v1 |
link |
2024-04-30 |
DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets |
Xiaoyu Huang et.al. |
2404.19264v1 |
null |
2024-04-30 |
Mix of Experts Language Model for Named Entity Recognition |
Xinwei Chen et.al. |
2404.19192v1 |
null |
2024-04-30 |
Revenge of the Fallen? Recurrent Models Match Transformers at Predicting Human Language Comprehension Metrics |
James A. Michaelov et.al. |
2404.19178v1 |
null |
2024-04-29 |
A Framework for Real-time Safeguarding the Text Generation of Large Language |
Ximing Dong et.al. |
2404.19048v1 |
null |
2024-04-29 |
Unsupervised Binary Code Translation with Application to Code Similarity Detection and Vulnerability Discovery |
Iftakhar Ahmad et.al. |
2404.19025v1 |
link |
2024-04-29 |
Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism |
Lei Kang et.al. |
2404.19024v1 |
link |
2024-04-29 |
Computational Job Market Analysis with Natural Language Processing |
Mike Zhang et.al. |
2404.18977v1 |
link |
2024-04-29 |
Overcoming Knowledge Barriers: Online Imitation Learning from Observation with Pretrained World Models |
Xingyuan Zhang et.al. |
2404.18896v1 |
link |
2024-04-29 |
Towards A Structured Overview of Use Cases for Natural Language Processing in the Legal Domain: A German Perspective |
Juraj Vladika et.al. |
2404.18759v1 |
null |
2024-04-29 |
Reinforcement Learning Problem Solving with Large Language Models |
Sina Gholamian et.al. |
2404.18638v1 |
null |
2024-04-29 |
From ChatGPT, DALL-E 3 to Sora: How has Generative AI Changed Digital Humanities Research and Services? |
Jiangfeng Liu et.al. |
2404.18518v1 |
null |
2024-04-29 |
Quantitative Tools for Time Series Analysis in Natural Language Processing: A Practitioners Guide |
W. Benedikt Schmal et.al. |
2404.18499v1 |
link |
2024-04-28 |
Mapping 'when'-clauses in Latin American and Caribbean languages: an experiment in subtoken-based typology |
Nilo Pedrazzini et.al. |
2404.18257v1 |
null |
2024-04-28 |
PatentGPT: A Large Language Model for Intellectual Property |
Zilong Bai et.al. |
2404.18255v1 |
null |
2024-04-28 |
4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBs |
Minjie Wang et.al. |
2404.18209v1 |
link |
2024-04-28 |
Exploring the Robustness of In-Context Learning with Noisy Labels |
Chen Cheng et.al. |
2404.18191v1 |
link |
2024-04-28 |
Application and practice of AI technology in quantitative investment |
Shuochen Bi et.al. |
2404.18184v1 |
null |
2024-04-26 |
Transformer For Low-frequency Extrapolating of Seismic Data |
Zheng Cong et.al. |
2404.17437v1 |
null |
2024-04-26 |
Evaluation of Geographical Distortions in Language Models: A Crucial Step Towards Equitable Representations |
Rémy Decoupes et.al. |
2404.17401v1 |
null |
2024-04-26 |
M3BAT: Unsupervised Domain Adaptation for Multimodal Mobile Sensing with Multi-Branch Adversarial Training |
Lakmal Meegahapola et.al. |
2404.17391v1 |
null |
2024-04-26 |
Can a Multichoice Dataset be Repurposed for Extractive Question Answering? |
Teresa Lynn et.al. |
2404.17342v1 |
null |
2024-04-26 |
Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM |
Xuan Zhang et.al. |
2404.17283v1 |
link |
2024-04-26 |
Prompting Towards Alleviating Code-Switched Data Scarcity in Under-Resourced Languages with GPT as a Pivot |
Michelle Terblanche et.al. |
2404.17216v1 |
null |
2024-04-26 |
Quantifying Memorization of Domain-Specific Pre-trained Language Models using Japanese Newspaper and Paywalls |
Shotaro Ishihara et.al. |
2404.17143v1 |
null |
2024-04-26 |
Process Mining Embeddings: Learning Vector Representations for Petri Nets |
Juan G. Colonna et.al. |
2404.17129v1 |
link |
2024-04-26 |
Text Sentiment Analysis and Classification Based on Bidirectional Gated Recurrent Units (GRUs) Model |
Wei Xu et.al. |
2404.17123v1 |
null |
2024-04-26 |
2M-NER: Contrastive Learning for Multilingual and Multimodal NER with Language and Modal Fusion |
Dongsheng Wang et.al. |
2404.17122v1 |
null |
2024-04-25 |
EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning |
Hongxia Xie et.al. |
2404.16670v1 |
link |
2024-04-25 |
ProbGate at EHRSQL 2024: Enhancing SQL Query Generation Accuracy through Probabilistic Threshold Filtering and Error Handling |
Sangryul Kim et.al. |
2404.16659v1 |
link |
2024-04-25 |
Análise de ambiguidade linguística em modelos de linguagem de grande escala (LLMs) |
Lavínia de Carvalho Moraes et.al. |
2404.16653v1 |
null |
2024-04-25 |
U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF |
Xingchen Song et.al. |
2404.16407v1 |
null |
2024-04-25 |
LLM-Based Section Identifiers Excel on Open Source but Stumble in Real World Applications |
Saranya Krishnamoorthy et.al. |
2404.16294v1 |
link |
2024-04-24 |
Towards Efficient Patient Recruitment for Clinical Trials: Application of a Prompt-Based Learning Model |
Mojdeh Rahmanian et.al. |
2404.16198v1 |
null |
2024-04-24 |
Chat2Scenario: Scenario Extraction From Dataset Through Utilization of Large Language Model |
Yongqi Zhao et.al. |
2404.16147v1 |
link |
2024-04-24 |
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges |
Badri Narayana Patro et.al. |
2404.16112v1 |
link |
2024-04-24 |
Semantic Routing for Enhanced Performance of LLM-Assisted Intent-Based 5G Core Network Management and Orchestration |
Dimitrios Michael Manias et.al. |
2404.15869v1 |
null |
2024-04-24 |
Porting Large Language Models to Mobile Devices for Question Answering |
Hannes Fassold et.al. |
2404.15851v1 |
null |
2024-04-24 |
Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations? |
Hossein Salami et.al. |
2404.15578v1 |
null |
2024-04-23 |
Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information |
Chihiro Taguchi et.al. |
2404.15501v1 |
link |
2024-04-23 |
IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents |
Jean-Philippe Corbeil et.al. |
2404.15488v1 |
link |
2024-04-23 |
Feature Distribution Shift Mitigation with Contrastive Pretraining for Intrusion Detection |
Weixing Wang et.al. |
2404.15382v1 |
null |
2024-04-22 |
MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA based Mixture of Experts |
Dengchun Li et.al. |
2404.15159v1 |
link |
2024-04-23 |
Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-case |
Muhammad Asif Auyb et.al. |
2404.14977v1 |
null |
2024-04-23 |
Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models |
Yang Tan et.al. |
2404.14850v1 |
link |
2024-04-23 |
Modeling the Sacred: Considerations when Using Considerations when Using Religious Texts in Natural Language Processing |
Ben Hutchinson et.al. |
2404.14740v1 |
null |
2024-04-23 |
Learning Word Embedding with Better Distance Weighting and Window Size Scheduling |
Chaohao Yang et.al. |
2404.14631v1 |
null |
2024-04-22 |
Automated Long Answer Grading with RiceChem Dataset |
Shashank Sonkar et.al. |
2404.14316v1 |
link |
2024-04-22 |
Marking: Visual Grading with Highlighting Errors and Annotating Missing Bits |
Shashank Sonkar et.al. |
2404.14301v1 |
null |
2024-04-22 |
EnzChemRED, a rich enzyme chemistry relation extraction dataset |
Po-Ting Lai et.al. |
2404.14209v1 |
null |
2024-04-22 |
Protecting Your LLMs with Information Bottleneck |
Zichuan Liu et.al. |
2404.13968v1 |
link |
2024-04-22 |
MARIO Eval: Evaluate Your Math LLM with your Math LLM--A mathematical dataset evaluation toolkit |
Boning Zhang et.al. |
2404.13925v1 |
link |
2024-04-21 |
Mixture of LoRA Experts |
Xun Wu et.al. |
2404.13628v1 |
link |
2024-04-21 |
Parameter Efficient Fine Tuning: A Comprehensive Analysis Across Applications |
Charith Chandra Sai Balne et.al. |
2404.13506v1 |
null |
2024-04-20 |
Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing |
Yuang Liu et.al. |
2404.13434v1 |
null |
2024-04-20 |
Retrieval-Augmented Generation-based Relation Extraction |
Sefika Efeoglu et.al. |
2404.13397v1 |
link |
2024-04-20 |
MahaSQuAD: Bridging Linguistic Divides in Marathi Question-Answering |
Ruturaj Ghatage et.al. |
2404.13364v1 |
link |
2024-04-19 |
FinLangNet: A Novel Deep Learning Framework for Credit Risk Prediction Using Linguistic Analogy in Financial Data |
Yu Lei et.al. |
2404.13004v1 |
link |
2024-04-19 |
LiMe: a Latin Corpus of Late Medieval Criminal Sentences |
Alessandra Bassani et.al. |
2404.12829v1 |
null |
2024-04-19 |
Large Language Model Supply Chain: A Research Agenda |
Shenao Wang et.al. |
2404.12736v1 |
null |
2024-04-19 |
Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation |
Lasal Jayawardena et.al. |
2404.12596v1 |
null |
2024-04-18 |
GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction |
Urchade Zaratiana et.al. |
2404.12491v1 |
link |
2024-04-18 |
NLP-enabled trajectory map-matching in urban road networks using transformer sequence-to-sequence model |
Sevin Mohammadi et.al. |
2404.12460v1 |
null |
2024-04-18 |
RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation |
Chao Jin et.al. |
2404.12457v1 |
null |
2024-04-18 |
Point-In-Context: Understanding Point Cloud via In-Context Learning |
Mengyuan Liu et.al. |
2404.12352v1 |
link |
2024-04-18 |
Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting |
Nicholas Harris et.al. |
2404.12283v1 |
null |
2024-04-18 |
EuSQuAD: Automatically Translated and Aligned SQuAD2.0 for Basque |
Aitor García-Pablos et.al. |
2404.12177v1 |
link |
2024-04-18 |
Stance Detection on Social Media with Fine-Tuned Large Language Models |
İlker Gül et.al. |
2404.12171v1 |
null |
2024-04-18 |
Enhance Robustness of Language Models Against Variation Attack through Graph Integration |
Zi Xiong et.al. |
2404.12014v1 |
null |
2024-04-18 |
ParaFusion: A Large-Scale LLM-Driven English Paraphrase Dataset Infused with High-Quality Lexical and Syntactic Diversity |
Lasal Jayawardena et.al. |
2404.12010v1 |
null |
2024-04-18 |
EVIT: Event-Oriented Instruction Tuning for Event Reasoning |
Zhengwei Tao et.al. |
2404.11978v1 |
null |
2024-04-18 |
Sharing Parameter by Conjugation for Knowledge Graph Embeddings in Complex Space |
Xincan Feng et.al. |
2404.11809v1 |
link |
2024-04-17 |
REQUAL-LM: Reliability and Equity through Aggregation in Large Language Models |
Sana Ebrahimi et.al. |
2404.11782v1 |
null |
2024-04-17 |
Pretraining Billion-scale Geospatial Foundational Models on Frontier |
Aristeidis Tsaris et.al. |
2404.11706v1 |
null |
2024-04-17 |
Related Work and Citation Text Generation: A Survey |
Xiangci Li et.al. |
2404.11588v1 |
null |
2024-04-17 |
Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis |
Soyoung Yang et.al. |
2404.11539v1 |
null |
2024-04-17 |
GenFighter: A Generative and Evolutive Textual Attack Removal |
Md Athikul Islam et.al. |
2404.11538v1 |
null |
2024-04-17 |
Research on emotionally intelligent dialogue generation based on automatic dialogue system |
Jin Wang et.al. |
2404.11447v1 |
null |
2024-04-17 |
Low-Cost Language Models: Survey and Performance Evaluation on Python Code Generation |
Jessica López Espejel et.al. |
2404.11160v1 |
null |
2024-04-17 |
Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions |
Leena Mathur et.al. |
2404.11023v1 |
null |
2024-04-16 |
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training |
Pavel Denisov et.al. |
2404.10922v1 |
link |
2024-04-16 |
A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents |
Wiam Adnan et.al. |
2404.10848v1 |
null |
2024-04-16 |
A Sentiment Analysis of Medical Text Based on Deep Learning |
Yinan Chen et.al. |
2404.10503v1 |
null |
2024-04-16 |
Towards Complex Ontology Alignment using Large Language Models |
Reihaneh Amini et.al. |
2404.10329v1 |
null |
2024-04-16 |
Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs |
Woomin Song et.al. |
2404.10308v1 |
link |
2024-04-16 |
Future Language Modeling from Temporal Document History |
Changmao Li et.al. |
2404.10297v1 |
link |
2024-04-15 |
LegalPro-BERT: Classification of Legal Provisions by fine-tuning BERT Large Language Model |
Amit Tewari et.al. |
2404.10097v1 |
link |
2024-04-15 |
Detecting AI Generated Text Based on NLP and Machine Learning Approaches |
Nuzhat Prova et.al. |
2404.10032v1 |
null |
2024-04-15 |
How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model |
Hanxue Gu et.al. |
2404.09957v1 |
link |
2024-04-15 |
AI-Driven Statutory Reasoning via Software Engineering Methods |
Rohan Padhye et.al. |
2404.09868v1 |
null |
2024-04-15 |
Reimagining Self-Adaptation in the Age of Large Language Models |
Raghav Donakanti et.al. |
2404.09866v1 |
null |
2024-04-15 |
KG-CTG: Citation Generation through Knowledge Graph-guided Large Language Models |
Avinash Anand et.al. |
2404.09763v1 |
null |
2024-04-15 |
Resilience of Large Language Models for Noisy Instructions |
Bin Wang et.al. |
2404.09754v1 |
null |
2024-04-15 |
State Space Model for New-Generation Network Alternative to Transformers: A Survey |
Xiao Wang et.al. |
2404.09516v1 |
link |
2024-04-15 |
Automatic Knowledge Graph Construction for Judicial Cases |
Jie Zhou et.al. |
2404.09416v1 |
null |
2024-04-15 |
A Large-Scale Evaluation of Speech Foundation Models |
Shu-wen Yang et.al. |
2404.09385v1 |
link |
2024-04-14 |
Hierarchical Attention Models for Multi-Relational Graphs |
Roshni G. Iyer et.al. |
2404.09365v1 |
link |
2024-04-14 |
Counteracting Concept Drift by Learning with Future Malware Predictions |
Branislav Bosansky et.al. |
2404.09352v1 |
null |
2024-04-14 |
A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion |
Zihan Cao et.al. |
2404.09293v1 |
null |
2024-04-14 |
Unveiling LLM Evaluation Focused on Metrics: Challenges and Solutions |
Taojun Hu et.al. |
2404.09135v1 |
null |
2024-04-13 |
Multilingual Evaluation of Semantic Textual Relatedness |
Sharvi Endait et.al. |
2404.09047v1 |
null |
2024-04-13 |
WikiSplit++: Easy Data Refinement for Split and Rephrase |
Hayato Tsukagoshi et.al. |
2404.09002v1 |
link |
2024-04-13 |
Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives |
Yidan Liu et.al. |
2404.08926v1 |
null |
2024-04-10 |
An inclusive review on deep learning techniques and their scope in handwriting recognition |
Sukhdeep Singh et.al. |
2404.08011v1 |
null |
2024-04-11 |
AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports |
Lukas Lange et.al. |
2404.07765v1 |
link |
2024-04-11 |
ODA: Observation-Driven Agent for integrating LLMs and Knowledge Graphs |
Lei Sun et.al. |
2404.07677v1 |
link |
2024-04-11 |
CAT: Contrastive Adapter Training for Personalized Image Generation |
Jae Wan Park et.al. |
2404.07554v1 |
link |
2024-04-11 |
Behavior Trees Enable Structured Programming of Language Model Agents |
Richard Kelley et.al. |
2404.07439v1 |
link |
2024-04-11 |
Towards Robustness of Text-to-Visualization Translation against Lexical and Phrasal Variability |
Jinwei Lu et.al. |
2404.07135v2 |
null |
2024-04-10 |
DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space |
Jianxiang Xiang et.al. |
2404.06760v1 |
null |
2024-04-12 |
Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness |
Xincan Feng et.al. |
2404.06714v2 |
null |
2024-04-10 |
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers |
Longwei Zou et.al. |
2404.06709v1 |
null |
2024-04-09 |
Perplexed: Understanding When Large Language Models are Confused |
Nathan Cooper et.al. |
2404.06634v1 |
null |
2024-04-09 |
ClinLinker: Medical Entity Linking of Clinical Concept Mentions in Spanish |
Fernando Gallego et.al. |
2404.06367v1 |
null |
2024-04-09 |
Finding fake reviews in e-commerce platforms by using hybrid algorithms |
Mathivanan Periasamy et.al. |
2404.06339v1 |
null |
2024-04-09 |
Exploring the True Potential: Evaluating the Black-box Optimization Capability of Large Language Models |
Beichen Huang et.al. |
2404.06290v1 |
null |
2024-04-09 |
VI-OOD: A Unified Representation Learning Framework for Textual Out-of-distribution Detection |
Li-Ming Zhan et.al. |
2404.06217v1 |
link |
2024-04-09 |
Protection of Guizhou Miao Batik Culture Based on Knowledge Graph and Deep Learning |
Huafeng Quan et.al. |
2404.06168v1 |
null |
2024-04-09 |
Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports |
Tianyu Cao et.al. |
2404.06162v1 |
null |
2024-04-09 |
Mansformer: Efficient Transformer of Mixed Attention for Image Deblurring and Beyond |
Pin-Hung Kuo et.al. |
2404.06135v1 |
null |
2024-04-09 |
FLEX: FLEXible Federated Learning Framework |
Francisco Herrera et.al. |
2404.06127v1 |
link |
2024-04-09 |
All in One: An Empirical Study of GPT for Few-Shot Aspect-Based Sentiment Anlaysis |
Baoxing Jiang et.al. |
2404.06063v1 |
null |
2024-04-09 |
Privacy Preserving Prompt Engineering: A Survey |
Kennedy Edemacu et.al. |
2404.06001v1 |
null |
2024-04-08 |
A Large-Scale Exploration of $μ$ -Transfer |
Lucas Lingle et.al. |
2404.05728v1 |
link |
2024-04-08 |
Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding |
Ahmad Idrissi-Yaghir et.al. |
2404.05694v1 |
null |
2024-04-08 |
Causality Extraction from Nuclear Licensee Event Reports Using a Hybrid Framework |
Sohag Rahman et.al. |
2404.05656v1 |
null |
2024-04-08 |
LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking |
Faren Yan et.al. |
2404.05624v1 |
null |
2024-04-08 |
3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering |
Qingyuan Zhou et.al. |
2404.05522v1 |
null |
2024-04-08 |
Relation Extraction Using Large Language Models: A Case Study on Acupuncture Point Locations |
Yiming Li et.al. |
2404.05415v1 |
null |
2024-04-08 |
NLP Progress in Indigenous Latin American Languages |
Atnafu Lambebo Tonja et.al. |
2404.05365v1 |
null |
2024-04-08 |
Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods |
Roopkatha Dey et.al. |
2404.05159v1 |
null |
2024-04-08 |
EcoVerse: An Annotated Twitter Dataset for Eco-Relevance Classification, Environmental Impact Analysis, and Stance Detection |
Francesca Grasso et.al. |
2404.05133v1 |
link |
2024-04-07 |
Adapting LLMs for Efficient Context Processing through Soft Prompt Compression |
Cangqing Wang et.al. |
2404.04997v1 |
null |
2024-04-05 |
player2vec: A Language Modeling Approach to Understand Player Behavior in Games |
Tianze Wang et.al. |
2404.04234v1 |
null |
2024-04-05 |
Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving |
Gulsum Yigit et.al. |
2404.03938v1 |
null |
2024-04-05 |
Simple Techniques for Enhancing Sentence Embeddings in Generative Language Models |
Bowen Zhang et.al. |
2404.03921v1 |
link |
2024-04-05 |
A Bi-consolidating Model for Joint Relational Triple Extraction |
Xiaocheng Luo et.al. |
2404.03881v1 |
null |
2024-04-04 |
Understanding Language Modeling Paradigm Adaptations in Recommender Systems: Lessons Learned and Open Challenges |
Lemei Zhang et.al. |
2404.03788v1 |
link |
2024-04-04 |
Dendrites endow artificial neural networks with accurate, robust and parameter-efficient learning |
Spyridon Chavlis et.al. |
2404.03708v1 |
null |
2024-04-04 |
Knowledge Graph Representation for Political Information Sources |
Tinatin Osmonova et.al. |
2404.03437v1 |
null |
2024-04-04 |
ChangeMamba: Remote Sensing Change Detection with Spatio-Temporal State Space Model |
Hongruixuan Chen et.al. |
2404.03425v1 |
link |
2024-04-04 |
Towards Pareto Optimal Throughput in Small Language Model Serving |
Pol G. Recasens et.al. |
2404.03353v1 |
null |
2024-04-04 |
A Comparative Analysis of Word-Level Metric Differential Privacy: Benchmarking The Privacy-Utility Trade-off |
Stephen Meisenbacher et.al. |
2404.03324v1 |
link |
2024-04-04 |
The Death of Feature Engineering? BERT with Linguistic Features on SQuAD 2.0 |
Jiawei Li et.al. |
2404.03184v1 |
null |
2024-04-03 |
Construction of Functional Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model |
Yanpeng Ye et.al. |
2404.03080v1 |
null |
2024-04-03 |
GPT-DETOX: An In-Context Learning-Based Paraphraser for Text Detoxification |
Ali Pesaranghader et.al. |
2404.03052v1 |
null |
2024-04-03 |
Automatic Prompt Selection for Large Language Models |
Viet-Tung Do et.al. |
2404.02717v1 |
null |
2024-04-03 |
Adversarial Attacks and Dimensionality in Text Classifiers |
Nandish Chattopadhyay et.al. |
2404.02660v1 |
null |
2024-04-03 |
Learn to Disguise: Avoid Refusal Responses in LLM's Defense via a Multi-agent Attacker-Disguiser Game |
Qianqiao Xu et.al. |
2404.02532v1 |
null |
2024-04-03 |
On the Efficiency and Robustness of Vibration-based Foundation Models for IoT Sensing: A Case Study |
Tomoyoshi Kimura et.al. |
2404.02461v1 |
null |
2024-04-03 |
Task Agnostic Architecture for Algorithm Induction via Implicit Composition |
Sahil J. Sindhi et.al. |
2404.02450v1 |
null |
2024-04-03 |
The Promises and Pitfalls of Using Language Models to Measure Instruction Quality in Education |
Paiheng Xu et.al. |
2404.02444v1 |
null |
2024-04-03 |
CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models |
Zaid Sheikh et.al. |
2404.02408v1 |
link |
2024-04-02 |
Corpus Considerations for Annotator Modeling and Scaling |
Olufunke O. Sarumi et.al. |
2404.02340v1 |
link |
2024-04-02 |
Comparative Study of Domain Driven Terms Extraction Using Large Language Models |
Sandeep Chataut et.al. |
2404.02330v1 |
null |
2024-04-02 |
Using Interpretation Methods for Model Enhancement |
Zhuo Chen et.al. |
2404.02068v1 |
link |
2024-04-02 |
BERTopic-Driven Stock Market Predictions: Unraveling Sentiment Insights |
Enmin Zhu et.al. |
2404.02053v1 |
null |
2024-04-02 |
Kallaama: A Transcribed Speech Dataset about Agriculture in the Three Most Widely Spoken Languages in Senegal |
Elodie Gauthier et.al. |
2404.01991v1 |
link |
2024-04-02 |
Team UTSA-NLP at SemEval 2024 Task 5: Prompt Ensembling for Argument Reasoning in Civil Procedures with GPT4 |
Dan Schumacher et.al. |
2404.01961v1 |
link |
2024-04-02 |
Classifying Graphemes in English Words Through the Application of a Fuzzy Inference System |
Samuel Rose et.al. |
2404.01953v1 |
null |
2024-04-02 |
Sentiment Analysis of Citations in Scientific Articles Using ChatGPT: Identifying Potential Biases and Conflicts of Interest |
Walid Hariri et.al. |
2404.01800v1 |
null |
2024-04-02 |
Can Humans Identify Domains? |
Maria Barrett et.al. |
2404.01785v1 |
link |
2024-04-02 |
M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets |
Gaurish Thakkar et.al. |
2404.01753v1 |
null |
2024-04-02 |
CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models |
Xuechen Liang et.al. |
2404.01663v1 |
link |
2024-04-02 |
mChartQA: A universal benchmark for multimodal Chart Question Answer based on Vision-Language Alignment and Reasoning |
Jingxuan Wei et.al. |
2404.01548v1 |
null |
2024-03-29 |
LayerNorm: A key component in parameter-efficient fine-tuning |
Taha ValizadehAslani et.al. |
2403.20284v1 |
null |
2024-03-29 |
ChatGPT v.s. Media Bias: A Comparative Study of GPT-3.5 and Fine-tuned Language Models |
Zehao Wen et.al. |
2403.20158v1 |
null |
2024-03-29 |
NLP for Counterspeech against Hate: A Survey and How-To Guide |
Helena Bonaldi et.al. |
2403.20103v1 |
null |
2024-03-29 |
Cross-Lingual Transfer Robustness to Lower-Resource Languages on Adversarial Datasets |
Shadi Manafi et.al. |
2403.20056v1 |
link |
2024-03-29 |
Colorful Cutout: Enhancing Image Data Augmentation with Curriculum Learning |
Juhwan Choi et.al. |
2403.20012v1 |
null |
2024-03-29 |
MANGO: A Benchmark for Evaluating Mapping and Navigation Abilities of Large Language Models |
Peng Ding et.al. |
2403.19913v1 |
link |
2024-03-28 |
Natural Language, AI, and Quantum Computing in 2024: Research Ingredients and Directions in QNLP |
Dominic Widdows et.al. |
2403.19758v1 |
null |
2024-03-28 |
Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data |
Shan Chen et.al. |
2403.19511v1 |
link |
2024-03-29 |
Uncovering Misattributed Suicide Causes through Annotation Inconsistency Detection in Death Investigation Notes |
Song Wang et.al. |
2403.19432v2 |
link |
2024-03-28 |
EthioMT: Parallel Corpus for Low-resource Ethiopian Languages |
Atnafu Lambebo Tonja et.al. |
2403.19365v1 |
null |
2024-03-28 |
A diverse Multilingual News Headlines Dataset from around the World |
Felix Leeb et.al. |
2403.19352v1 |
link |
2024-03-27 |
Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data |
Yuting Guo et.al. |
2403.19031v1 |
null |
2024-03-27 |
Resource Allocation in Large Language Model Integrated 6G Vehicular Networks |
Chang Liu et.al. |
2403.19016v1 |
null |
2024-03-27 |
A Survey on Large Language Models from Concept to Implementation |
Chen Wang et.al. |
2403.18969v1 |
null |
2024-03-27 |
Reshaping Free-Text Radiology Notes Into Structured Reports With Generative Transformers |
Laura Bergomi et.al. |
2403.18938v1 |
link |
2024-03-27 |
Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation |
Mateusz Klimaszewski et.al. |
2403.18804v1 |
link |
2024-03-27 |
3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation |
Ehsan Latif et.al. |
2403.18778v1 |
null |
2024-03-27 |
Transformers-based architectures for stroke segmentation: A review |
Yalda Zafari-Ghadim et.al. |
2403.18637v1 |
null |
2024-03-27 |
Debiasing Sentence Embedders through Contrastive Word Pairs |
Philip Kenneweg et.al. |
2403.18555v1 |
link |
2024-03-27 |
Neural Architecture Search for Sentence Classification with BERT |
Philip Kenneweg et.al. |
2403.18547v1 |
link |
2024-03-27 |
Faster Convergence for Transformer Fine-tuning with Line Search Methods |
Philip Kenneweg et.al. |
2403.18506v1 |
link |
2024-03-27 |
SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks |
Brian Formento et.al. |
2403.18423v1 |
link |
2024-03-27 |
Improving Attributed Text Generation of Large Language Models via Preference Learning |
Dongfang Li et.al. |
2403.18381v1 |
null |
2024-03-27 |
mALBERT: Is a Compact Multilingual BERT Model Still Worth It? |
Christophe Servan et.al. |
2403.18338v1 |
null |
2024-03-27 |
RankMamba, Benchmarking Mamba's Document Ranking Performance in the Era of Transformers |
Zhichao Xu et.al. |
2403.18276v1 |
link |
2024-03-26 |
OmniVid: A Generative Framework for Universal Video Understanding |
Junke Wang et.al. |
2403.17935v1 |
link |
2024-03-26 |
Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications |
Philip Lippmann et.al. |
2403.17860v1 |
null |
2024-03-26 |
ArabicaQA: A Comprehensive Dataset for Arabic Question Answering |
Abdelrahman Abdallah et.al. |
2403.17848v1 |
link |
2024-03-26 |
Graph Language Model (GLM): A new graph-based approach to detect social instabilities |
Wallyson Lemes de Oliveira et.al. |
2403.17816v1 |
null |
2024-03-26 |
Are Compressed Language Models Less Subgroup Robust? |
Leonidas Gee et.al. |
2403.17811v1 |
link |
2024-03-26 |
A Survey on Deep Learning and State-of-the-arts Applications |
Mohd Halim Mohd Noor et.al. |
2403.17561v1 |
null |
2024-03-26 |
Practical Applications of Advanced Cloud Services and Generative AI Systems in Medical Image Analysis |
Jingyu Xu et.al. |
2403.17549v1 |
null |
2024-03-26 |
An Empirical Study of ChatGPT-related projects on GitHub |
Zheng Lin et.al. |
2403.17437v1 |
null |
2024-03-26 |
Transcribing Bengali Text with Regional Dialects to IPA using District Guided Tokens |
S M Jishanul Islam et.al. |
2403.17407v1 |
null |
2024-03-26 |
Extracting Biomedical Entities from Noisy Audio Transcripts |
Nima Ebadi et.al. |
2403.17363v1 |
null |
2024-03-25 |
Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning? |
Shaoxiong Ji et.al. |
2403.16777v1 |
null |
2024-03-25 |
NSINA: A News Corpus for Sinhala |
Hansi Hettiarachchi et.al. |
2403.16571v1 |
link |
2024-03-25 |
Harnessing the power of LLMs for normative reasoning in MASs |
Bastin Tony Roy Savarimuthu et.al. |
2403.16524v1 |
null |
2024-03-25 |
Linguistically Differentiating Acts and Recalls of Racial Microaggressions on Social Media |
Uma Sushmitha Gunturi et.al. |
2403.16514v1 |
null |
2024-03-25 |
$\textit{LinkPrompt}$ : Natural and Universal Adversarial Attacks on Prompt-based Language Models |
Yue Xu et.al. |
2403.16432v1 |
link |
2024-03-24 |
Large Language Models in Biomedical and Health Informatics: A Bibliometric Review |
Huizi Yu et.al. |
2403.16303v1 |
null |
2024-03-24 |
Image Captioning in news report scenario |
Tianrui Liu et.al. |
2403.16209v1 |
null |
2024-03-24 |
Korean Bio-Medical Corpus (KBMC) for Medical Named Entity Recognition |
Sungjoo Byun et.al. |
2403.16158v1 |
null |
2024-03-24 |
A Survey on Lexical Ambiguity Detection and Word Sense Disambiguation |
Miuru Abeysiriwardana et.al. |
2403.16129v1 |
null |
2024-03-23 |
LlamBERT: Large-scale low-cost data annotation in NLP |
Bálint Csanády et.al. |
2403.15938v1 |
link |
2024-03-23 |
RAAMove: A Corpus for Analyzing Moves in Research Article Abstracts |
Hongzheng Li et.al. |
2403.15872v1 |
null |
2024-03-22 |
Towards Deep Learning Enabled Cybersecurity Risk Assessment for Microservice Architectures |
Majid Abdulsatar et.al. |
2403.15169v1 |
null |
2024-03-22 |
CHisIEC: An Information Extraction Corpus for Ancient Chinese History |
Xuemei Tang et.al. |
2403.15088v1 |
null |
2024-03-22 |
Construction of a Japanese Financial Benchmark for Large Language Models |
Masanori Hirano et.al. |
2403.15062v1 |
link |
2024-03-22 |
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement |
Nicholas Lee et.al. |
2403.15042v1 |
link |
2024-03-22 |
On Zero-Shot Counterspeech Generation by LLMs |
Punyajoy Saha et.al. |
2403.14938v1 |
link |
2024-03-21 |
Reversible Jump Attack to Textual Classifiers with Modification Reduction |
Mingze Ni et.al. |
2403.14731v1 |
link |
2024-03-21 |
PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model |
Zheng Zhang et.al. |
2403.14598v1 |
link |
2024-03-21 |
ChatGPT Alternative Solutions: Large Language Models Survey |
Hanieh Alipour et.al. |
2403.14469v1 |
null |
2024-03-21 |
From Perils to Possibilities: Understanding how Human (and AI) Biases affect Online Fora |
Virginia Morini et.al. |
2403.14298v1 |
null |
2024-03-21 |
Dermacen Analytica: A Novel Methodology Integrating Multi-Modal Large Language Models with Machine Learning in tele-dermatology |
Dimitrios P. Panagoulias et.al. |
2403.14243v1 |
null |
2024-03-21 |
Extracting Emotion Phrases from Tweets using BART |
Mahdi Rezapour et.al. |
2403.14050v1 |
null |
2024-03-21 |
The NeurIPS 2023 Machine Learning for Audio Workshop: Affective Audio Benchmarks and Novel Data |
Alice Baird et.al. |
2403.14048v1 |
null |
2024-03-20 |
Leveraging Linguistically Enhanced Embeddings for Open Information Extraction |
Fauzan Farooqui et.al. |
2403.13903v1 |
null |
2024-03-20 |
EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation |
Atnafu Lambebo Tonja et.al. |
2403.13737v1 |
null |
2024-03-20 |
Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models |
Chengzhe Feng et.al. |
2403.13588v1 |
null |
2024-03-20 |
How Gender Interacts with Political Values: A Case Study on Czech BERT Models |
Adnan Al Ali et.al. |
2403.13514v1 |
null |
2024-03-20 |
Community Needs and Assets: A Computational Analysis of Community Conversations |
Md Towhidul Absar Chowdhury et.al. |
2403.13272v1 |
link |
2024-03-19 |
AdaFish: Fast low-rank parameter-efficient fine-tuning by using second-order information |
Jiang Hu et.al. |
2403.13128v1 |
null |
2024-03-19 |
Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Texts |
Sai Ashish Somayajula et.al. |
2403.12918v1 |
link |
2024-03-19 |
Comparing Explanation Faithfulness between Multilingual and Monolingual Fine-tuned Language Models |
Zhixue Zhao et.al. |
2403.12809v1 |
link |
2024-03-19 |
Quantixar: High-performance Vector Data Management System |
Gulshan Yadav et.al. |
2403.12583v1 |
null |
2024-03-19 |
Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices |
Sara Abdali et.al. |
2403.12503v1 |
null |
2024-03-19 |
Third-Party Language Model Performance Prediction from Instruction |
Rahul Nadkarni et.al. |
2403.12413v1 |
link |
2024-03-19 |
Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question Answering |
Yuan Gao et.al. |
2403.12393v1 |
null |
2024-03-19 |
AraPoemBERT: A Pretrained Language Model for Arabic Poetry Analysis |
Faisal Qarah et.al. |
2403.12392v1 |
null |
2024-03-19 |
Improving Generalizability of Extracting Social Determinants of Health Using Large Language Models through Prompt-tuning |
Cheng Peng et.al. |
2403.12374v1 |
null |
2024-03-18 |
Leveraging Large Language Models to Extract Information on Substance Use Disorder Severity from Clinical Notes: A Zero-shot Learning Approach |
Maria Mahbub et.al. |
2403.12297v1 |
null |
2024-03-18 |
Evaluating Named Entity Recognition: Comparative Analysis of Mono- and Multilingual Transformer Models on Brazilian Corporate Earnings Call Transcriptions |
Ramon Abilio et.al. |
2403.12212v1 |
link |
2024-03-17 |
ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization |
Mengsha Liu et.al. |
2403.11236v1 |
link |
2024-03-17 |
Multi-Objective Evolutionary Neural Architecture Search for Recurrent Neural Networks |
Reinhard Booysen et.al. |
2403.11173v1 |
link |
2024-03-17 |
Exploring Tokenization Strategies and Vocabulary Sizes for Enhanced Arabic Language Models |
Mohamed Taher Alrefaie et.al. |
2403.11130v1 |
null |
2024-03-17 |
RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning |
Javad Rafiei Asl et.al. |
2403.11082v1 |
null |
2024-03-17 |
Deep Learning-based Sentiment Analysis in Persian Language |
Mohammad Heydari et.al. |
2403.11069v1 |
null |
2024-03-16 |
DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages |
Fahim Faisal et.al. |
2403.11009v1 |
link |
2024-03-16 |
Energy-Based Models with Applications to Speech and Language Processing |
Zhijian Ou et.al. |
2403.10961v1 |
null |
2024-03-16 |
A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment |
Tianhe Wu et.al. |
2403.10854v1 |
link |
2024-03-16 |
Detecting Bias in Large Language Models: Fine-tuned KcBERT |
J. K. Lee et.al. |
2403.10774v1 |
null |
2024-03-15 |
A Multilingual Perspective on Probing Gender Bias |
Karolina Stańczak et.al. |
2403.10699v1 |
null |
2024-03-15 |
ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment |
Xiaofeng Wu et.al. |
2403.10504v1 |
null |
2024-03-15 |
TriSum: Learning Summarization Ability from Large Language Models with Structured Rationale |
Pengcheng Jiang et.al. |
2403.10351v1 |
null |
2024-03-15 |
NLP Verification: Towards a General Methodology for Certifying Robustness |
Marco Casadio et.al. |
2403.10144v1 |
null |
2024-03-15 |
Identifying Health Risks from Family History: A Survey of Natural Language Processing Techniques |
Xiang Dai et.al. |
2403.09997v1 |
null |
2024-03-15 |
ViTCN: Vision Transformer Contrastive Network For Reasoning |
Bo Song et.al. |
2403.09962v1 |
null |
2024-03-14 |
Fisher Mask Nodes for Language Model Merging |
Thennal D K et.al. |
2403.09891v1 |
link |
2024-03-14 |
Scaling Behavior of Machine Translation with Large Language Models under Prompt Injection Attacks |
Zhifan Sun et.al. |
2403.09832v1 |
link |
2024-03-14 |
Emotional Intelligence Through Artificial Intelligence : NLP and Deep Learning in the Analysis of Healthcare Texts |
Prashant Kumar Nag et.al. |
2403.09762v1 |
null |
2024-03-14 |
Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey |
Xiaoyu Liu et.al. |
2403.09606v1 |
null |
2024-03-14 |
PreConfig: A Pretrained Model for Automating Network Configuration |
Fuliang Li et.al. |
2403.09369v1 |
null |
2024-03-14 |
Exploring the Capabilities and Limitations of Large Language Models in the Electric Energy Sector |
Lin Dong et.al. |
2403.09125v1 |
null |
2024-03-14 |
Information Extraction: An application to the domain of hyper-local financial data on developing countries |
Abuzar Royesh et.al. |
2403.09077v1 |
null |
2024-03-13 |
Ethos: Rectifying Language Models in Orthogonal Parameter Space |
Lei Gao et.al. |
2403.08994v1 |
null |
2024-03-13 |
Predictive Analysis of Tuberculosis Treatment Outcomes Using Machine Learning: A Karnataka TB Data Study at a Scale |
SeshaSai Nath Chinagudaba et.al. |
2403.08834v1 |
null |
2024-03-13 |
SoK: Reducing the Vulnerability of Fine-tuned Language Models to Membership Inference Attacks |
Guy Amit et.al. |
2403.08481v1 |
null |
2024-03-13 |
Specification Overfitting in Artificial Intelligence |
Benjamin Roth et.al. |
2403.08425v1 |
null |
2024-03-12 |
VANP: Learning Where to See for Navigation with Self-Supervised Vision-Action Pre-Training |
Mohammad Nazeri et.al. |
2403.08109v1 |
null |
2024-03-12 |
Mechanics of Next Token Prediction with Self-Attention |
Yingcong Li et.al. |
2403.08081v1 |
null |
2024-03-12 |
Exploring Safety Generalization Challenges of Large Language Models via Code |
Qibing Ren et.al. |
2403.07865v1 |
null |
2024-03-12 |
Fine-tuning Neural Network Quantum States |
Riccardo Rende et.al. |
2403.07795v1 |
null |
2024-03-12 |
MoralBERT: Detecting Moral Values in Social Discourse |
Vjosa Preniqi et.al. |
2403.07678v1 |
null |
2024-03-12 |
A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions |
Quoc-Vinh Lai-Dang et.al. |
2403.07542v1 |
null |
2024-03-12 |
Generalised Graph Grammars for Natural Language Processing |
Oliver Robert Fox et.al. |
2403.07481v1 |
null |
2024-03-12 |
Knowledge Graph Large Language Model (KG-LLM) for Link Prediction |
Dong Shu et.al. |
2403.07311v1 |
null |
2024-03-11 |
LSTM-Based Text Generation: A Study on Historical Datasets |
Mustafa Abbas Hussein Hussein et.al. |
2403.07087v1 |
null |
2024-03-11 |
ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis |
Yanming Liu et.al. |
2403.06932v1 |
link |
2024-03-11 |
Application of Quantum Tensor Networks for Protein Classification |
Debarshi Kundu et.al. |
2403.06890v1 |
null |
2024-03-11 |
Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting |
Wenting Chen et.al. |
2403.06835v1 |
null |
2024-03-11 |
ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model |
Zhiwei Liu et.al. |
2403.06765v1 |
link |
2024-03-11 |
NLP4RE Tools: Classification, Overview, and Management |
Julian Frattini et.al. |
2403.06685v1 |
null |
2024-03-11 |
QuantTune: Optimizing Model Quantization with Adaptive Outlier-Driven Fine Tuning |
Jiun-Man Chen et.al. |
2403.06497v1 |
null |
2024-03-11 |
'One size doesn't fit all': Learning how many Examples to use for In-Context Learning for Improved Text Classification |
Manish Chandra et.al. |
2403.06402v1 |
null |
2024-03-11 |
Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource Languages |
Michael Andersland et.al. |
2403.06354v1 |
link |
2024-03-10 |
ArgMed-Agents: Explainable Clinical Decision Reasoning with Large Language Models via Argumentation Schemes |
Shengxin Hong et.al. |
2403.06294v1 |
null |
2024-03-10 |
In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model |
Junhui Yin et.al. |
2403.06126v1 |
null |
2024-03-08 |
Debiasing Large Visual Language Models |
Yi-Fan Zhang et.al. |
2403.05262v1 |
link |
2024-03-08 |
Benchmarking Large Language Models for Molecule Prediction Tasks |
Zhiqiang Zhong et.al. |
2403.05075v1 |
link |
2024-03-08 |
Can we obtain significant success in RST discourse parsing by using Large Language Models? |
Aru Maekawa et.al. |
2403.05065v1 |
link |
2024-03-07 |
Analysis of Systems' Performance in Natural Language Processing Competitions |
Sergio Nava-Muñoz et.al. |
2403.04693v1 |
null |
2024-03-07 |
Classist Tools: Social Class Correlates with Performance in NLP |
Amanda Cercas Curry et.al. |
2403.04445v1 |
null |
2024-03-07 |
Advancing Biomedical Text Mining with Community Challenges |
Hui Zong et.al. |
2403.04261v1 |
null |
2024-03-06 |
Enhancing Instructional Quality: Leveraging Computer-Assisted Textual Analysis to Generate In-Depth Insights from Educational Artifacts |
Zewei Tian et.al. |
2403.03920v1 |
null |
2024-03-06 |
Impoverished Language Technology: The Lack of (Social) Class in NLP |
Amanda Cercas Curry et.al. |
2403.03874v1 |
null |
2024-03-06 |
German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset |
Laura Mascarell et.al. |
2403.03750v1 |
link |
2024-03-06 |
Probabilistic Topic Modelling with Transformer Representations |
Arik Reuter et.al. |
2403.03737v1 |
link |
2024-03-06 |
Enhancing ASD detection accuracy: a combined approach of machine learning and deep learning models with natural language processing |
Sergio Rubio-Martín et.al. |
2403.03581v1 |
null |
2024-03-06 |
Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem |
Yuhong Sun et.al. |
2403.03558v1 |
link |
2024-03-05 |
TTPXHunter: Actionable Threat Intelligence Extraction as TTPs form Finished Cyber Threat Reports |
Nanda Rani et.al. |
2403.03267v1 |
null |
2024-03-05 |
Detecting Concrete Visual Tokens for Multimodal Machine Translation |
Braeden Bowen et.al. |
2403.03075v1 |
null |
2024-03-05 |
Data Augmentation using LLMs: Data Perspectives, Learning Paradigms and Challenges |
Bosheng Ding et.al. |
2403.02990v1 |
null |
2024-03-05 |
A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching |
Dong Yao et.al. |
2403.02975v1 |
null |
2024-03-05 |
SimuCourt: Building Judicial Decision-Making Agents with Real-world Judgement Documents |
Zhitao He et.al. |
2403.02959v1 |
link |
2024-03-05 |
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods |
Hanlei Jin et.al. |
2403.02901v1 |
null |
2024-03-05 |
Quantum Mixed-State Self-Attention Network |
Fu Chen et.al. |
2403.02871v1 |
null |
2024-03-05 |
Emerging Synergies Between Large Language Models and Machine Learning in Ecommerce Recommendations |
Xiaonan Xu et.al. |
2403.02760v1 |
null |
2024-03-05 |
Causal Prompting: Debiasing Large Language Model Prompting based on Front-Door Adjustment |
Congzhi Zhang et.al. |
2403.02738v1 |
null |
2024-03-05 |
Privacy-Aware Semantic Cache for Large Language Models |
Waris Gill et.al. |
2403.02694v1 |
null |
2024-03-04 |
A Tutorial on the Pretrain-Finetune Paradigm for Natural Language Processing |
Yu Wang et.al. |
2403.02504v1 |
null |
2024-03-02 |
LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems |
Tasnim Ahmed et.al. |
2403.01342v1 |
null |
2024-03-02 |
VNLP: Turkish NLP Package |
Meliksah Turker et.al. |
2403.01309v1 |
null |
2024-03-02 |
VBART: The Turkish LLM |
Meliksah Turker et.al. |
2403.01308v1 |
null |
2024-03-02 |
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact |
Ruikang Liu et.al. |
2403.01241v1 |
null |
2024-03-02 |
Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions |
Flor Miriam Plaza-del-Arco et.al. |
2403.01222v1 |
link |
2024-03-02 |
Evaluating Large Language Models as Virtual Annotators for Time-series Physical Sensing Data |
Aritra Hota et.al. |
2403.01133v1 |
null |
2024-03-01 |
Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning |
Christian Raymond et.al. |
2403.00865v1 |
link |
2024-03-01 |
Beyond Single-Model Views for Deep Learning: Optimization versus Generalizability of Stochastic Optimization Algorithms |
Toki Tahmid Inan et.al. |
2403.00574v1 |
null |
2024-03-01 |
Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese |
Yuqi Chen et.al. |
2403.00509v1 |
null |
2024-03-01 |
Gender Bias in Large Language Models across Multiple Languages |
Jinman Zhao et.al. |
2403.00277v1 |
null |
2024-02-29 |
Accelerating materials discovery for polymer solar cells: Data-driven insights enabled by natural language processing |
Pranav Shetty et.al. |
2402.19462v1 |
link |
2024-02-29 |
Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines |
Lijia Ma et.al. |
2402.19421v1 |
null |
2024-02-29 |
Here's a Free Lunch: Sanitizing Backdoored Models with Model Merge |
Ansh Arora et.al. |
2402.19334v1 |
null |
2024-02-29 |
Improving Legal Judgement Prediction in Romanian with Long Text Encoders |
Mihai Masala et.al. |
2402.19170v1 |
null |
2024-02-29 |
Beyond Language Models: Byte Models are Digital World Simulators |
Shangda Wu et.al. |
2402.19155v1 |
null |
2024-02-29 |
Enhancing Steganographic Text Extraction: Evaluating the Impact of NLP Models on Accuracy and Semantic Coherence |
Mingyang Li et.al. |
2402.18849v1 |
null |
2024-02-29 |
MPAT: Building Robust Deep Neural Networks against Textual Adversarial Attacks |
Fangyuan Zhang et.al. |
2402.18792v1 |
null |
2024-02-28 |
Learning to Compress Prompt in Natural Language Formats |
Yu-Neng Chuang et.al. |
2402.18700v1 |
null |
2024-02-28 |
Large Language Models and Games: A Survey and Roadmap |
Roberto Gallotta et.al. |
2402.18659v1 |
null |
2024-02-28 |
Tokenization Is More Than Compression |
Craig W. Schmidt et.al. |
2402.18376v1 |
null |
2024-02-28 |
Towards Better Understanding of Contrastive Sentence Representation Learning: A Unified Paradigm for Gradient |
Mingxin Li et.al. |
2402.18281v1 |
null |
2024-02-28 |
Challenges in Pre-Training Graph Neural Networks for Context-Based Fake News Detection: An Evaluation of Current Strategies and Resource Limitations |
Gregor Donabauer et.al. |
2402.18179v1 |
link |
2024-02-28 |
Learning Intrinsic Dimension via Information Bottleneck for Explainable Aspect-based Sentiment Analysis |
Zhenxiao Cheng et.al. |
2402.18145v1 |
null |
2024-02-28 |
Saving the legacy of Hero Ibash: Evaluating Four Language Models for Aminoacian |
Yunze Xiao et.al. |
2402.18121v1 |
null |
2024-02-28 |
Using Text Embeddings for Deductive Qualitative Research at Scale in Physics Education |
Tor Ole B. Odden et.al. |
2402.18087v1 |
link |
2024-02-28 |
Data augmentation method for modeling health records with applications to clopidogrel treatment failure detection |
Sunwoong Choi et.al. |
2402.18046v1 |
null |
2024-02-28 |
Crisis talk: analysis of the public debate around the energy crisis and cost of living |
Rrubaa Panchendrarajan et.al. |
2402.18043v1 |
null |
2024-02-28 |
Datasets for Large Language Models: A Comprehensive Survey |
Yang Liu et.al. |
2402.18041v1 |
link |
2024-02-28 |
Gradient-Free Adaptive Global Pruning for Pre-trained Language Models |
Guangji Bai et.al. |
2402.17946v1 |
link |
2024-02-27 |
Navigator: A Decentralized Scheduler for Latency-Sensitive ML Workflows |
Yuting Yang et.al. |
2402.17652v1 |
null |
2024-02-27 |
From Text Segmentation to Smart Chaptering: A Novel Benchmark for Structuring Video Transcriptions |
Fabian Retkowski et.al. |
2402.17633v1 |
null |
2024-02-27 |
Neural Automated Writing Evaluation with Corrective Feedback |
Izia Xiaoxiao Wang et.al. |
2402.17613v1 |
null |
2024-02-27 |
Extreme Miscalibration and the Illusion of Adversarial Robustness |
Vyas Raina et.al. |
2402.17509v1 |
null |
2024-02-27 |
Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey |
Dinh-Viet-Toan Le et.al. |
2402.17467v1 |
link |
2024-02-27 |
Benchmarking GPT-4 on Algorithmic Problems: A Systematic Evaluation of Prompting Strategies |
Flavio Petruzzellis et.al. |
2402.17396v1 |
null |
2024-02-27 |
FairBelief - Assessing Harmful Beliefs in Language Models |
Mattia Setzu et.al. |
2402.17389v1 |
null |
2024-02-27 |
Curriculum Learning Meets Directed Acyclic Graph for Multimodal Emotion Recognition |
Cam-Van Thi Nguyen et.al. |
2402.17269v1 |
link |
2024-02-27 |
Deep Learning-Based Speech and Vision Synthesis to Improve Phishing Attack Detection through a Multi-layer Adaptive Framework |
Tosin Ige et.al. |
2402.17249v1 |
null |
2024-02-27 |
Does Negative Sampling Matter? A Review with Insights into its Theory and Applications |
Zhen Yang et.al. |
2402.17238v1 |
null |
2024-02-26 |
ProLLaMA: A Protein Large Language Model for Multi-Task Protein Language Processing |
Liuzhenghao Lv et.al. |
2402.16445v1 |
link |
2024-02-26 |
MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property |
Shiwen Ni et.al. |
2402.16389v1 |
link |
2024-02-25 |
From Text to Transformation: A Comprehensive Review of Large Language Models' Versatility |
Pravneet Kaur et.al. |
2402.16142v1 |
null |
2024-02-25 |
Deep Learning Approaches for Improving Question Answering Systems in Hepatocellular Carcinoma Research |
Shuning Huo et.al. |
2402.16038v1 |
null |
2024-02-25 |
$C^3$ : Confidence Calibration Model Cascade for Inference-Efficient Cross-Lingual Natural Language Understanding |
Taixi Lu et.al. |
2402.15991v1 |
null |
2024-02-24 |
SportQA: A Benchmark for Sports Understanding in Large Language Models |
Haotian Xia et.al. |
2402.15862v1 |
null |
2024-02-24 |
Prompt Perturbation Consistency Learning for Robust Language Models |
Yao Qiang et.al. |
2402.15833v1 |
null |
2024-02-24 |
Linguistic Intelligence in Large Language Models for Telecommunications |
Tasnim Ahmed et.al. |
2402.15818v1 |
null |
2024-02-23 |
Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models |
Yanzheng Xiang et.al. |
2402.15637v1 |
null |
2024-02-23 |
Transformers are Expressive, But Are They Expressive Enough for Regression? |
Swaroop Nath et.al. |
2402.15478v1 |
link |
2024-02-23 |
United We Pretrain, Divided We Fail! Representation Learning for Time Series by Pretraining on 75 Datasets at Once |
Maurice Kraus et.al. |
2402.15404v1 |
null |
2024-02-23 |
Fine-Grained Detoxification via Instance-Level Prefixes for Large Language Models |
Xin Yi et.al. |
2402.15202v1 |
null |
2024-02-23 |
Improving Sentence Embeddings with an Automatically Generated NLI Dataset |
Soma Sato et.al. |
2402.15132v1 |
null |
2024-02-23 |
Descripción automática de secciones delgadas de rocas: una aplicación Web |
Stalyn Paucar et.al. |
2402.15039v1 |
null |
2024-02-22 |
Ar-Spider: Text-to-SQL in Arabic |
Saleh Almohaimeed et.al. |
2402.15012v1 |
null |
2024-02-22 |
How Important Is Tokenization in French Medical Masked Language Models? |
Yanis Labrak et.al. |
2402.15010v1 |
null |
2024-02-22 |
LLMs with Industrial Lens: Deciphering the Challenges and Prospects -- A Survey |
Ashok Urlana et.al. |
2402.14558v1 |
null |
2024-02-22 |
Should We Respect LLMs? A Cross-Lingual Study on the Influence of Prompt Politeness on LLM Performance |
Ziqi Yin et.al. |
2402.14531v1 |
null |
2024-02-22 |
Malaysian English News Decoded: A Linguistic Resource for Named Entity and Relation Extraction |
Mohan Raj Chanthran et.al. |
2402.14521v1 |
link |
2024-02-22 |
SpanSeq: Similarity-based sequence data splitting method for improved development and assessment of deep learning projects |
Alfred Ferrer Florensa et.al. |
2402.14482v1 |
link |
2024-02-22 |
Novi jezički modeli za srpski jezik |
Mihailo Škorić et.al. |
2402.14379v1 |
null |
2024-02-22 |
Vision-Language Navigation with Embodied Intelligence: A Survey |
Peng Gao et.al. |
2402.14304v1 |
null |
2024-02-22 |
Mitigating Biases of Large Language Models in Stance Detection with Calibration |
Ang Li et.al. |
2402.14296v1 |
null |
2024-02-22 |
Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education |
Rui Yang et.al. |
2402.14293v1 |
link |
2024-02-22 |
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding |
Yu-Qi Yang et.al. |
2402.14215v1 |
link |
2024-02-22 |
Content Conditional Debiasing for Fair Text Embedding |
Wenlong Deng et.al. |
2402.14208v1 |
null |
2024-02-21 |
Beyond Probabilities: Unveiling the Misalignment in Evaluating Large Language Models |
Chenyang Lyu et.al. |
2402.13887v1 |
null |
2024-02-21 |
Using Large Language Models for Natural Language Processing Tasks in Requirements Engineering: A Systematic Guideline |
Andreas Vogelsang et.al. |
2402.13823v1 |
null |
2024-02-21 |
Beyond Hate Speech: NLP's Challenges and Opportunities in Uncovering Dehumanizing Language |
Hezhao Zhang et.al. |
2402.13818v1 |
null |
2024-02-21 |
From Text to CQL: Bridging Natural Language and Corpus Search Engine |
Luming Lu et.al. |
2402.13740v1 |
null |
2024-02-21 |
RESTRuler: Towards Automatically Identifying Violations of RESTful Design Rules in Web APIs |
Justus Bogner et.al. |
2402.13710v1 |
null |
2024-02-21 |
CMNER: A Chinese Multimodal NER Dataset based on Social Media |
Yuanze Ji et.al. |
2402.13693v1 |
link |
2024-02-21 |
An Augmented Lagrangian Method for Training Recurrent Neural Networks |
Yue Wang et.al. |
2402.13687v1 |
null |
2024-02-21 |
Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning |
Zhaorui Yang et.al. |
2402.13669v1 |
link |
2024-02-21 |
Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions |
Lei Pan et.al. |
2402.13647v1 |
null |
2024-02-21 |
Overview of the VLSP 2023 -- ComOM Shared Task: A Data Challenge for Comparative Opinion Mining from Vietnamese Product Reviews |
Hoang-Quynh Le et.al. |
2402.13613v1 |
null |
2024-02-20 |
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models |
Yizhi LI et.al. |
2402.13109v1 |
null |
2024-02-20 |
Few shot clinical entity recognition in three languages: Masked language models outperform LLM prompting |
Marco Naguib et.al. |
2402.12801v1 |
null |
2024-02-19 |
Predicting trucking accidents with truck drivers 'safety climate perception across companies: A transfer learning approach |
Kailai Sun et.al. |
2402.12417v1 |
null |
2024-02-19 |
Analysis of Persian News Agencies on Instagram, A Words Co-occurrence Graph-based Approach |
Mohammad Heydari et.al. |
2402.12272v1 |
null |
2024-02-19 |
Synthetic location trajectory generation using categorical diffusion models |
Simon Dirmeier et.al. |
2402.12242v1 |
link |
2024-02-19 |
Language Model Adaptation to Specialized Domains through Selective Masking based on Genre and Topical Characteristics |
Anas Belfathi et.al. |
2402.12036v1 |
link |
2024-02-19 |
Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space |
Zongru Wu et.al. |
2402.12026v1 |
null |
2024-02-19 |
Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing? |
Marco Gaido et.al. |
2402.12025v1 |
null |
2024-02-19 |
What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects |
Verena Blaschke et.al. |
2402.11968v1 |
null |
2024-02-19 |
DB-LLM: Accurate Dual-Binarization for Efficient LLMs |
Hong Chen et.al. |
2402.11960v1 |
null |
2024-02-19 |
AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization |
Jiyao Li et.al. |
2402.11940v1 |
null |
2024-02-19 |
Semantic Textual Similarity Assessment in Chest X-ray Reports Using a Domain-Specific Cosine-Based Metric |
Sayeh Gholipour Picha et.al. |
2402.11908v1 |
link |
2024-02-19 |
InMD-X: Large Language Models for Internal Medicine Doctors |
Hansle Gwon et.al. |
2402.11883v1 |
null |
2024-02-16 |
Construction of a Syntactic Analysis Map for Yi Shui School through Text Mining and Natural Language Processing Research |
Hanqing Zhao et.al. |
2402.10743v1 |
null |
2024-02-16 |
Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm |
Yuanzhen Xie et.al. |
2402.10671v1 |
link |
2024-02-16 |
Fine Tuning Named Entity Extraction Models for the Fantasy Domain |
Aravinth Sivaganeshan et.al. |
2402.10662v1 |
null |
2024-02-16 |
Linear Transformers with Learnable Kernel Functions are Better In-Context Models |
Yaroslav Aksenov et.al. |
2402.10644v1 |
link |
2024-02-16 |
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation |
Dayou Du et.al. |
2402.10631v1 |
link |
2024-02-16 |
Zero-shot sampling of adversarial entities in biomedical question answering |
R. Patrick Xian et.al. |
2402.10527v1 |
null |
2024-02-16 |
Parametric Augmentation for Time Series Contrastive Learning |
Xu Zheng et.al. |
2402.10434v1 |
link |
2024-02-16 |
Understanding In-Context Learning with a Pelican Soup Framework |
Ting-Rui Chiang et.al. |
2402.10424v1 |
null |
2024-02-16 |
LogELECTRA: Self-supervised Anomaly Detection for Unstructured Logs |
Yuuki Yamanaka et.al. |
2402.10397v1 |
null |
2024-02-15 |
Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention |
Romain Ilbert et.al. |
2402.10198v1 |
link |
2024-02-15 |
Reusing Softmax Hardware Unit for GELU Computation in Transformers |
Christodoulos Peltekis et.al. |
2402.10118v1 |
link |
2024-02-15 |
Balancing the Causal Effects in Class-Incremental Learning |
Junhao Zheng et.al. |
2402.10063v1 |
null |
2024-02-15 |
Fast Vocabulary Transfer for Language Model Compression |
Leonidas Gee et.al. |
2402.09977v1 |
null |
2024-02-15 |
Multi-Word Tokenization for Sequence Compression |
Leonidas Gee et.al. |
2402.09949v1 |
link |
2024-02-15 |
BUSTER: a "BUSiness Transaction Entity Recognition" dataset |
Andrea Zugarini et.al. |
2402.09916v1 |
null |
2024-02-15 |
Camouflage is all you need: Evaluating and Enhancing Language Model Robustness Against Camouflage Adversarial Attacks |
Álvaro Huertas-García et.al. |
2402.09874v1 |
null |
2024-02-15 |
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent |
Quentin Gallouédec et.al. |
2402.09844v1 |
link |
2024-02-15 |
All in One and One for All: A Simple yet Effective Method towards Cross-domain Graph Pretraining |
Haihong Zhao et.al. |
2402.09834v1 |
null |
2024-02-14 |
LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset |
Botao Yu et.al. |
2402.09391v1 |
link |
2024-02-14 |
Leveraging Large Language Models for Enhanced NLP Task Performance through Knowledge Distillation and Optimized Training Strategies |
Yining Huang et.al. |
2402.09282v1 |
null |
2024-02-14 |
Personalized Large Language Models |
Stanisław Woźniak et.al. |
2402.09269v1 |
null |
2024-02-14 |
Advancing NLP Models with Strategic Text Augmentation: A Comprehensive Study of Augmentation Methods and Curriculum Strategies |
Himmet Toprak Kesgin et.al. |
2402.09141v1 |
null |
2024-02-14 |
SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks |
Jiwon Song et.al. |
2402.09025v1 |
link |
2024-02-14 |
Research and application of Transformer based anomaly detection model: A literature review |
Mingrui Ma et.al. |
2402.08975v1 |
null |
2024-02-13 |
BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation |
Omid Nejati Manzari et.al. |
2402.08793v1 |
link |
2024-02-13 |
COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability |
Xingang Guo et.al. |
2402.08679v1 |
link |
2024-02-13 |
Online Foundation Model Selection in Robotics |
Po-han Li et.al. |
2402.08570v1 |
null |
2024-02-13 |
Intriguing Differences Between Zero-Shot and Systematic Evaluations of Vision-Language Transformer Models |
Shaeke Salman et.al. |
2402.08473v1 |
null |
2024-02-13 |
Generating Java Methods: An Empirical Assessment of Four AI-Based Code Assistants |
Vincenzo Corso et.al. |
2402.08431v1 |
null |
2024-02-13 |
Visual Question Answering Instruction: Unlocking Multimodal Large Language Model To Domain-Specific Visual Multitasks |
Jusung Lee et.al. |
2402.08360v1 |
null |
2024-02-13 |
Explicit References to Social Values in Fairy Tales: A Comparison between Three European Cultures |
Alba Morollon Diaz-Faes et.al. |
2402.08318v1 |
link |
2024-02-13 |
QuApprox: A Framework for Benchmarking the Approximability of Variational Quantum Circuit |
Jinyang Li et.al. |
2402.08261v1 |
null |
2024-02-13 |
A survey of recent methods for addressing AI fairness and bias in biomedicine |
Yifan Yang et.al. |
2402.08250v1 |
null |
2024-02-12 |
Enhancing Amharic-LLaMA: Integrating Task Specific and Generative Datasets |
Israel Abebe Azime et.al. |
2402.08015v1 |
null |
2024-02-12 |
Empowering Federated Learning for Massive Models with NVIDIA FLARE |
Holger R. Roth et.al. |
2402.07792v1 |
null |
2024-02-12 |
Text Detoxification as Style Transfer in English and Hindi |
Sourabrata Mukherjee et.al. |
2402.07767v1 |
null |
2024-02-12 |
AraSpider: Democratizing Arabic-to-SQL |
Ahmed Heakl et.al. |
2402.07448v1 |
link |
2024-02-12 |
Dólares or Dollars? Unraveling the Bilingual Prowess of Financial LLMs Between Spanish and English |
Xiao Zhang et.al. |
2402.07405v1 |
link |
2024-02-12 |
Beyond the Headlines: Understanding Sentiments and Morals Impacting Female Employment in Spain |
Oscar Araque et.al. |
2402.07339v1 |
null |
2024-02-11 |
Differentially Private Training of Mixture of Experts Models |
Pierre Tholoniat et.al. |
2402.07334v1 |
null |
2024-02-11 |
Insights into Natural Language Database Query Errors: From Attention Misalignment to User Handling Strategies |
Zheng Ning et.al. |
2402.07304v1 |
null |
2024-02-11 |
TransGPT: Multi-modal Generative Pre-trained Transformer for Transportation |
Peng Wang et.al. |
2402.07233v1 |
null |
2024-02-11 |
Learning by Watching: A Review of Video-based Learning Approaches for Robot Manipulation |
Chrisantus Eze et.al. |
2402.07127v1 |
null |
2024-02-10 |
In-Context Data Distillation with TabPFN |
Junwei Ma et.al. |
2402.06971v1 |
null |
2024-02-09 |
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning |
Shivalika Singh et.al. |
2402.06619v1 |
null |
2024-02-09 |
FaBERT: Pre-training BERT on Persian Blogs |
Mostafa Masumi et.al. |
2402.06617v1 |
null |
2024-02-09 |
TIC: Translate-Infer-Compile for accurate 'text to plan' using LLMs and logical intermediate representations |
Sudhir Agarwal et.al. |
2402.06608v1 |
null |
2024-02-09 |
G-SciEdBERT: A Contextualized LLM for Science Assessment Tasks in German |
Ehsan Latif et.al. |
2402.06584v1 |
null |
2024-02-09 |
A Unified Causal View of Instruction Tuning |
Lu Chen et.al. |
2402.06220v1 |
null |
2024-02-08 |
On the Convergence of Zeroth-Order Federated Tuning in Large Language Models |
Zhenqing Ling et.al. |
2402.05926v1 |
null |
2024-02-08 |
FAQ-Gen: An automated system to generate domain-specific FAQs to aid content comprehension |
Sahil Kale et.al. |
2402.05812v1 |
null |
2024-02-08 |
Efficient Models for the Detection of Hate, Abuse and Profanity |
Christoph Tillmann et.al. |
2402.05624v1 |
null |
2024-02-08 |
Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings |
Elena Senger et.al. |
2402.05617v1 |
null |
2024-02-08 |
Benchmarking Large Language Models on Communicative Medical Coaching: a Novel System and Dataset |
Hengguan Huang et.al. |
2402.05547v1 |
null |
2024-02-08 |
GPT-4 Generated Narratives of Life Events using a Structured Narrative Prompt: A Validation Study |
Christopher J. Lynch et.al. |
2402.05435v1 |
null |
2024-02-07 |
PAC Learnability under Explanation-Preserving Graph Perturbations |
Xu Zheng et.al. |
2402.05039v1 |
null |
2024-02-07 |
An Enhanced Prompt-Based LLM Reasoning Scheme via Knowledge Graph-Integrated Collaboration |
Yihao Li et.al. |
2402.04978v1 |
null |
2024-02-07 |
Chatbots in Knowledge-Intensive Contexts: Comparing Intent and LLM-Based Systems |
Samuel Kernan Freire et.al. |
2402.04955v1 |
null |
2024-02-07 |
SPARQL Generation: an analysis on fine-tuning OpenLLaMA for Question Answering over a Life Science Knowledge Graph |
Julio C. Rangel et.al. |
2402.04627v1 |
link |
2024-02-07 |
Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models |
Chirag Agarwal et.al. |
2402.04614v1 |
null |
2024-02-07 |
RA-Rec: An Efficient ID Representation Alignment Framework for LLM-based Recommendation |
Xiaohan Yu et.al. |
2402.04527v1 |
null |
2024-02-07 |
Developments in Sheaf-Theoretic Models of Natural Language Ambiguities |
Kin Ian Lo et.al. |
2402.04505v1 |
null |
2024-02-06 |
Adaptive Inference: Theoretical Limits and Unexplored Opportunities |
Soheil Hor et.al. |
2402.04359v1 |
null |
2024-02-06 |
LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text |
Dor Bernsohn et.al. |
2402.04335v1 |
link |
2024-02-06 |
Explaining Autonomy: Enhancing Human-Robot Interaction through Explanation Generation with Large Language Models |
David Sobrín-Hidalgo et.al. |
2402.04206v1 |
null |
2024-02-06 |
Scientific Language Modeling: A Quantitative Review of Large Language Models in Molecular Science |
Pengfei Liu et.al. |
2402.04119v1 |
link |
2024-02-06 |
The Use of a Large Language Model for Cyberbullying Detection |
Bayode Ogunleye et.al. |
2402.04088v1 |
null |
2024-02-06 |
Systematic Biases in LLM Simulations of Debates |
Amir Taubenfeld et.al. |
2402.04049v1 |
null |
2024-02-06 |
AlbNews: A Corpus of Headlines for Topic Modeling in Albanian |
Erion Çano et.al. |
2402.04028v1 |
link |
2024-02-06 |
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs |
Simone Balloccu et.al. |
2402.03927v1 |
null |
2024-02-06 |
Intensive Vision-guided Network for Radiology Report Generation |
Fudan Zheng et.al. |
2402.03754v1 |
null |
2024-02-06 |
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies |
Zhixuan Chu et.al. |
2402.03628v1 |
null |
2024-02-06 |
Partially Recentralization Softmax Loss for Vision-Language Models Robustness |
Hao Wang et.al. |
2402.03627v1 |
null |
2024-02-05 |
Is Mamba Capable of In-Context Learning? |
Riccardo Grazzi et.al. |
2402.03170v1 |
link |
2024-02-05 |
EEVEE: An Easy Annotation Tool for Natural Language Processing |
Axel Sorensen et.al. |
2402.02864v1 |
null |
2024-02-05 |
Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate |
Can Jin et.al. |
2402.02769v1 |
link |
2024-02-04 |
It's how you do things that matters": Attending to Process to Better Serve Indigenous Communities with Language Technologies |
Ned Cooper et.al. |
2402.02639v1 |
null |
2024-02-04 |
Predicting Machine Translation Performance on Low-Resource Languages: The Role of Domain Similarity |
Eric Khiu et.al. |
2402.02633v1 |
null |
2024-02-04 |
DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging |
Matteo Pagliardini et.al. |
2402.02622v1 |
null |
2024-02-04 |
ClipFormer: Key-Value Clipping of Transformers on Memristive Crossbars for Write Noise Mitigation |
Abhiroop Bhattacharjee et.al. |
2402.02586v1 |
null |
2024-02-04 |
A Quantitative Discourse Analysis of Asian Workers in the US Historical Newspapers |
Jaihyun Park et.al. |
2402.02572v1 |
null |
2024-02-04 |
Integration of cognitive tasks into artificial general intelligence test for large models |
Youzhi Qu et.al. |
2402.02547v1 |
null |
2024-02-04 |
Absolute convergence and error thresholds in non-active adaptive sampling |
Manuel Vilares Ferro et.al. |
2402.02522v1 |
null |
2024-02-02 |
Code-Switched Language Identification is Harder Than You Think |
Laurie Burchell et.al. |
2402.01505v1 |
link |
2024-02-02 |
From Words to Molecules: A Survey of Large Language Models in Chemistry |
Chang Liao et.al. |
2402.01439v1 |
null |
2024-02-02 |
Beyond the Answers: Reviewing the Rationality of Multiple Choice Question Answering for the Evaluation of Large Language Models |
Haochun Wang et.al. |
2402.01349v1 |
null |
2024-02-02 |
Efficient Prompt Caching via Embedding Similarity |
Hanlin Zhu et.al. |
2402.01173v1 |
null |
2024-02-02 |
A Survey for Foundation Models in Autonomous Driving |
Haoxiang Gao et.al. |
2402.01105v1 |
null |
2024-02-01 |
Domain-Independent Deception: A New Taxonomy and Linguistic Analysis |
Rakesh M. Verma et.al. |
2402.01019v1 |
null |
2024-02-01 |
HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent |
Weijie Xu et.al. |
2402.01018v1 |
link |
2024-02-01 |
An Information-Theoretic Approach to Analyze NLP Classification Tasks |
Luran Wang et.al. |
2402.00978v1 |
link |
2024-02-01 |
SPARQL Generation with Entity Pre-trained GPT for KG Question Answering |
Diego Bustamante et.al. |
2402.00969v1 |
link |
2024-02-01 |
Can Large Language Models Understand Context? |
Yilun Zhu et.al. |
2402.00858v1 |
null |
2024-02-01 |
ReAGent: Towards A Model-agnostic Feature Attribution Method for Generative Language Models |
Zhixue Zhao et.al. |
2402.00794v1 |
link |
2024-02-01 |
Neural Policy Style Transfer |
Raul Fernandez-Fernandez et.al. |
2402.00677v1 |
null |
2024-02-01 |
SA-MDKIF: A Scalable and Adaptable Medical Domain Knowledge Injection Framework for Large Language Models |
Tianhan Xu et.al. |
2402.00474v1 |
null |
2024-01-31 |
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research |
Luca Soldaini et.al. |
2402.00159v1 |
link |
2024-01-31 |
Entity Linking in the Job Market Domain |
Mike Zhang et.al. |
2401.17979v1 |
link |
2024-01-31 |
SNNLP: Energy-Efficient Natural Language Processing Using Spiking Neural Networks |
R. Alexander Knipper et.al. |
2401.17911v1 |
link |
2024-01-31 |
Employing Label Models on ChatGPT Answers Improves Legal Text Entailment Performance |
Chau Nguyen et.al. |
2401.17897v1 |
null |
2024-01-31 |
Document Structure in Long Document Transformers |
Jan Buchmann et.al. |
2401.17658v1 |
null |
2024-01-31 |
Assertion Detection Large Language Model In-context Learning LoRA Fine-tuning |
Yuelyu Ji et.al. |
2401.17602v1 |
link |
2024-01-31 |
Scavenging Hyena: Distilling Transformers into Long Convolution Models |
Tokiniaina Raharison Ralambomihanta et.al. |
2401.17574v1 |
null |
2024-01-31 |
Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels |
Negar Arabzadeh et.al. |
2401.17543v1 |
null |
2024-01-30 |
Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks |
Savas Yildirim et.al. |
2401.17396v1 |
null |
2024-01-30 |
Gazetteer-Enhanced Bangla Named Entity Recognition with BanglaBERT Semantic Embeddings K-Means-Infused CRF Model |
Niloy Farhan et.al. |
2401.17206v1 |
link |
2024-01-30 |
Large Language Model Evaluation via Matrix Entropy |
Lai Wei et.al. |
2401.17139v1 |
link |
2024-01-30 |
SAL-PIM: A Subarray-level Processing-in-Memory Architecture with LUT-based Linear Interpolation for Transformer-based Text Generation |
Wontak Han et.al. |
2401.17005v1 |
null |
2024-01-30 |
SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics |
Takaaki Saeki et.al. |
2401.16812v1 |
link |
2024-01-30 |
Engineering A Large Language Model From Scratch |
Abiodun Finbarrs Oketunji et.al. |
2401.16736v1 |
null |
2024-01-30 |
TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese |
Nicholas Kluge Corrêa et.al. |
2401.16640v1 |
link |
2024-01-30 |
Breaking Free Transformer Models: Task-specific Context Attribution Promises Improved Generalizability Without Fine-tuning Pre-trained LLMs |
Stepan Tytarenko et.al. |
2401.16638v1 |
link |
2024-01-29 |
Dynamic Electro-Optic Analog Memory for Neuromorphic Photonic Computing |
Sean Lam et.al. |
2401.16515v1 |
null |
2024-01-29 |
ViLexNorm: A Lexical Normalization Corpus for Vietnamese Social Media Text |
Thanh-Nhi Nguyen et.al. |
2401.16403v1 |
link |
2024-01-29 |
CO2: Efficient Distributed Training with Full Communication-Computation Overlap |
Weigao Sun et.al. |
2401.16265v1 |
link |
2024-01-29 |
Towards Red Teaming in Multimodal and Multilingual Translation |
Christophe Ropers et.al. |
2401.16247v1 |
null |
2024-01-29 |
A Survey on Structure-Preserving Graph Transformers |
Van Thuy Hoang et.al. |
2401.16176v1 |
null |
2024-01-29 |
E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models |
Jinchang Hou et.al. |
2401.15927v1 |
link |
2024-01-29 |
Unrestricted Error-Type Codebook Generation for Error Correction Code in DNA Storage Inspired by NLP |
Yi Lu et.al. |
2401.15915v1 |
link |
2024-01-29 |
DrBERT: Unveiling the Potential of Masked Language Modeling Decoder in BERT pretraining |
Wen Liang et.al. |
2401.15861v1 |
null |
2024-01-28 |
Fine-Tuned Large Language Models for Symptom Recognition from Spanish Clinical Text |
Mai A. Shaaban et.al. |
2401.15780v1 |
null |
2024-01-27 |
FloodLense: A Framework for ChatGPT-based Real-time Flood Detection |
Pranath Reddy Kumbam et.al. |
2401.15501v1 |
null |
2024-01-27 |
A Survey on Data Augmentation in Large Model Era |
Yue Zhou et.al. |
2401.15422v1 |
link |
2024-01-26 |
SliceGPT: Compress Large Language Models by Deleting Rows and Columns |
Saleh Ashkboos et.al. |
2401.15024v1 |
link |
2024-01-26 |
Memory-Inspired Temporal Prompt Interaction for Text-Image Classification |
Xinyao Yu et.al. |
2401.14856v1 |
null |
2024-01-26 |
Adaptive Point Transformer |
Alessandro Baiocchi et.al. |
2401.14845v1 |
null |
2024-01-26 |
ChemDFM: Dialogue Foundation Model for Chemistry |
Zihan Zhao et.al. |
2401.14818v1 |
null |
2024-01-26 |
Large Language Model Adaptation for Financial Sentiment Analysis |
Pau Rodriguez Inserte et.al. |
2401.14777v1 |
null |
2024-01-26 |
Topology-Aware Exploration of Energy-Based Models Equilibrium: Toric QC-LDPC Codes and Hyperbolic MET QC-LDPC Codes |
Vasiliy Usatyuk et.al. |
2401.14749v1 |
null |
2024-01-26 |
Listening to the Voices: Describing Ethical Caveats of Conversational User Interfaces According to Experts and Frequent Users |
Thomas Mildner et.al. |
2401.14746v1 |
null |
2024-01-26 |
An Empirical Investigation of Domain Adaptation Ability for Chinese Spelling Check Models |
Xi Wang et.al. |
2401.14630v1 |
null |
2024-01-25 |
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation |
Gökçe Uludoğan et.al. |
2401.14373v1 |
link |
2024-01-25 |
Topologies of Reasoning: Demystifying Chains, Trees, and Graphs of Thoughts |
Maciej Besta et.al. |
2401.14295v1 |
null |
2024-01-25 |
Improving Natural Language Capability of Code Large Language Model |
Wei Li et.al. |
2401.14242v1 |
link |
2024-01-25 |
Parameter-Efficient Conversational Recommender System as a Language Processing Task |
Mathieu Ravaut et.al. |
2401.14194v1 |
link |
2024-01-25 |
How Can Large Language Models Understand Spatial-Temporal Data? |
Lei Liu et.al. |
2401.14192v1 |
null |
2024-01-25 |
Convolutional Neural Networks can achieve binary bail judgement classification |
Amit Barman et.al. |
2401.14135v1 |
null |
2024-01-25 |
(Chat)GPT v BERT: Dawn of Justice for Semantic Change Detection |
Francesco Periti et.al. |
2401.14040v1 |
link |
2024-01-25 |
Accelerating Retrieval-Augmented Language Model Serving with Speculation |
Zhihao Zhang et.al. |
2401.14021v1 |
null |
2024-01-25 |
ChatGPT and Human Synergy in Black-Box Testing: A Comparative Analysis |
Hiroyuki Kirinuki et.al. |
2401.13924v1 |
null |
2024-01-24 |
Investigating the Efficacy of Large Language Models for Code Clone Detection |
Mohamad Khajezade et.al. |
2401.13802v1 |
link |
2024-01-24 |
CNN architecture extraction on edge GPU |
Peter Horvath et.al. |
2401.13575v1 |
null |
2024-01-24 |
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation |
Zhaohu Xing et.al. |
2401.13560v1 |
link |
2024-01-24 |
Research about the Ability of LLM in the Tamper-Detection Area |
Xinyu Yang et.al. |
2401.13504v1 |
null |
2024-01-24 |
Text Categorization Can Enhance Domain-Agnostic Stopword Extraction |
Houcemeddine Turki et.al. |
2401.13398v1 |
null |
2024-01-24 |
MaLA-500: Massive Language Adaptation of Large Language Models |
Peiqin Lin et.al. |
2401.13303v1 |
null |
2024-01-24 |
SpecLLM: Exploring Generation and Review of VLSI Design Specification with Large Language Model |
Mengming Li et.al. |
2401.13266v1 |
link |
2024-01-24 |
From Random to Informed Data Selection: A Diversity-Based Approach to Optimize Human Annotation and Few-Shot Learning |
Alexandre Alcoforado et.al. |
2401.13229v1 |
null |
2024-01-23 |
Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains |
Yu Zhang et.al. |
2401.13129v1 |
link |
2024-01-23 |
Free Form Medical Visual Question Answering in Radiology |
Abhishek Narayanan et.al. |
2401.13081v1 |
null |
2024-01-23 |
From Understanding to Utilization: A Survey on Explainability for Large Language Models |
Haoyan Luo et.al. |
2401.12874v1 |
null |
2024-01-23 |
KAM-CoT: Knowledge Augmented Multimodal Chain-of-Thoughts Reasoning |
Debjyoti Mondal et.al. |
2401.12863v1 |
null |
2024-01-23 |
Benchmarking LLMs via Uncertainty Quantification |
Fanghua Ye et.al. |
2401.12794v1 |
link |
2024-01-23 |
A Comprehensive View of the Biases of Toxicity and Sentiment Analysis Methods Towards Utterances with African American English Expressions |
Guilherme H. Resende et.al. |
2401.12720v1 |
null |
2024-01-23 |
From Numbers to Words: Multi-Modal Bankruptcy Prediction Using the ECL Dataset |
Henri Arno et.al. |
2401.12652v1 |
link |
2024-01-23 |
Key Information Retrieval to Classify the Unstructured Data Content of Preferential Trade Agreements |
Jiahui Zhao et.al. |
2401.12520v1 |
null |
2024-01-23 |
Digital cloning of online social networks for language-sensitive agent-based modeling of misinformation spread |
Prateek Puri et.al. |
2401.12509v1 |
null |
2024-01-23 |
Comparing Human-Centered Language Modeling: Is it Better to Model Groups, Individual Traits, or Both? |
Nikita Soni et.al. |
2401.12492v1 |
null |
2024-01-23 |
Assessing and Understanding Creativity in Large Language Models |
Yunpu Zhao et.al. |
2401.12491v1 |
null |
2024-01-23 |
Contrastive Learning in Distilled Models |
Valerie Lim et.al. |
2401.12472v1 |
link |
2024-01-22 |
Temporal Blind Spots in Large Language Models |
Jonas Wallat et.al. |
2401.12078v1 |
link |
2024-01-22 |
NLP-based Relation Extraction Methods in RE |
Quim Motger et.al. |
2401.12075v1 |
null |
2024-01-22 |
Cross-lingual Transfer Learning for Javanese Dependency Parsing |
Fadli Aulawi Al Ghiffari et.al. |
2401.12072v1 |
null |
2024-01-22 |
Synergizing Machine Learning & Symbolic Methods: A Survey on Hybrid Approaches to Natural Language Processing |
Rrubaa Panchendrarajan et.al. |
2401.11972v1 |
null |
2024-01-22 |
Knowledge Navigation: Inferring the Interlocking Map of Knowledge from Research Trajectories |
Shibing Xiang et.al. |
2401.11742v1 |
link |
2024-01-22 |
Revolutionizing Finance with LLMs: An Overview of Applications and Insights |
Huaqin Zhao et.al. |
2401.11641v1 |
null |
2024-01-21 |
Simple Domain Adaptation for Sparse Retrievers |
Mathias Vast et.al. |
2401.11509v1 |
null |
2024-01-21 |
Integration of Large Language Models in Control of EHD Pumps for Precise Color Synthesis |
Yanhong Peng et.al. |
2401.11500v1 |
null |
2024-01-21 |
Towards Better Inclusivity: A Diverse Tweet Corpus of English Varieties |
Nhi Pham et.al. |
2401.11487v1 |
link |
2024-01-21 |
AttentionLego: An Open-Source Building Block For Spatially-Scalable Large Language Model Accelerator With Processing-In-Memory Technology |
Rongqing Cong et.al. |
2401.11459v1 |
null |
2024-01-19 |
Advancements in eHealth Data Analytics through Natural Language Processing and Deep Learning |
Elena-Simona Apostol et.al. |
2401.10850v1 |
null |
2024-01-19 |
Data Augmentation for Traffic Classification |
Chao Wang et.al. |
2401.10754v1 |
null |
2024-01-19 |
Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models |
Mayank Agarwal et.al. |
2401.10716v1 |
null |
2024-01-19 |
Attentive Fusion: A Transformer-based Approach to Multimodal Hate Speech Detection |
Atanu Mandal et.al. |
2401.10653v1 |
link |
2024-01-19 |
The "Colonial Impulse" of Natural Language Processing: An Audit of Bengali Sentiment Analysis Tools and Their Identity-based Biases |
Dipto Das et.al. |
2401.10535v1 |
null |
2024-01-18 |
Learning High-Quality and General-Purpose Phrase Representations |
Lihu Chen et.al. |
2401.10407v1 |
link |
2024-01-18 |
Supervised Fine-tuning in turn Improves Visual Foundation Models |
Xiaohu Jiang et.al. |
2401.10222v1 |
link |
2024-01-18 |
Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap |
Xingyu Wu et.al. |
2401.10034v1 |
null |
2024-01-18 |
Framing Analysis of Health-Related Narratives: Conspiracy versus Mainstream Media |
Markus Reiter-Haas et.al. |
2401.10030v1 |
null |
2024-01-19 |
Better Explain Transformers by Illuminating Important Information |
Linxin Song et.al. |
2401.09972v2 |
link |
2024-01-18 |
A Survey on Hardware Accelerators for Large Language Models |
Christoforos Kachris et.al. |
2401.09890v1 |
link |
2024-01-18 |
Decades of Transformation: Evolution of the NASA Astrophysics Data System's Infrastructure |
Alberto Accomazzi et.al. |
2401.09685v1 |
null |
2024-01-17 |
Learning Shortcuts: On the Misleading Promise of NLU in Language Models |
Geetanjali Bihani et.al. |
2401.09615v1 |
null |
2024-01-17 |
BERTologyNavigator: Advanced Question Answering with BERT-based Semantics |
Shreya Rajpal et.al. |
2401.09553v1 |
null |
2024-01-17 |
Learning from Emotions, Demographic Information and Implicit User Feedback in Task-Oriented Document-Grounded Dialogues |
Dominic Petrak et.al. |
2401.09248v1 |
link |
2024-01-17 |
Dynamic Relation Transformer for Contextual Text Block Detection |
Jiawei Wang et.al. |
2401.09232v1 |
null |
2024-01-17 |
Narratives of Collective Action in YouTube's Discourse on Veganism |
Arianna Pera et.al. |
2401.09210v1 |
link |
2024-01-17 |
LLMs for Relational Reasoning: How Far are We? |
Zhiming Li et.al. |
2401.09042v1 |
null |
2024-01-16 |
EmoLLMs: A Series of Emotional Large Language Models and Annotation Tools for Comprehensive Affective Analysis |
Zhiwei Liu et.al. |
2401.08508v1 |
link |
2024-01-16 |
Content-Aware Tweet Location Inference using Quadtree Spatial Partitioning and Jaccard-Cosine Word Embedding |
Oluwaseun Ajao et.al. |
2401.08506v1 |
null |
2024-01-16 |
Machine Translation with Large Language Models: Prompt Engineering for Persian, English, and Russian Directions |
Nooshin Pourkamali et.al. |
2401.08429v1 |
null |
2024-01-16 |
Cross-lingual neural fuzzy matching for exploiting target-language monolingual corpora in computer-aided translation |
Miquel Esplà-Gomis et.al. |
2401.08374v1 |
link |
2024-01-16 |
Application of LLM Agents in Recruitment: A Novel Framework for Resume Screening |
Chengguang Gan et.al. |
2401.08315v1 |
null |
2024-01-15 |
The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey |
Saurav Pawar et.al. |
2401.07872v1 |
null |
2024-01-15 |
Quantum Transfer Learning for Acceptability Judgements |
Giuseppe Buonaiuto et.al. |
2401.07777v1 |
null |
2024-01-15 |
On the importance of Data Scale in Pretraining Arabic Language Models |
Abbas Ghaddar et.al. |
2401.07760v1 |
link |
2024-01-15 |
Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends |
Yunshi Lan et.al. |
2401.07518v1 |
link |
2024-01-15 |
Developing ChatGPT for Biology and Medicine: A Complete Review of Biomedical Question Answering |
Qing Li et.al. |
2401.07510v1 |
null |
2024-01-15 |
Graph database while computationally efficient filters out quickly the ESG integrated equities in investment management |
Partha Sen et.al. |
2401.07483v1 |
null |
2024-01-15 |
GWPT: A Green Word-Embedding-based POS Tagger |
Chengwei Wei et.al. |
2401.07475v1 |
null |
2024-01-15 |
Leveraging the power of transformers for guilt detection in text |
Abdul Gafar Manuel Meque et.al. |
2401.07414v1 |
null |
2024-01-12 |
Stylometry Analysis of Multi-authored Documents for Authorship and Author Style Change Detection |
Muhammad Tayyab Zamir et.al. |
2401.06752v1 |
null |
2024-01-12 |
Reframing Tax Law Entailment as Analogical Reasoning |
Xinrui Zou et.al. |
2401.06715v1 |
null |
2024-01-12 |
Cyborgs for strategic communication on social media |
Lynnette Hui Xian Ng et.al. |
2401.06582v1 |
null |
2024-01-12 |
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning |
Yutao Zhu et.al. |
2401.06532v1 |
link |
2024-01-12 |
An investigation of structures responsible for gender bias in BERT and DistilBERT |
Thibaud Leteno et.al. |
2401.06495v1 |
null |
2024-01-12 |
Adapting Large Language Models for Document-Level Machine Translation |
Minghao Wu et.al. |
2401.06468v1 |
null |
2024-01-12 |
SamLP: A Customized Segment Anything Model for License Plate Detection |
Haoxuan Ding et.al. |
2401.06374v1 |
link |
2024-01-12 |
MuGI: Enhancing Information Retrieval through Multi-Text Generation Intergration with Large Language Models |
Le Zhang et.al. |
2401.06311v1 |
link |
2024-01-11 |
Axis Tour: Word Tour Determines the Order of Axes in ICA-transformed Embeddings |
Hiroaki Yamagiwa et.al. |
2401.06112v1 |
link |
2024-01-11 |
How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes |
Sabina Elkins et.al. |
2401.05914v1 |
null |
2024-01-11 |
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems |
Tianyu Cui et.al. |
2401.05778v1 |
null |
2024-01-11 |
ConcEPT: Concept-Enhanced Pre-Training for Language Models |
Xintao Wang et.al. |
2401.05669v1 |
null |
2024-01-11 |
Natural Language Processing for Dialects of a Language: A Survey |
Aditya Joshi et.al. |
2401.05632v1 |
null |
2024-01-10 |
TrustLLM: Trustworthiness in Large Language Models |
Lichao Sun et.al. |
2401.05561v1 |
link |
2024-01-10 |
CADgpt: Harnessing Natural Language Processing for 3D Modelling to Enhance Computer-Aided Design Workflows |
Timo Kapsalis et.al. |
2401.05476v1 |
null |
2024-01-10 |
MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector |
Marta R. Costa-jussà et.al. |
2401.05060v1 |
link |
2024-01-09 |
Entity Recognition from Colloquial Text |
Tamara Babaian et.al. |
2401.04853v1 |
null |
2024-01-09 |
MoSECroT: Model Stitching with Static Word Embeddings for Crosslingual Zero-shot Transfer |
Haotian Ye et.al. |
2401.04821v1 |
null |
2024-01-09 |
Phishing Website Detection through Multi-Model Analysis of HTML Content |
Furkan Çolhak et.al. |
2401.04820v1 |
null |
2024-01-10 |
Low-Resource Vision Challenges for Foundation Models |
Yunhua Zhang et.al. |
2401.04716v2 |
null |
2024-01-09 |
TechGPT-2.0: A large language model project to solve the task of knowledge graph construction |
Jiaqi Wang et.al. |
2401.04507v1 |
link |
2024-01-09 |
LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training |
Khoi M. Le et.al. |
2401.04348v1 |
link |
2024-01-09 |
Know Your Needs Better: Towards Structured Understanding of Marketer Demands with Analogical Reasoning Augmented LLMs |
Junjie Wang et.al. |
2401.04319v1 |
link |
2024-01-08 |
Attention versus Contrastive Learning of Tabular Data -- A Data-centric Benchmarking |
Shourav B. Rabbani et.al. |
2401.04266v1 |
null |
2024-01-08 |
Large language models in bioinformatics: applications and perspectives |
Jiajia Liu et.al. |
2401.04155v1 |
null |
2024-01-08 |
Empirical Analysis of Efficient Fine-Tuning Methods for Large Pre-Trained Language Models |
Nigel Doering et.al. |
2401.04051v1 |
null |
2024-01-08 |
IDoFew: Intermediate Training Using Dual-Clustering in Language Models for Few Labels Text Classification |
Abdullah Alsuhaibani et.al. |
2401.04025v1 |
null |
2024-01-08 |
Aligned with LLM: a new multi-modal training paradigm for encoding fMRI activity in visual cortex |
Shuxiao Ma et.al. |
2401.03851v1 |
null |
2024-01-08 |
We Need to Talk About Classification Evaluation Metrics in NLP |
Peter Vickers et.al. |
2401.03831v1 |
null |
2024-01-08 |
Anatomy of Neural Language Models |
Majd Saleh et.al. |
2401.03797v1 |
link |
2024-01-08 |
Overview of the 2023 ICON Shared Task on Gendered Abuse Detection in Indic Languages |
Aatman Vaidya et.al. |
2401.03677v1 |
null |
2024-01-07 |
Is there really a Citation Age Bias in NLP? |
Hoa Nguyen et.al. |
2401.03545v1 |
null |
2024-01-07 |
Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions |
Yichi Zhang et.al. |
2401.03495v1 |
link |
2024-01-07 |
Maintaining Journalistic Integrity in the Digital Age: A Comprehensive NLP Framework for Evaluating Online News Content |
Ljubisa Bojic et.al. |
2401.03467v1 |
null |
2024-01-07 |
Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects |
Yuheng Cheng et.al. |
2401.03428v1 |
link |
2024-01-05 |
Introducing Bode: A Fine-Tuned Large Language Model for Portuguese Prompt-Based Task |
Gabriel Lino Garcia et.al. |
2401.02909v1 |
null |
2024-01-05 |
Nonlinear functional regression by functional deep neural network with kernel embedding |
Zhongjie Shi et.al. |
2401.02890v1 |
null |
2024-01-05 |
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks |
Haoyuan Wu et.al. |
2401.02731v1 |
link |
2024-01-05 |
Beyond Fidelity: Explaining Vulnerability Localization of Learning-based Detectors |
Baijun Cheng et.al. |
2401.02686v1 |
link |
2024-01-05 |
Training and Serving System of Foundation Models: A Comprehensive Survey |
Jiahang Zhou et.al. |
2401.02643v1 |
null |
2024-01-04 |
L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages |
Aishwarya Mirashi et.al. |
2401.02254v1 |
link |
2024-01-04 |
SwitchTab: Switched Autoencoders Are Effective Tabular Learners |
Jing Wu et.al. |
2401.02013v1 |
null |
2024-01-03 |
Mining Temporal Attack Patterns from Cyberthreat Intelligence Reports |
Md Rayhanur Rahman et.al. |
2401.01883v1 |
null |
2024-01-03 |
Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling |
Himmet Toprak Kesgin et.al. |
2401.01830v1 |
null |
2024-01-03 |
Text mining arXiv: a look through quantitative finance papers |
Michele Leonardo Bianchi et.al. |
2401.01751v1 |
null |
2024-01-03 |
Predicting challenge moments from students' discourse: A comparison of GPT-4 to two traditional natural language processing approaches |
Wannapon Suraworachet et.al. |
2401.01692v1 |
null |
2024-01-04 |
Towards a Foundation Purchasing Model: Pretrained Generative Autoregression on Transaction Sequences |
Piotr Skalski et.al. |
2401.01641v2 |
link |
2024-01-03 |
Test-Time Personalization with Meta Prompt for Gaze Estimation |
Huan Liu et.al. |
2401.01577v1 |
link |
2024-01-03 |
Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models |
Rita Frieske et.al. |
2401.01572v1 |
null |
2024-01-03 |
LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training |
Rujiao Long et.al. |
2401.01522v1 |
null |
2024-01-03 |
Practical Guidelines for the Selection and Evaluation of NLP Techniques in RE |
Mehrdad Sabetzadeh et.al. |
2401.01508v1 |
null |
2024-01-03 |
Natural Language Processing and Multimodal Stock Price Prediction |
Kevin Taylor et.al. |
2401.01487v1 |
null |
2024-01-02 |
LLM Harmony: Multi-Agent Communication for Problem Solving |
Sumedh Rasal et.al. |
2401.01312v1 |
link |
2024-01-02 |
Fairness Certification for Natural Language Processing and Large Language Models |
Vincent Freiberger et.al. |
2401.01262v1 |
null |
2024-01-02 |
Imperio: Language-Guided Backdoor Attacks for Arbitrary Model Control |
Ka-Ho Chow et.al. |
2401.01085v1 |
link |
2024-01-02 |
Vietnamese Poem Generation & The Prospect Of Cross-Language Poem-To-Poem Translation |
Triet Huynh Minh et.al. |
2401.01078v1 |
link |
2024-01-02 |
Cheetah: Natural Language Generation for 517 African Languages |
Ife Adebara et.al. |
2401.01053v1 |
null |
2024-01-02 |
Safety and Performance, Why Not Both? Bi-Objective Optimized Model Compression against Heterogeneous Attacks Toward AI Software Deployment |
Jie Zhu et.al. |
2401.00996v1 |
link |
2024-01-01 |
Temporal Validity Change Prediction |
Georg Wenzel et.al. |
2401.00779v1 |
null |
2024-01-01 |
Large language model for Bible sentiment analysis: Sermon on the Mount |
Mahek Vora et.al. |
2401.00689v1 |
link |
2024-01-01 |
Predicting Anti-microbial Resistance using Large Language Models |
Hyunwoo Yoo et.al. |
2401.00642v1 |
null |
2023-12-31 |
Exploring the Effectiveness of Instruction Tuning in Biomedical Language Processing |
Omid Rohanian et.al. |
2401.00579v1 |
null |
2023-12-29 |
Action-Item-Driven Summarization of Long Meeting Transcripts |
Logan Golia et.al. |
2312.17581v1 |
link |
2023-12-29 |
Overview of the PromptCBLUE Shared Task in CHIP2023 |
Wei Zhu et.al. |
2312.17522v1 |
link |
2023-12-28 |
GitAgent: Facilitating Autonomous Agent with GitHub by Tool Extension |
Bohan Lyu et.al. |
2312.17294v1 |
null |
2023-12-29 |
Length Extrapolation of Transformers: A Survey from the Perspective of Position Encoding |
Liang Zhao et.al. |
2312.17044v2 |
null |
2023-12-28 |
Few-shot learning for automated content analysis: Efficient coding of arguments and claims in the debate on arms deliveries to Ukraine |
Jonas Rieger et.al. |
2312.16975v1 |
null |
2023-12-27 |
A proposed new metric for the conceptual diversity of a text |
İlknur Dönmez Phd et.al. |
2312.16548v1 |
null |
2023-12-26 |
Zur Darstellung eines mehrstufigen Prototypbegriffs in der multilingualen automatischen Sprachgenerierung: vom Korpus über word embeddings bis hin zum automatischen Wörterbuch |
María José Domínguez Vázquez et.al. |
2312.16311v1 |
null |
2023-12-26 |
Social-Transmotion: Promptable Human Trajectory Prediction |
Saeed Saadatnejad et.al. |
2312.16168v1 |
link |
2023-12-26 |
Dotless Representation of Arabic Text: Analysis and Modeling |
Maged S. Al-Shaibani et.al. |
2312.16104v1 |
null |
2023-12-26 |
FedMS: Federated Learning with Mixture of Sparsely Activated Foundations Models |
Panlong Wu et.al. |
2312.15926v1 |
null |
2023-12-26 |
Think and Retrieval: A Hypothesis Knowledge Graph Enhanced Medical Large Language Models |
Xinke Jiang et.al. |
2312.15883v1 |
null |
2023-12-26 |
Heterogeneous Encoders Scaling In The Transformer For Neural Machine Translation |
Jia Cheng Hu et.al. |
2312.15872v1 |
null |
2023-12-26 |
Punctuation Matters! Stealthy Backdoor Attack for Language Models |
Xuan Sheng et.al. |
2312.15867v1 |
null |
2023-12-26 |
Hypergraph Enhanced Knowledge Tree Prompt Learning for Next-Basket Recommendation |
Zi-Feng Mai et.al. |
2312.15851v1 |
null |
2023-12-25 |
Design and Implementation of a Tool for Extracting Uzbek Syllables |
Ulugbek Salaev et.al. |
2312.15779v1 |
null |
2023-12-25 |
Large Language Models are Not Stable Recommender Systems |
Tianhui Ma et.al. |
2312.15746v1 |
null |
2023-12-25 |
PersianLLaMA: Towards Building First Persian Large Language Model |
Mohammad Amin Abbasi et.al. |
2312.15713v1 |
null |
2023-12-22 |
YAYI 2: Multilingual Open-Source Large Language Models |
Yin Luo et.al. |
2312.14862v1 |
null |
2023-12-22 |
Large Language Model (LLM) Bias Index -- LLMBI |
Abiodun Finbarrs Oketunji et.al. |
2312.14769v1 |
null |
2023-12-22 |
Zero-shot Causal Graph Extrapolation from Text via LLMs |
Alessandro Antonucci et.al. |
2312.14670v1 |
link |
2023-12-22 |
Training Neural Networks with Internal State, Unconstrained Connectivity, and Discrete Activations |
Alexander Grushin et.al. |
2312.14359v1 |
null |
2023-12-21 |
Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs |
Juraj Vladika et.al. |
2312.13881v1 |
null |
2023-12-21 |
kNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels |
Jiaming Zhou et.al. |
2312.13560v1 |
link |
2023-12-21 |
Empowering Few-Shot Recommender Systems with Large Language Models -- Enhanced Representations |
Zhoumeng Wang et.al. |
2312.13557v1 |
link |
2023-12-20 |
AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation |
Dong Huang et.al. |
2312.13010v1 |
link |
2023-12-20 |
Segmenting Messy Text: Detecting Boundaries in Text Derived from Historical Newspaper Images |
Carol Anderson et.al. |
2312.12773v1 |
null |
2023-12-21 |
A Case Study on Test Case Construction with Large Language Models: Unveiling Practical Insights and Challenges |
Roberto Francisco de Lima Junior et.al. |
2312.12598v2 |
null |
2023-12-19 |
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation |
Sihan Liu et.al. |
2312.12470v1 |
link |
2023-12-19 |
Geo-located Aspect Based Sentiment Analysis (ABSA) for Crowdsourced Evaluation of Urban Environments |
Demircan Tas et.al. |
2312.12253v1 |
null |
2023-12-19 |
Parameter-Efficient Fine-Tuning Methods for Pretrained Language Models: A Critical Review and Assessment |
Lingling Xu et.al. |
2312.12148v1 |
null |
2023-12-19 |
Designing Guiding Principles for NLP for Healthcare: A Case Study of Maternal Health |
Maria Antoniak et.al. |
2312.11803v1 |
link |
2023-12-19 |
MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA |
Lang Yu et.al. |
2312.11795v1 |
link |
2023-12-19 |
MineObserver 2.0: A Deep Learning & In-Game Framework for Assessing Natural Language Descriptions of Minecraft Imagery |
Jay Mahajan et.al. |
2312.11761v1 |
null |
2023-12-18 |
A Heterogeneous Chiplet Architecture for Accelerating End-to-End Transformer Models |
Harsh Sharma et.al. |
2312.11750v1 |
null |
2023-12-18 |
Agent-based Learning of Materials Datasets from Scientific Literature |
Mehrad Ansari et.al. |
2312.11690v1 |
link |
2023-12-18 |
From Generalized Laughter to Personalized Chuckles: Unleashing the Power of Data Fusion in Subjective Humor Detection |
Julita Bielaniewicz et.al. |
2312.11296v1 |
null |
2023-12-18 |
Structure-Preserving Transformers for Learning Parametrized Hamiltonian Systems |
Benedikt Brantner et.al. |
2312.11166v1 |
link |
2023-12-18 |
Efficiency-oriented approaches for self-supervised speech representation learning |
Luis Lugo et.al. |
2312.11142v1 |
null |
2023-12-17 |
Validation of Rigorous Requirements Specifications and Document Automation with the ITLingo RSL Language |
Andre Rodrigues et.al. |
2312.10822v1 |
null |
2023-12-17 |
What Makes Digital Support Effective? How Therapeutic Skills Affect Clinical Well-Being |
Anna Fang et.al. |
2312.10775v1 |
null |
2023-12-17 |
Identification of Knowledge Neurons in Protein Language Models |
Divya Nori et.al. |
2312.10770v1 |
null |
2023-12-17 |
Can persistent homology whiten Transformer-based black-box models? A case study on BERT compression |
Luis Balderas et.al. |
2312.10702v1 |
null |
2023-12-17 |
Cross-Domain Robustness of Transformer-based Keyphrase Generation |
Anna Glazkova et.al. |
2312.10700v1 |
null |
2023-12-17 |
Wikiformer: Pre-training with Structured Information of Wikipedia for Ad-hoc Retrieval |
Weihang Su et.al. |
2312.10661v1 |
link |
2023-12-17 |
Decoding Concerns: Multi-label Classification of Vaccine Sentiments in Social Media |
Somsubhra De et.al. |
2312.10626v1 |
link |
2023-12-16 |
CoCoGen: Physically-Consistent and Conditioned Score-based Generative Models for Forward and Inverse Problems |
Christian Jacobsen et.al. |
2312.10527v1 |
null |
2023-12-16 |
Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning |
Kaiyou Song et.al. |
2312.10457v1 |
link |
2023-12-16 |
An Attentive Inductive Bias for Sequential Recommendation Beyond the Self-Attention |
Yehjin Shin et.al. |
2312.10325v1 |
link |
2023-12-15 |
Faithful Persona-based Conversational Dataset Generation with Large Language Models |
Pegah Jandaghi et.al. |
2312.10007v1 |
link |
2023-12-15 |
LLaMAntino: LLaMA 2 Models for Effective Text Generation in Italian Language |
Pierpaolo Basile et.al. |
2312.09993v1 |
null |
2023-12-15 |
A Novel Dataset for Financial Education Text Simplification in Spanish |
Nelson Perez-Rojas et.al. |
2312.09897v1 |
null |
2023-12-15 |
Deep Unsupervised Domain Adaptation for Time Series Classification: a Benchmark |
Hassan Ismail Fawaz et.al. |
2312.09857v1 |
link |
2023-12-15 |
Algorithms for automatic intents extraction and utterances classification for goal-oriented dialogue systems |
Leonid Legashev et.al. |
2312.09658v1 |
null |
2023-12-15 |
Picking the Underused Heads: A Network Pruning Perspective of Attention Head Selection for Fusing Dialogue Coreference Information |
Zhengyuan Liu et.al. |
2312.09541v1 |
null |
2023-12-15 |
Riveter: Measuring Power and Social Dynamics Between Entities |
Maria Antoniak et.al. |
2312.09536v1 |
link |
2023-12-14 |
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision |
Collin Burns et.al. |
2312.09390v1 |
null |
2023-12-13 |
N-Gram Unsupervised Compoundation and Feature Injection for Better Symbolic Music Understanding |
Jinhao Tian et.al. |
2312.08931v1 |
link |
2023-12-15 |
Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis |
Yafei Hu et.al. |
2312.08782v2 |
null |
2023-12-15 |
VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding |
Yi Xin et.al. |
2312.08733v2 |
null |
2023-12-14 |
A Comparative Analysis of Fine-Tuned LLMs and Few-Shot Learning of LLMs for Financial Sentiment Analysis |
Sorouralsadat Fatemi et.al. |
2312.08725v1 |
null |
2023-12-14 |
ChatSOS: LLM-based knowledge Q&A system for safety engineering |
Haiyang Tang et.al. |
2312.08629v1 |
null |
2023-12-13 |
A Survey of Generative AI for Intelligent Transportation Systems |
Huan Yan et.al. |
2312.08248v1 |
null |
2023-12-13 |
LAMM: Label Alignment for Multi-Modal Prompt Learning |
Jingsheng Gao et.al. |
2312.08212v1 |
link |
2023-12-13 |
Towards Model-Based Data Acquisition for Subjective Multi-Task NLP Problems |
Kamil Kanclerz et.al. |
2312.08198v1 |
link |
2023-12-13 |
CIDR: A Cooperative Integrated Dynamic Refining Method for Minimal Feature Removal Problem |
Qian Chen et.al. |
2312.08157v1 |
link |
2023-12-13 |
Efficient Representation of the Activation Space in Deep Neural Networks |
Tanya Akumu et.al. |
2312.08143v1 |
null |
2023-12-13 |
Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models |
Junhao Zheng et.al. |
2312.07887v1 |
link |
2023-12-13 |
Abusive Span Detection for Vietnamese Narrative Texts |
Nhu-Thanh Nguyen et.al. |
2312.07831v1 |
null |
2023-12-13 |
A Deep Learning-Based System for Automatic Case Summarization |
Minh Duong et.al. |
2312.07824v1 |
null |
2023-12-12 |
Estimation of embedding vectors in high dimensions |
Golara Ahmadi Azar et.al. |
2312.07802v1 |
null |
2023-12-12 |
Sentiment analysis in Tourism: Fine-tuning BERT or sentence embeddings concatenation? |
Ibrahim Bouabdallaoui et.al. |
2312.07797v1 |
null |
2023-12-12 |
MS-Twins: Multi-Scale Deep Self-Attention Networks for Medical Image Segmentation |
Jing Xu et.al. |
2312.07128v1 |
null |
2023-12-12 |
Towards Enhanced Human Activity Recognition through Natural Language Generation and Pose Estimation |
Nikhil Kashyap et.al. |
2312.06965v1 |
null |
2023-12-11 |
Self-supervised Machine Learning Based Approach to Orbit Modelling Applied to Space Traffic Management |
Emma Stevenson et.al. |
2312.06854v1 |
null |
2023-12-11 |
TaCo: Targeted Concept Removal in Output Embeddings for NLP via Information Theory and Explainability |
Fanny Jourdan et.al. |
2312.06499v1 |
link |
2023-12-11 |
Survey on Memory-Augmented Neural Networks: Cognitive Insights to AI Applications |
Savya Khosla et.al. |
2312.06141v1 |
null |
2023-12-11 |
Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need |
Cheng Peng et.al. |
2312.06099v1 |
null |
2023-12-11 |
SECNN: Squeeze-and-Excitation Convolutional Neural Network for Sentence Classification |
Shandong Yuan et.al. |
2312.06088v1 |
null |
2023-12-11 |
IEKG: A Commonsense Knowledge Graph for Idiomatic Expressions |
Ziheng Zeng et.al. |
2312.06053v1 |
link |
2023-12-10 |
Modeling Uncertainty in Personalized Emotion Prediction with Normalizing Flows |
Piotr Miłkowski et.al. |
2312.06034v1 |
link |
2023-12-10 |
Large Language Models on Lexical Semantic Change Detection: An Evaluation |
Ruiyu Wang et.al. |
2312.06002v1 |
null |
2023-12-10 |
Natural Interaction Modalities for Human-CPS Interaction in Construction Progress Monitoring |
Srijeet Halder et.al. |
2312.05988v1 |
null |
2023-12-10 |
FP8-BERT: Post-Training Quantization for Transformer |
Jianwei Li et.al. |
2312.05725v1 |
null |
2023-12-09 |
NLLG Quarterly arXiv Report 09/23: What are the most influential current AI Papers? |
Ran Zhang et.al. |
2312.05688v1 |
link |
2023-12-08 |
HALO: An Ontology for Representing Hallucinations in Generative Models |
Navapat Nananukul et.al. |
2312.05209v1 |
null |
2023-12-08 |
Converting Epics/Stories into Pseudocode using Transformers |
Gaurav Kolhatkar et.al. |
2312.05047v1 |
null |
2023-12-08 |
Illicit Darkweb Classification via Natural-language Processing: Classifying Illicit Content of Webpages based on Textual Information |
Giuseppe Cascavilla et.al. |
2312.04944v1 |
null |
2023-12-08 |
Ophtha-LLaMA2: A Large Language Model for Ophthalmology |
Huan Zhao et.al. |
2312.04906v1 |
null |
2023-12-08 |
How to Determine the Most Powerful Pre-trained Language Model without Brute Force Fine-tuning? An Empirical Survey |
Jun Bai et.al. |
2312.04775v1 |
link |
2023-12-07 |
The Impact of AI Innovations on U.S. Occupations |
Ali Akbar Septiandri et.al. |
2312.04714v1 |
null |
2023-12-07 |
Simul-LLM: A Framework for Exploring High-Quality Simultaneous Translation with Large Language Models |
Victor Agostinelli et.al. |
2312.04691v1 |
link |
2023-12-07 |
PyThaiNLP: Thai Natural Language Processing in Python |
Wannaphong Phatthiyaphaibun et.al. |
2312.04649v1 |
link |
2023-12-07 |
Leveraging Transformer-based Language Models to Automate Requirements Satisfaction Assessment |
Amrit Poudel et.al. |
2312.04463v1 |
null |
2023-12-07 |
CLadder: A Benchmark to Assess Causal Reasoning Capabilities of Language Models |
Zhijing Jin et.al. |
2312.04350v1 |
link |
2023-12-07 |
Beyond Surface: Probing LLaMA Across Scales and Layers |
Nuo Chen et.al. |
2312.04333v1 |
link |
2023-12-07 |
nerblackbox: A High-level Library for Named Entity Recognition in Python |
Felix Stollenwerk et.al. |
2312.04306v1 |
link |
2023-12-07 |
Graph Convolutions Enrich the Self-Attention in Transformers! |
Jeongwhan Choi et.al. |
2312.04234v1 |
null |
2023-12-07 |
CODEX: A Cluster-Based Method for Explainable Reinforcement Learning |
Timothy K. Mathes et.al. |
2312.04216v1 |
link |
2023-12-07 |
Language Model Knowledge Distillation for Efficient Question Answering in Spanish |
Adrián Bazaga et.al. |
2312.04193v1 |
link |
2023-12-07 |
Series2Vec: Similarity-based Self-supervised Representation Learning for Time Series Classification |
Navid Mohammadi Foumani et.al. |
2312.03998v1 |
link |
2023-12-06 |
Collaboration or Corporate Capture? Quantifying NLP's Reliance on Industry Artifacts and Contributions |
Will Aitken et.al. |
2312.03912v1 |
null |
2023-12-07 |
Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers |
Umberto Cappellazzo et.al. |
2312.03694v2 |
link |
2023-12-06 |
KhabarChin: Automatic Detection of Important News in the Persian Language |
Hamed Hematian Hemati et.al. |
2312.03361v1 |
link |
2023-12-06 |
Measuring Misogyny in Natural Language Generation: Preliminary Results from a Case Study on two Reddit Communities |
Aaron J. Snoswell et.al. |
2312.03330v1 |
null |
2023-12-06 |
Detecting Rumor Veracity with Only Textual Information by Double-Channel Structure |
Alex Kim et.al. |
2312.03195v1 |
null |
2023-12-06 |
Corporate Bankruptcy Prediction with Domain-Adapted BERT |
Alex Kim et.al. |
2312.03194v1 |
null |
2023-12-05 |
Inherent limitations of LLMs regarding spatial information |
He Yan et.al. |
2312.03042v1 |
link |
2023-12-05 |
Concept Drift Adaptation in Text Stream Mining Settings: A Comprehensive Review |
Cristiano Mesquita Garcia et.al. |
2312.02901v1 |
null |
2023-12-05 |
Large Language Models on Graphs: A Comprehensive Survey |
Bowen Jin et.al. |
2312.02783v1 |
link |
2023-12-05 |
Empathy and Distress Detection using Ensembles of Transformer Models |
Tanmay Chavan et.al. |
2312.02578v1 |
null |
2023-12-05 |
Towards More Unified In-context Visual Understanding |
Dianmo Sheng et.al. |
2312.02520v1 |
null |
2023-12-05 |
MKA: A Scalable Medical Knowledge Assisted Mechanism for Generative Models on Medical Conversation Tasks |
Ke Liang et.al. |
2312.02496v1 |
link |
2023-12-04 |
Measuring Distributional Shifts in Text: The Advantage of Language Model-Based Embeddings |
Gyandev Gupta et.al. |
2312.02337v1 |
null |
2023-12-04 |
Revisiting Topic-Guided Language Models |
Carolina Zheng et.al. |
2312.02331v1 |
link |
2023-12-04 |
LLMs Accelerate Annotation for Medical Information Extraction |
Akshay Goel et.al. |
2312.02296v1 |
null |
2023-12-04 |
TPPoet: Transformer-Based Persian Poem Generation using Minimal Data and Advanced Decoding Techniques |
Amir Panahandeh et.al. |
2312.02125v1 |
null |
2023-12-04 |
Wild-Tab: A Benchmark For Out-Of-Distribution Generalization In Tabular Regression |
Sergey Kolesnikov et.al. |
2312.01792v1 |
null |
2023-12-04 |
Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites |
Lei Wang et.al. |
2312.01701v1 |
link |
2023-12-04 |
AGD: an Auto-switchable Optimizer using Stepwise Gradient Difference for Preconditioning Matrix |
Yun Yue et.al. |
2312.01658v1 |
link |
2023-12-03 |
AI-Powered Arabic Crossword Puzzle Generation for Educational Applications |
Kamyar Zeinalipour et.al. |
2312.01339v1 |
null |
2023-12-03 |
NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian |
Peng Liu et.al. |
2312.01314v1 |
null |
2023-12-03 |
Multiscale Topology in Interactomic Network: From Transcriptome to Antiaddiction Drug Repurposing |
Hongyan Du et.al. |
2312.01272v1 |
null |
2023-12-02 |
Enabling Quantum Natural Language Processing for Hindi Language |
Naman Srivastava et.al. |
2312.01221v1 |
null |
2023-12-02 |
Understanding Opinions Towards Climate Change on Social Media |
Yashaswi Pupneja et.al. |
2312.01217v1 |
null |
2023-12-02 |
From Voices to Validity: Leveraging Large Language Models (LLMs) for Textual Analysis of Policy Stakeholder Interviews |
Alex Liu et.al. |
2312.01202v1 |
null |
2023-12-01 |
Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals |
Tam Nguyen et.al. |
2312.00751v1 |
null |
2023-12-01 |
Infrared Image Super-Resolution via GAN |
Yongsong Huang et.al. |
2312.00689v1 |
null |
2023-12-01 |
Towards Transparency in Coreference Resolution: A Quantum-Inspired Approach |
Hadi Wazni et.al. |
2312.00688v1 |
link |
2023-12-01 |
Contextualized word senses: from attention to compositionality |
Pablo Gamallo et.al. |
2312.00680v1 |
null |
2023-12-01 |
Nonparametric Variational Regularisation of Pretrained Transformers |
Fabio Fehr et.al. |
2312.00662v1 |
null |
2023-12-01 |
CoLLiE: Collaborative Training of Large Language Models in an Efficient Way |
Kai Lv et.al. |
2312.00407v1 |
link |
2023-11-30 |
Towards Unsupervised Representation Learning: Learning, Evaluating and Transferring Visual Representations |
Bonifaz Stuhr et.al. |
2312.00101v1 |
link |
2023-11-30 |
Introducing Rhetorical Parallelism Detection: A New Task with Datasets, Metrics, and Baselines |
Stephen Bothwell et.al. |
2312.00100v1 |
link |
2023-11-30 |
CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation |
Pei Ke et.al. |
2311.18702v1 |
link |
2023-11-30 |
ESG Accountability Made Easy: DocQA at Your Service |
Lokesh Mishra et.al. |
2311.18481v1 |
null |
2023-11-30 |
Lessons from Building CodeBuddy: A Contextualized AI Coding Assistant |
gustavo Pinto et.al. |
2311.18450v1 |
null |
2023-11-30 |
Automatic Construction of a Korean Toxic Instruction Dataset for Ethical Tuning of Large Language Models |
Sungjoo Byun et.al. |
2311.18215v1 |
null |
2023-11-29 |
Uncertainty Guided Global Memory Improves Multi-Hop Question Answering |
Alsu Sagirova et.al. |
2311.18151v1 |
link |
2023-11-29 |
Mukhyansh: A Headline Generation Dataset for Indic Languages |
Lokesh Madasu et.al. |
2311.17743v1 |
link |
2023-11-29 |
AviationGPT: A Large Language Model for the Aviation Domain |
Liya Wang et.al. |
2311.17686v1 |
null |
2023-11-29 |
Introduction to Transformers: an NLP Perspective |
Tong Xiao et.al. |
2311.17633v1 |
link |
2023-11-29 |
Model Performance Prediction for Hyperparameter Optimization of Deep Learning Models Using High Performance Computing and Quantum Annealing |
Juan Pablo García Amboage et.al. |
2311.17508v1 |
null |
2023-11-30 |
Grounding Foundation Models through Federated Transfer Learning: A General Framework |
Yan Kang et.al. |
2311.17431v2 |
null |
2023-11-29 |
Improving the Robustness of Transformer-based Large Language Models with Dynamic Attention |
Lujia Shen et.al. |
2311.17400v1 |
null |
2023-11-29 |
Are Large Language Models Good Fact Checkers: A Preliminary Study |
Han Cao et.al. |
2311.17355v1 |
null |
2023-11-29 |
A natural language processing-based approach: mapping human perception by understanding deep semantic features in street view images |
Haoran Ma et.al. |
2311.17354v1 |
null |
2023-11-29 |
Elo Uncovered: Robustness and Best Practices in Language Model Evaluation |
Meriem Boubdir et.al. |
2311.17295v1 |
null |
2023-11-28 |
Quantifying the redundancy between prosody and text |
Lukas Wolf et.al. |
2311.17233v1 |
link |
2023-11-28 |
Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis |
Aman Yadav et.al. |
2311.16965v1 |
null |
2023-11-28 |
A Benchmark for Evaluating Machine Translation Metrics on Dialects Without Standard Orthography |
Noëmi Aepli et.al. |
2311.16865v1 |
link |
2023-11-28 |
The curse of language biases in remote sensing VQA: the role of spatial attributes, language diversity, and the need for clear evaluation |
Christel Chappuis et.al. |
2311.16782v1 |
null |
2023-11-28 |
RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement |
Longhui Zhang et.al. |
2311.16720v1 |
link |
2023-11-28 |
Large Language Models Meet Computer Vision: A Brief Survey |
Raby Hamadi et.al. |
2311.16673v1 |
null |
2023-11-28 |
MedGen: A Python Natural Language Processing Toolkit for Medical Text Processing |
Rui Yang et.al. |
2311.16588v1 |
link |
2023-11-28 |
Graph Prompt Learning: A Comprehensive Survey and Beyond |
Xiangguo Sun et.al. |
2311.16534v1 |
link |
2023-11-27 |
Leveraging deep active learning to identify low-resource mobility functioning information in public clinical notes |
Tuan-Dung Le et.al. |
2311.15946v1 |
null |
2023-11-27 |
PIPE : Parallelized Inference Through Post-Training Quantization Ensembling of Residual Expansions |
Edouard Yvinec et.al. |
2311.15806v1 |
null |
2023-11-27 |
Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs |
Simone Conia et.al. |
2311.15781v1 |
link |
2023-11-27 |
Knowledge Unlearning for LLMs: Tasks, Methods, and Challenges |
Nianwen Si et.al. |
2311.15766v1 |
null |
2023-11-27 |
Italian Crossword Generator: Enhancing Education through Interactive Word Puzzles |
Kamyar Zeinalipour et.al. |
2311.15723v1 |
null |
2023-11-27 |
Cerbero-7B: A Leap Forward in Language-Specific LLMs Through Enhanced Chat Corpus Generation and Evaluation |
Federico A. Galatolo et.al. |
2311.15698v1 |
link |
2023-11-27 |
RoboGPT: an intelligent agent of making embodied long-term decisions for daily instruction tasks |
Yaran Chen et.al. |
2311.15649v1 |
null |
2023-11-27 |
Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text |
Finbarrs Oketunji et.al. |
2311.15565v1 |
null |
2023-11-27 |
A Comparative and Experimental Study on Automatic Question Answering Systems and its Robustness against Word Jumbling |
Shashidhar Reddy Javaji et.al. |
2311.15513v1 |
null |
2023-11-26 |
Local Convergence of Approximate Newton Method for Two Layer Nonlinear Regression |
Zhihang Li et.al. |
2311.15390v1 |
null |
2023-11-24 |
GPT Struct Me: Probing GPT Models on Narrative Entity Extraction |
Hugo Sousa et.al. |
2311.14583v1 |
link |
2023-11-24 |
CMed-GPT: Prompt Tuning for Entity-Aware Chinese Medical Dialogue Generation |
Zhijie Qu et.al. |
2311.14539v1 |
null |
2023-11-24 |
Narratives from GPT-derived Networks of News, and a link to Financial Markets Dislocations |
Deborah Miori et.al. |
2311.14419v1 |
null |
2023-11-24 |
LLamol: A Dynamic Multi-Conditional Generative Transformer for De Novo Molecular Design |
Niklas Dobberstein et.al. |
2311.14407v1 |
link |
2023-11-24 |
Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs |
Shengyin Sun et.al. |
2311.14324v1 |
null |
2023-11-24 |
Cosine Similarity Knowledge Distillation for Individual Class Information Transfer |
Gyeongdo Ham et.al. |
2311.14307v1 |
null |
2023-11-23 |
Uncovering Gender Stereotypes in Video Game Character Designs: A Multi-Modal Analysis of Honor of Kings |
Bingqing Liu et.al. |
2311.14226v1 |
null |
2023-11-23 |
Towards Explainable Strategy Templates using NLP Transformers |
Pallavi Bagga et.al. |
2311.14061v1 |
null |
2023-11-23 |
Efficient Trigger Word Insertion |
Yueqi Zeng et.al. |
2311.13957v1 |
null |
2023-11-22 |
Comparison of pipeline, sequence-to-sequence, and GPT models for end-to-end relation extraction: experiments with the rare disease use-case |
Shashank Gupta et.al. |
2311.13729v1 |
link |
2023-11-22 |
A Survey of Serverless Machine Learning Model Inference |
Kamil Kojs et.al. |
2311.13587v1 |
null |
2023-11-22 |
Machine Translation to Control Formality Features in the Target Language |
Harshita Tyagi et.al. |
2311.13475v1 |
null |
2023-11-22 |
Confidant: Customizing Transformer-based LLMs via Collaborative Edge Training |
Yuhao Chen et.al. |
2311.13381v1 |
null |
2023-11-22 |
Combatting Human Trafficking in the Cyberspace: A Natural Language Processing-Based Methodology to Analyze the Language in Online Advertisements |
Alejandro Rodriguez Perez et.al. |
2311.13118v1 |
null |
2023-11-21 |
A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs |
Jiageng Zhong et.al. |
2311.12893v1 |
null |
2023-11-21 |
Alpha Zero for Physics: Application of Symbolic Regression with Alpha Zero to find the analytical methods in physics |
Yoshihiro Michishita et.al. |
2311.12713v1 |
null |
2023-11-21 |
MathGloss: Building mathematical glossaries from text |
Lucy Horowitz et.al. |
2311.12649v1 |
link |
2023-11-21 |
Classification of Tabular Data by Text Processing |
Keshav Ramani et.al. |
2311.12521v1 |
null |
2023-11-21 |
Extracting Definienda in Mathematical Scholarly Articles with Transformers |
Shufan Jiang et.al. |
2311.12448v1 |
link |
2023-11-21 |
A Survey on Large Language Models for Personalized and Explainable Recommendations |
Junyi Chen et.al. |
2311.12338v1 |
null |
2023-11-21 |
AcademicGPT: Empowering Academic Research |
Shufa Wei et.al. |
2311.12315v1 |
null |
2023-11-21 |
ATLANTIC: Structure-Aware Retrieval-Augmented Language Model for Interdisciplinary Science |
Sai Munikoti et.al. |
2311.12289v1 |
null |
2023-11-21 |
Equipping Pretrained Unconditional Music Transformers with Instrument and Genre Controls |
Weihan Xu et.al. |
2311.12257v1 |
null |
2023-11-20 |
Applications of Large Scale Foundation Models for Autonomous Driving |
Yu Huang et.al. |
2311.12144v1 |
null |
2023-11-20 |
Generating Valid and Natural Adversarial Examples with Large Language Models |
Zimu Wang et.al. |
2311.11861v1 |
null |
2023-11-20 |
Web News Timeline Generation with Extended Task Prompting |
Sha Wang et.al. |
2311.11652v1 |
null |
2023-11-20 |
Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks |
Ling Luo et.al. |
2311.11608v1 |
link |
2023-11-20 |
Exploring Prompting Large Language Models as Explainable Metrics |
Ghazaleh Mahmoudi et.al. |
2311.11552v1 |
link |
2023-11-20 |
Which AI Technique Is Better to Classify Requirements? An Experiment with SVM, LSTM, and ChatGPT |
Abdelkarim El-Hajjami et.al. |
2311.11547v1 |
null |
2023-11-20 |
ADAPTER-RL: Adaptation of Any Agent using Reinforcement Learning |
Yizhao Jin et.al. |
2311.11537v1 |
null |
2023-11-19 |
Self-Distilled Representation Learning for Time Series |
Felix Pieper et.al. |
2311.11335v1 |
null |
2023-11-19 |
Portuguese FAQ for Financial Services |
Paulo Finardi et.al. |
2311.11331v1 |
null |
2023-11-19 |
Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters |
Yinghui Li et.al. |
2311.11268v1 |
link |
2023-11-18 |
Hate speech and hate crimes: a data-driven study of evolving discourse around marginalized groups |
Malvina Bozhidarova et.al. |
2311.11163v1 |
link |
2023-11-17 |
Detection of Offensive and Threatening Online Content in a Low Resource Language |
Fatima Muhammad Adam et.al. |
2311.10541v1 |
null |
2023-11-17 |
ReuseSense: With Great Reuse Comes Greater Efficiency; Effectively Employing Computation Reuse on General-Purpose CPUs |
Nitesh Narayana GS et.al. |
2311.10487v1 |
null |
2023-11-17 |
Sinhala-English Word Embedding Alignment: Introducing Datasets and Benchmark for a Low Resource Language |
Kasun Wickramasinghe et.al. |
2311.10436v1 |
null |
2023-11-17 |
Causal Graph in Language Model Rediscovers Cortical Hierarchy in Human Narrative Processing |
Zhengqi He et.al. |
2311.10431v1 |
null |
2023-11-16 |
The Analysis and Extraction of Structure from Organizational Charts |
Nikhil Manali et.al. |
2311.10234v1 |
null |
2023-11-16 |
Revolutionizing Customer Interactions: Insights and Challenges in Deploying ChatGPT and Generative Chatbots for FAQs |
Feriel Khennouche et.al. |
2311.09976v1 |
null |
2023-11-16 |
OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking |
Chia-Hsuan Lee et.al. |
2311.09758v1 |
null |
2023-11-16 |
Trustworthy Large Models in Vision: A Survey |
Ziyan Guo et.al. |
2311.09680v1 |
null |
2023-11-17 |
FunctionMarker: Watermarking Language Datasets via Knowledge Injection |
Shuai Li et.al. |
2311.09535v2 |
null |
2023-11-16 |
AMRFact: Enhancing Summarization Factuality Evaluation with AMR-driven Training Data Generation |
Haoyi Qiu et.al. |
2311.09521v1 |
link |
2023-11-16 |
Atoms as Words: A Novel Approach to Deciphering Material Properties using NLP-inspired Machine Learning on Crystallographic Information Files (CIFs) |
Lalit Yadav et.al. |
2311.09508v1 |
null |
2023-11-16 |
SegMix: A Simple Structure-Aware Data Augmentation Method |
Yuxin Pei et.al. |
2311.09505v1 |
null |
2023-11-16 |
Show Your Work with Confidence: Confidence Bands for Tuning Curves |
Nicholas Lourie et.al. |
2311.09480v1 |
link |
2023-11-15 |
Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science |
Sridevi Wagle et.al. |
2311.09358v1 |
link |
2023-11-15 |
Mind's Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models |
Weize Liu et.al. |
2311.09214v1 |
link |
2023-11-15 |
Exploring the Potential of Large Language Models in Computational Argumentation |
Guizhen Chen et.al. |
2311.09022v1 |
link |
2023-11-15 |
Large Language Models are legal but they are not: Making the case for a powerful LegalLLM |
Thanmay Jayakumar et.al. |
2311.08890v1 |
null |
2023-11-15 |
Thread of Thought Unraveling Chaotic Contexts |
Yucheng Zhou et.al. |
2311.08734v1 |
null |
2023-11-15 |
Enabling CMF Estimation in Data-Constrained Scenarios: A Semantic-Encoding Knowledge Mining Model |
Yanlin Qi et.al. |
2311.08690v1 |
null |
2023-11-16 |
MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration |
Lin Xu et.al. |
2311.08562v2 |
link |
2023-11-14 |
Natural Language Processing for Financial Regulation |
Ixandra Achitouv et.al. |
2311.08533v1 |
null |
2023-11-14 |
GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer |
Urchade Zaratiana et.al. |
2311.08526v1 |
link |
2023-11-14 |
Functionality learning through specification instructions |
Pedro Henrique Luz de Araujo et.al. |
2311.08481v1 |
null |
2023-11-14 |
A Material Lens on Coloniality in NLP |
William Held et.al. |
2311.08391v1 |
null |
2023-11-14 |
Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration |
Zhenran Xu et.al. |
2311.08152v1 |
link |
2023-11-14 |
How to get better embeddings with code pre-trained models? An empirical study |
Yu Zhao et.al. |
2311.08066v1 |
null |
2023-11-14 |
How Well Do Text Embedding Models Understand Syntax? |
Yan Zhang et.al. |
2311.07996v1 |
link |
2023-11-14 |
How good are Large Language Models on African Languages? |
Jessica Ojo et.al. |
2311.07978v1 |
null |
2023-11-14 |
Towards Improving Robustness Against Common Corruptions in Object Detectors Using Adversarial Contrastive Learning |
Shashank Kotyan et.al. |
2311.07928v1 |
null |
2023-11-13 |
GreekT5: A Series of Greek Sequence-to-Sequence Models for News Summarization |
Nikolaos Giarelis et.al. |
2311.07767v1 |
link |
2023-11-13 |
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks |
Sanchit Ahuja et.al. |
2311.07463v1 |
null |
2023-11-13 |
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4 |
Microsoft Research AI4Science et.al. |
2311.07361v1 |
null |
2023-11-13 |
calamanCy: A Tagalog Natural Language Processing Toolkit |
Lester James V. Miranda et.al. |
2311.07171v1 |
link |
2023-11-13 |
STEER: Unified Style Transfer with Expert Reinforcement |
Skyler Hallinan et.al. |
2311.07167v1 |
link |
2023-11-12 |
Simulating Public Administration Crisis: A Novel Generative Agent-Based Simulation System to Lower Technology Barriers in Social Science Research |
Bushi Xiao et.al. |
2311.06957v1 |
null |
2023-11-12 |
Retrieval and Generative Approaches for a Pregnancy Chatbot in Nepali with Stemmed and Non-Stemmed Data : A Comparative Study |
Sujan Poudel et.al. |
2311.06898v1 |
null |
2023-11-12 |
GIELLM: Japanese General Information Extraction Large Language Model Utilizing Mutual Reinforcement Effect |
Chengguang Gan et.al. |
2311.06838v1 |
null |
2023-11-12 |
Explainability of Vision Transformers: A Comprehensive Review and New Perspectives |
Rojina Kashefi et.al. |
2311.06786v1 |
null |
2023-11-12 |
Detecting and Correcting Hate Speech in Multimodal Memes with Large Visual Language Model |
Minh-Hao Van et.al. |
2311.06737v1 |
null |
2023-11-12 |
Simple and Effective Input Reformulations for Translation |
Brian Yu et.al. |
2311.06696v1 |
link |
2023-11-10 |
BanglaBait: Semi-Supervised Adversarial Approach for Clickbait Detection on Bangla Clickbait Dataset |
Md. Motahar Mahtab et.al. |
2311.06204v1 |
link |
2023-11-10 |
Is it indeed bigger better? The comprehensive study of claim detection LMs applied for disinformation tackling |
Martin Hyben et.al. |
2311.06121v1 |
null |
2023-11-10 |
ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences |
Yuanhe Tian et.al. |
2311.06025v1 |
link |
2023-11-10 |
Hiformer: Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems |
Huan Gui et.al. |
2311.05884v1 |
null |
2023-11-10 |
Exploring Fine-tuning ChatGPT for News Recommendation |
Xinyi Li et.al. |
2311.05850v1 |
null |
2023-11-09 |
Long-Horizon Dialogue Understanding for Role Identification in the Game of Avalon with Large Language Models |
Simon Stepputtis et.al. |
2311.05720v1 |
null |
2023-11-09 |
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions |
Lei Huang et.al. |
2311.05232v1 |
link |
2023-11-09 |
Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization |
Jangwhan Lee et.al. |
2311.05161v1 |
null |
2023-11-09 |
Mental Health Diagnosis in the Digital Age: Harnessing Sentiment Analysis on Social Media Platforms upon Ultra-Sparse Feature Content |
Haijian Shao et.al. |
2311.05075v1 |
null |
2023-11-08 |
Towards Effective Paraphrasing for Information Disguise |
Anmol Agarwal et.al. |
2311.05018v1 |
link |
2023-11-08 |
Interpreting Pretrained Language Models via Concept Bottlenecks |
Zhen Tan et.al. |
2311.05014v1 |
link |
2023-11-08 |
Evaluating Generative Ad Hoc Information Retrieval |
Lukas Gienapp et.al. |
2311.04694v1 |
null |
2023-11-09 |
Evaluating Diverse Large Language Models for Automatic and General Bug Reproduction |
Sungmin Kang et.al. |
2311.04532v2 |
link |
2023-11-08 |
Multi-label and Multi-target Sampling of Machine Annotation for Computational Stance Detection |
Zhengyuan Liu et.al. |
2311.04495v1 |
link |
2023-11-08 |
Twitter Sentiment Analysis of Covid Vacciness |
Wenbo Zhu et.al. |
2311.04479v1 |
null |
2023-11-07 |
Formal Aspects of Language Modeling |
Ryan Cotterell et.al. |
2311.04329v1 |
null |
2023-11-07 |
SpaDeLeF: A Dataset for Hierarchical Classification of Lexical Functions for Collocations in Spanish |
Yevhen Kostiuk et.al. |
2311.04189v1 |
null |
2023-11-07 |
Perturbed examples reveal invariances shared by language models |
Ruchit Rawal et.al. |
2311.04166v1 |
null |
2023-11-07 |
Unveiling Safety Vulnerabilities of Large Language Models |
George Kour et.al. |
2311.04124v1 |
null |
2023-11-07 |
DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding |
Kehinde Ajayi et.al. |
2311.04098v1 |
link |
2023-11-07 |
Personality Style Recognition via Machine Learning: Identifying Anaclitic and Introjective Personality Styles from Patients' Speech |
Semere Kiros Bitew et.al. |
2311.04088v1 |
null |
2023-11-07 |
Cup Curriculum: Curriculum Learning on Model Capacity |
Luca Scharr et.al. |
2311.03956v1 |
link |
2023-11-07 |
Conversations in Galician: a Large Language Model for an Underrepresented Language |
Eliseo Bao et.al. |
2311.03812v1 |
link |
2023-11-07 |
Loss Balancing for Fair Supervised Learning |
Mohammad Mahdi Khalili et.al. |
2311.03714v1 |
link |
2023-11-07 |
Generalization of NLP Models: Notion and Causation |
Aparna Elangovan et.al. |
2311.03663v1 |
null |
2023-11-07 |
Instruct Me More! Random Prompting for Visual In-Context Learning |
Jiahao Zhang et.al. |
2311.03648v1 |
link |
2023-11-06 |
Tackling Concept Shift in Text Classification using Entailment-style Modeling |
Sumegh Roychowdhury et.al. |
2311.03320v1 |
null |
2023-11-06 |
Architectural Sweet Spots for Modeling Human Label Variation by the Example of Argument Quality: It's Best to Relate Perspectives! |
Philipp Heinisch et.al. |
2311.03153v1 |
link |
2023-11-06 |
BanLemma: A Word Formation Dependent Rule and Dictionary Based Bangla Lemmatizer |
Sadia Afrin et.al. |
2311.03078v1 |
link |
2023-11-06 |
Zero-shot Bilingual App Reviews Mining with Large Language Models |
Jialiang Wei et.al. |
2311.03058v1 |
link |
2023-11-06 |
GLEN: Generative Retrieval via Lexical Index Learning |
Sunkyung Lee et.al. |
2311.03057v1 |
link |
2023-11-06 |
Adapting Pre-trained Generative Models for Extractive Question Answering |
Prabir Mallick et.al. |
2311.02961v1 |
null |
2023-11-06 |
Incorporating Worker Perspectives into MTurk Annotation Practices for NLP |
Olivia Huang et.al. |
2311.02802v1 |
null |
2023-11-05 |
Pyclipse, a library for deidentification of free-text clinical notes |
Callandra Moore et.al. |
2311.02748v1 |
null |
2023-11-05 |
mahaNLP: A Marathi Natural Language Processing Library |
Vidula Magdum et.al. |
2311.02579v1 |
link |
2023-11-05 |
Relation Extraction Model Based on Semantic Enhancement Mechanism |
Peiyu Liu et.al. |
2311.02564v1 |
null |
2023-11-03 |
Grounded Intuition of GPT-Vision's Abilities with Scientific Images |
Alyssa Hwang et.al. |
2311.02069v1 |
link |
2023-11-03 |
Hardness of Low Rank Approximation of Entrywise Transformed Matrix Products |
Tamas Sarlos et.al. |
2311.01960v1 |
null |
2023-11-03 |
Constructing Temporal Dynamic Knowledge Graphs from Interactive Text-based Games |
Keunwoo Peter Yu et.al. |
2311.01928v1 |
link |
2023-11-03 |
Enhancing search engine precision and user experience through sentiment-based polysemy resolution |
Mike Nkongolo et.al. |
2311.01895v1 |
null |
2023-11-03 |
TCM-GPT: Efficient Pre-training of Large Language Models for Domain Adaptation in Traditional Chinese Medicine |
Guoxing Yang et.al. |
2311.01786v1 |
null |
2023-11-03 |
Indo LEGO-ABSA: A Multitask Generative Aspect Based Sentiment Analysis for Indonesian Language |
Randy Zakya Suchrady et.al. |
2311.01757v1 |
link |
2023-11-03 |
Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models |
Sean Xie et.al. |
2311.01732v1 |
link |
2023-11-02 |
ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos |
Te-Lin Wu et.al. |
2311.01620v1 |
link |
2023-11-02 |
Divergent Token Metrics: Measuring degradation to prune away LLM components -- and optimize quantization |
Björn Deiseroth et.al. |
2311.01544v1 |
null |
2023-11-02 |
A Comprehensive Study of Governance Issues in Decentralized Finance Applications |
Wei Ma et.al. |
2311.01433v1 |
null |
2023-11-02 |
Efficient Vision Transformer for Accurate Traffic Sign Detection |
Javad Mirzapour Kaleybar et.al. |
2311.01429v1 |
null |
2023-11-02 |
Finding Common Ground: Annotating and Predicting Common Ground in Spoken Conversations |
Magdalena Markowska et.al. |
2311.01273v1 |
link |
2023-11-02 |
Generating QM1B with PySCF $_{\text{IPU}}$ |
Alexander Mathiasen et.al. |
2311.01135v1 |
link |
2023-11-02 |
Noise-Robust Fine-Tuning of Pretrained Language Models via External Guidance |
Song Wang et.al. |
2311.01108v1 |
null |
2023-11-02 |
On the Concerns of Developers When Using GitHub Copilot |
Xiyu Zhou et.al. |
2311.01020v1 |
null |
2023-11-01 |
Crosslingual Retrieval Augmented In-context Learning for Bangla |
Xiaoqian Li et.al. |
2311.00587v1 |
null |
2023-11-01 |
On the Opportunities of Green Computing: A Survey |
You Zhou et.al. |
2311.00447v1 |
null |
2023-11-01 |
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities |
Md Farhan Ishmam et.al. |
2311.00308v1 |
null |
2023-11-01 |
Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models |
Ran Xu et.al. |
2311.00287v1 |
link |
2023-11-01 |
Transformers as Recognizers of Formal Languages: A Survey on Expressivity |
Lena Strobl et.al. |
2311.00208v1 |
null |
2023-10-31 |
Defining a New NLP Playground |
Sha Li et.al. |
2310.20633v1 |
null |
2023-10-31 |
ACL Anthology Helper: A Tool to Retrieve and Manage Literature from ACL Anthology |
Chen Tang et.al. |
2310.20467v1 |
null |
2023-10-31 |
The SourceData-NLP dataset: integrating curation into scientific publishing for training large language models |
Jorge Abreu-Vicente et.al. |
2310.20440v1 |
link |
2023-10-31 |
AMERICANO: Argument Generation with Discourse-driven Decomposition and Agent Interaction |
Zhe Hu et.al. |
2310.20352v1 |
null |
2023-10-30 |
Integrating Summarization and Retrieval for Enhanced Personalization via Large Language Models |
Chris Richardson et.al. |
2310.20081v1 |
null |
2023-10-30 |
Partial Tensorized Transformers for Natural Language Processing |
Subhadra Vadlamannati et.al. |
2310.20077v1 |
null |
2023-10-30 |
Evaluation Framework for Understanding Sensitive Attribute Association Bias in Latent Factor Recommendation Algorithms |
Lex Beattie et.al. |
2310.20061v1 |
null |
2023-10-30 |
BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing |
Hieu Tran et.al. |
2310.19975v1 |
null |
2023-10-30 |
Deep Learning-Enabled Text Semantic Communication under Interference: An Empirical Study |
Tilahun M. Getu et.al. |
2310.19974v1 |
null |
2023-10-30 |
BTRec: BERT-Based Trajectory Recommendation for Personalized Tours |
Ngai Lam Ho et.al. |
2310.19886v1 |
link |
2023-10-30 |
Adapter Pruning using Tropical Characterization |
Rishabh Bhardwaj et.al. |
2310.19232v1 |
null |
2023-10-29 |
A Survey on Recent Named Entity Recognition and Relation Classification Methods with Focus on Few-Shot Learning Approaches |
Sakher Alqaaidi et.al. |
2310.19055v1 |
null |
2023-10-29 |
Bipartite Graph Pre-training for Unsupervised Extractive Summarization with Graph Convolutional Auto-Encoders |
Qianren Mao et.al. |
2310.18992v1 |
link |
2023-10-29 |
A Multimodal Ecological Civilization Pattern Recommendation Method Based on Large Language Models and Knowledge Graph |
Zhihang Yu et.al. |
2310.18951v1 |
null |
2023-10-29 |
A foundational neural operator that continuously learns without forgetting |
Tapas Tripura et.al. |
2310.18885v1 |
null |
2023-10-29 |
Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition |
Isaac Slaughter et.al. |
2310.18877v1 |
link |
2023-10-28 |
Translating away Translationese without Parallel Data |
Rricha Jalota et.al. |
2310.18830v1 |
null |
2023-10-28 |
Setting the Trap: Capturing and Defeating Backdoors in Pretrained Language Models through Honeypots |
Ruixiang Tang et.al. |
2310.18633v1 |
null |
2023-10-27 |
Maximizing Equitable Reach and Accessibility of ETDs |
William A. Ingram et.al. |
2310.18427v1 |
null |
2023-10-27 |
On General Language Understanding |
David Schlangen et.al. |
2310.18038v1 |
null |
2023-10-27 |
NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark |
Oscar Sainz et.al. |
2310.18018v1 |
null |
2023-10-27 |
SOUL: Towards Sentiment and Opinion Understanding of Language |
Yue Deng et.al. |
2310.17924v1 |
link |
2023-10-27 |
Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method |
Yukun Zhao et.al. |
2310.17918v1 |
null |
2023-10-27 |
Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey |
Weixu Zhang et.al. |
2310.17894v1 |
null |
2023-10-26 |
BERT-PIN: A BERT-based Framework for Recovering Missing Data Segments in Time-series Load Profiles |
Yi Hu et.al. |
2310.17742v1 |
null |
2023-10-26 |
Is Explanation the Cure? Misinformation Mitigation in the Short Term and Long Term |
Yi-Li Hsu et.al. |
2310.17711v1 |
null |
2023-10-26 |
Sliceformer: Make Multi-head Attention as Simple as Sorting in Discriminative Tasks |
Shen Yuan et.al. |
2310.17683v1 |
link |
2023-10-26 |
torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP |
Yoshitomo Matsubara et.al. |
2310.17644v1 |
link |
2023-10-26 |
A Survey on Transferability of Adversarial Examples across Deep Neural Networks |
Jindong Gu et.al. |
2310.17626v1 |
link |
2023-10-26 |
De-novo Chemical Reaction Generation by Means of Temporarily Convolutional Neural Networks |
Andrei Buin et.al. |
2310.17341v1 |
null |
2023-10-26 |
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance? |
Ahmed Alajrami et.al. |
2310.17271v1 |
null |
2023-10-26 |
miditok: A Python package for MIDI file tokenization |
Nathan Fradet et.al. |
2310.17202v1 |
link |
2023-10-26 |
M2C: Towards Automatic Multimodal Manga Complement |
Hongcheng Guo et.al. |
2310.17130v1 |
link |
2023-10-26 |
A Method for Network Intrusion Detection Using Flow Sequence and BERT Framework |
Loc Gia Nguyen et.al. |
2310.17127v1 |
null |
2023-10-25 |
This Reads Like That: Deep Learning for Interpretable Natural Language Processing |
Claudio Fanconi et.al. |
2310.17010v1 |
link |
2023-10-25 |
Understanding Social Structures from Contemporary Literary Fiction using Character Interaction Graph -- Half Century Chronology of Influential Bengali Writers |
Nafis Irtiza Tripto et.al. |
2310.16968v1 |
null |
2023-10-25 |
Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks |
Aradhana Sinha et.al. |
2310.16955v1 |
null |
2023-10-25 |
From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction |
Nima Shoghi et.al. |
2310.16802v1 |
link |
2023-10-25 |
HANSEN: Human and AI Spoken Text Benchmark for Authorship Analysis |
Nafis Irtiza Tripto et.al. |
2310.16746v1 |
null |
2023-10-25 |
SkyMath: Technical Report |
Liu Yang et.al. |
2310.16713v1 |
null |
2023-10-25 |
SSLCL: An Efficient Model-Agnostic Supervised Contrastive Learning Framework for Emotion Recognition in Conversations |
Tao Shi et.al. |
2310.16676v1 |
link |
2023-10-25 |
Exploring Large Language Models for Code Explanation |
Paheli Bhattacharya et.al. |
2310.16673v1 |
null |
2023-10-25 |
WSDMS: Debunk Fake News via Weakly Supervised Detection of Misinforming Sentences with Contextualized Social Wisdom |
Ruichao Yang et.al. |
2310.16579v1 |
link |
2023-10-25 |
FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning |
Jaemin Shin et.al. |
2310.16538v1 |
null |
2023-10-25 |
OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models |
Mingfeng Xue et.al. |
2310.16517v1 |
link |
2023-10-25 |
A Comprehensive Python Library for Deep Learning-Based Event Detection in Multivariate Time Series Data and Information Retrieval in NLP |
Menouar Azib et.al. |
2310.16485v1 |
link |
2023-10-25 |
Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training |
Max Müller-Eberstein et.al. |
2310.16484v1 |
null |
2023-10-24 |
Instruct and Extract: Instruction Tuning for On-Demand Information Extraction |
Yizhu Jiao et.al. |
2310.16040v1 |
link |
2023-10-24 |
This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models |
Iker García-Ferrero et.al. |
2310.15941v1 |
link |
2023-10-24 |
Ensemble of Task-Specific Language Models for Brain Encoding |
Sanjai Kumaran et.al. |
2310.15720v1 |
link |
2023-10-24 |
CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation |
Minzhi Li et.al. |
2310.15638v1 |
link |
2023-10-24 |
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression |
Jiduan Liu et.al. |
2310.15594v1 |
null |
2023-10-24 |
Natural Language Processing for Drug Discovery Knowledge Graphs: promises and pitfalls |
J. Charles G. Jeynes et.al. |
2310.15572v1 |
null |
2023-10-24 |
Improving Language Models Meaning Understanding and Consistency by Learning Conceptual Roles from Dictionary |
Myeongjun Erik Jang et.al. |
2310.15541v1 |
null |
2023-10-24 |
Continual Event Extraction with Semantic Confusion Rectification |
Zitao Wang et.al. |
2310.15470v1 |
link |
2023-10-23 |
Specialist or Generalist? Instruction Tuning for Specific NLP Tasks |
Chufan Shi et.al. |
2310.15326v1 |
null |
2023-10-23 |
HetGPT: Harnessing the Power of Prompt Tuning in Pre-Trained Heterogeneous Graph Neural Networks |
Yihong Ma et.al. |
2310.15318v1 |
null |
2023-10-23 |
TableQAKit: A Comprehensive and Practical Toolkit for Table-based Question Answering |
Fangyu Lei et.al. |
2310.15075v1 |
null |
2023-10-23 |
Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge |
Te-Lin Wu et.al. |
2310.15066v1 |
link |
2023-10-23 |
From Proprietary to High-Level Trigger-Action Programming Rules: A Natural Language Processing Approach |
Ekene Attoh et.al. |
2310.15024v1 |
null |
2023-10-23 |
Efficient Data Learning for Open Information Extraction with Pre-trained Language Models |
Zhiyuan Fan et.al. |
2310.15021v1 |
null |
2023-10-23 |
We are Who We Cite: Bridges of Influence Between Natural Language Processing and Other Academic Fields |
Jan Philip Wahle et.al. |
2310.14870v1 |
link |
2023-10-23 |
Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing |
Sai Koneru et.al. |
2310.14855v1 |
null |
2023-10-23 |
ULTRA-DP: Unifying Graph Pre-training with Multi-task Graph Dual Prompt |
Mouxiang Chen et.al. |
2310.14845v1 |
link |
2023-10-23 |
Generative Pre-trained Transformer for Vietnamese Community-based COVID-19 Question Answering |
Tam Minh Vo et.al. |
2310.14602v1 |
null |
2023-10-23 |
Learning to Correct Noisy Labels for Fine-Grained Entity Typing via Co-Prediction Prompt Tuning |
Minghao Tang et.al. |
2310.14596v1 |
link |
2023-10-23 |
Exploring the Boundaries of GPT-4 in Radiology |
Qianchu Liu et.al. |
2310.14573v1 |
null |
2023-10-20 |
Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations |
Jihyoung Jang et.al. |
2310.13420v1 |
null |
2023-10-20 |
Democratizing Reasoning Ability: Tailored Learning from Large Language Model |
Zhaoyang Wang et.al. |
2310.13332v1 |
link |
2023-10-20 |
Anomaly Detection of Command Shell Sessions based on DistilBERT: Unsupervised and Supervised Approaches |
Zefang Liu et.al. |
2310.13247v1 |
null |
2023-10-20 |
The GitHub Recent Bugs Dataset for Evaluating LLM-based Debugging Applications |
Jae Yong Lee et.al. |
2310.13229v1 |
link |
2023-10-20 |
The Less the Merrier? Investigating Language Representation in Multilingual Models |
Hellina Hailu Nigatu et.al. |
2310.13228v1 |
null |
2023-10-19 |
A Use Case: Reformulating Query Rewriting as a Statistical Machine Translation Problem |
Abdullah Can Algan et.al. |
2310.13031v1 |
null |
2023-10-19 |
TabuLa: Harnessing Language Models for Tabular Data Synthesis |
Zilong Zhao et.al. |
2310.12746v1 |
link |
2023-10-19 |
Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing |
Yue Guo et.al. |
2310.12664v1 |
null |
2023-10-19 |
Towards Real-World Streaming Speech Translation for Code-Switched Speech |
Belen Alastruey et.al. |
2310.12648v1 |
link |
2023-10-19 |
An Exploration of In-Context Learning for Speech Language Model |
Ming-Hao Hsu et.al. |
2310.12477v1 |
null |
2023-10-19 |
Unmasking Transformers: A Theoretical Approach to Data Recovery via Attention Weights |
Yichuan Deng et.al. |
2310.12462v1 |
null |
2023-10-19 |
Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer |
Qingru Zhang et.al. |
2310.12442v1 |
null |
2023-10-19 |
Metadata for Scientific Experiment Reporting: A Case Study in Metal-Organic Frameworks |
Xintong Zhao et.al. |
2310.12417v1 |
null |
2023-10-19 |
LoMAE: Low-level Vision Masked Autoencoders for Low-dose CT Denoising |
Dayang Wang et.al. |
2310.12405v1 |
null |
2023-10-18 |
SPEED: Speculative Pipelined Execution for Efficient Decoding |
Coleman Hooper et.al. |
2310.12072v1 |
null |
2023-10-19 |
Transformers for scientific data: a pedagogical review for astronomers |
Dimitrios Tanoglidis et.al. |
2310.12069v2 |
null |
2023-10-18 |
Evaluating the Symbol Binding Ability of Large Language Models for Multiple-Choice Questions in Vietnamese General Education |
Duc-Vu Nguyen et.al. |
2310.12059v1 |
null |
2023-10-18 |
Removing Spurious Concepts from Neural Network Representations via Joint Subspace Estimation |
Floris Holstege et.al. |
2310.11991v1 |
null |
2023-10-18 |
Towards Graph Foundation Models: A Survey and Beyond |
Jiawei Liu et.al. |
2310.11829v1 |
null |
2023-10-18 |
Telecom AI Native Systems in the Age of Generative AI -- An Engineering Perspective |
Ricardo Britto et.al. |
2310.11770v1 |
null |
2023-10-18 |
Superiority of Softmax: Unveiling the Performance Edge Over Linear Attention |
Yichuan Deng et.al. |
2310.11685v1 |
null |
2023-10-18 |
Field-testing items using artificial intelligence: Natural language processing with transformers |
Hotaka Maeda et.al. |
2310.11655v1 |
null |
2023-10-17 |
Automatic News Summerization |
Kavach Dheer et.al. |
2310.11520v1 |
null |
2023-10-17 |
Neural Attention: Enhancing QKV Calculation in Self-Attention Mechanism with Neural Networks |
Muhan Zhang et.al. |
2310.11398v1 |
link |
2023-10-17 |
Last One Standing: A Comparative Analysis of Security and Privacy of Soft Prompt Tuning, LoRA, and In-Context Learning |
Rui Wen et.al. |
2310.11397v1 |
null |
2023-10-17 |
DialogueLLM: Context and Emotion Knowledge-Tuned LLaMA Models for Emotion Recognition in Conversations |
Yazhou Zhang et.al. |
2310.11374v1 |
link |
2023-10-17 |
Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations |
Shiyuan Huang et.al. |
2310.11207v1 |
null |
2023-10-17 |
ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing |
Quoc-Nam Nguyen et.al. |
2310.11166v1 |
link |
2023-10-17 |
Unsupervised Pre-Training Using Masked Autoencoders for ECG Analysis |
Guoxin Wang et.al. |
2310.11153v1 |
null |
2023-10-17 |
The Quo Vadis of the Relationship between Language and Large Language Models |
Evelina Leivada et.al. |
2310.11146v1 |
null |
2023-10-17 |
Core Building Blocks: Next Gen Geo Spatial GPT Application |
Ashley Fernandez et.al. |
2310.11029v1 |
null |
2023-10-17 |
Enhancing Deep Neural Network Training Efficiency and Performance through Linear Prediction |
Hejie Ying et.al. |
2310.10958v1 |
null |
2023-10-17 |
Enhanced Transformer Architecture for Natural Language Processing |
Woohyeon Moon et.al. |
2310.10930v1 |
null |
2023-10-16 |
"Mistakes Help Us Grow": Facilitating and Evaluating Growth Mindset Supportive Language in Classrooms |
Kunal Handa et.al. |
2310.10637v1 |
null |
2023-10-16 |
Unifying Image Processing as Visual Prompting Question Answering |
Yihao Liu et.al. |
2310.10513v1 |
null |
2023-10-16 |
Text Summarization Using Large Language Models: A Comparative Study of MPT-7b-instruct, Falcon-7b-instruct, and OpenAI Chat-GPT Models |
Lochan Basyal et.al. |
2310.10449v1 |
link |
2023-10-16 |
Prompt Tuning for Multi-View Graph Contrastive Learning |
Chenghua Gong et.al. |
2310.10362v1 |
null |
2023-10-16 |
NLP for Crypto-Asset Regulation: A Roadmap |
Carolina Camassa et.al. |
2310.10333v1 |
null |
2023-10-16 |
VIBE: Topic-Driven Temporal Adaptation for Twitter Classification |
Yuji Zhang et.al. |
2310.10191v1 |
null |
2023-10-16 |
Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset |
Arthur Amalvy et.al. |
2310.10118v1 |
link |
2023-10-16 |
Verbosity Bias in Preference Labeling by Large Language Models |
Keita Saito et.al. |
2310.10076v1 |
null |
2023-10-16 |
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge |
Tom Bryan et.al. |
2310.10050v1 |
null |
2023-10-16 |
Empirical Study of Zero-Shot NER with ChatGPT |
Tingyu Xie et.al. |
2310.10035v1 |
link |
2023-10-12 |
A Survey on Heterogeneous Transfer Learning |
Runxue Bao et.al. |
2310.08459v1 |
link |
2023-10-12 |
Reconstructing Materials Tetrahedron: Challenges in Materials Information Extraction |
Kausik Hira et.al. |
2310.08383v1 |
link |
2023-10-12 |
Learn From Model Beyond Fine-Tuning: A Survey |
Hongling Zheng et.al. |
2310.08184v1 |
link |
2023-10-12 |
Who Wrote it and Why? Prompting Large-Language Models for Authorship Verification |
Chia-Yu Hung et.al. |
2310.08123v1 |
null |
2023-10-12 |
ClimateNLP: Analyzing Public Sentiment Towards Climate Change Using Natural Language Processing |
Ajay Krishnan T. K. et.al. |
2310.08099v1 |
null |
2023-10-11 |
Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention |
Huiyin Xue et.al. |
2310.07911v1 |
null |
2023-10-11 |
Hierarchical Pretraining on Multimodal Electronic Health Records |
Xiaochen Wang et.al. |
2310.07871v1 |
link |
2023-10-11 |
Framework for Question-Answering in Sanskrit through Automated Construction of Knowledge Graphs |
Hrishikesh Terdalkar et.al. |
2310.07848v1 |
null |
2023-10-11 |
Does Synthetic Data Make Large Language Models More Efficient? |
Sia Gholami et.al. |
2310.07830v1 |
null |
2023-10-11 |
Antarlekhaka: A Comprehensive Tool for Multi-task Natural Language Annotation |
Hrishikesh Terdalkar et.al. |
2310.07826v1 |
link |
2023-10-11 |
To Build Our Future, We Must Know Our Past: Contextualizing Paradigm Shifts in Natural Language Processing |
Sireesh Gururaja et.al. |
2310.07715v1 |
null |
2023-10-11 |
Composite Backdoor Attacks Against Large Language Models |
Hai Huang et.al. |
2310.07676v1 |
link |
2023-10-11 |
The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values |
Hannah Rose Kirk et.al. |
2310.07629v1 |
null |
2023-10-11 |
PHYDI: Initializing Parameterized Hypercomplex Neural Networks as Identity Functions |
Matteo Mancanelli et.al. |
2310.07612v1 |
link |
2023-10-11 |
Energy Estimates Across Layers of Computing: From Devices to Large-Scale Applications in Machine Learning for Natural Language Processing, Scientific Computing, and Cryptocurrency Mining |
Sadasivan Shankar et.al. |
2310.07516v1 |
null |
2023-10-11 |
KwaiYiiMath: Technical Report |
Jiayi Fu et.al. |
2310.07488v1 |
null |
2023-10-11 |
uxSense: Supporting User Experience Analysis with Visualization and Computer Vision |
Andrea Batch et.al. |
2310.07300v1 |
link |
2023-10-12 |
An Analysis on Large Language Models in Healthcare: A Case Study of BioBERT |
Shyni Sharaf et.al. |
2310.07282v2 |
null |
2023-10-11 |
BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations |
Qizhi Pei et.al. |
2310.07276v1 |
link |
2023-10-11 |
A Comparative Study of Pre-trained CNNs and GRU-Based Attention for Image Caption Generation |
Rashid Khan et.al. |
2310.07252v1 |
null |
2023-10-10 |
Topic-DPR: Topic-based Prompts for Dense Passage Retrieval |
Qingfa Xiao et.al. |
2310.06626v1 |
null |
2023-10-10 |
FTFT: efficient and robust Fine-Tuning by transFerring Training dynamics |
Yupei Du et.al. |
2310.06588v1 |
link |
2023-10-10 |
Watt For What: Rethinking Deep Learning's Energy-Performance Relationship |
Shreyank N Gowda et.al. |
2310.06522v1 |
null |
2023-10-10 |
Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling Task |
Guanting Dong et.al. |
2310.06504v1 |
link |
2023-10-10 |
Evolution of Natural Language Processing Technology: Not Just Language Processing Towards General Purpose AI |
Masahiro Yamamoto et.al. |
2310.06228v1 |
null |
2023-10-09 |
From Text to Knowledge with Graphs: modelling, querying and exploiting textual content |
Genoveva Vargas-Solar et.al. |
2310.06122v1 |
null |
2023-10-09 |
Improving Summarization with Human Edits |
Zonghai Yao et.al. |
2310.05857v1 |
link |
2023-10-10 |
Are Large Language Models Post Hoc Explainers? |
Nicholas Kroeger et.al. |
2310.05797v2 |
link |
2023-10-09 |
Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions |
Lucie-Aimée Kaffee et.al. |
2310.05779v1 |
link |
2023-10-09 |
Larth: Dataset and Machine Translation for Etruscan |
Gianluca Vico et.al. |
2310.05688v1 |
link |
2023-10-09 |
ViTs are Everywhere: A Comprehensive Study Showcasing Vision Transformers in Different Domain |
Md Sohag Mia et.al. |
2310.05664v1 |
null |
2023-10-09 |
Regulation and NLP (RegNLP): Taming Large Language Models |
Catalina Goanta et.al. |
2310.05553v1 |
null |
2023-10-09 |
Generative Judge for Evaluating Alignment |
Junlong Li et.al. |
2310.05470v1 |
link |
2023-10-09 |
Establishing Trustworthiness: Rethinking Tasks and Model Evaluation |
Robert Litschko et.al. |
2310.05442v1 |
null |
2023-10-09 |
Resolving the Imbalance Issue in Hierarchical Disciplinary Topic Inference via LLM-based Data Augmentation |
Xunxin Cai et.al. |
2310.05318v1 |
null |
2023-10-09 |
Enhancing Long-form Text Generation in Mental Health\ with Task-adaptive Tokenization |
Siyang Liu et.al. |
2310.05317v1 |
link |
2023-10-06 |
Multi-Industry Simplex : A Probabilistic Extension of GICS |
Maksim Papenkov et.al. |
2310.04280v1 |
null |
2023-10-06 |
Analysis of the Reasoning with Redundant Information Provided Ability of Large Language Models |
Wenbei Xie et.al. |
2310.04039v1 |
null |
2023-10-06 |
Quantized Transformer Language Model Implementations on Edge Devices |
Mohammad Wali Ur Rahman et.al. |
2310.03971v1 |
null |
2023-10-05 |
Multitask Learning for Time Series Data\with 2D Convolution |
Chin-Chia Michael Yeh et.al. |
2310.03925v1 |
null |
2023-10-05 |
The Anatomy of Deception: Technical and Human Perspectives on a Large-scale Phishing Campaign |
Anargyros Chrysanthou et.al. |
2310.03498v1 |
null |
2023-10-05 |
Procedural Text Mining with Large Language Models |
Anisa Rula et.al. |
2310.03376v1 |
link |
2023-10-05 |
A Formalism and Approach for Improving Robustness of Large Language Models Using Risk-Adjusted Confidence Scores |
Ke Shen et.al. |
2310.03283v1 |
null |
2023-10-05 |
InstructProtein: Aligning Human and Protein Language via Knowledge Instruction |
Zeyuan Wang et.al. |
2310.03269v1 |
null |
2023-10-05 |
Sparse Deep Learning for Time Series Data: Theory and Applications |
Mingxuan Zhang et.al. |
2310.03243v1 |
null |
2023-10-05 |
Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs |
Yijia Xiao et.al. |
2310.03221v1 |
link |
2023-10-04 |
Neural architecture impact on identifying temporally extended Reinforcement Learning tasks |
Victor Vadakechirayath George et.al. |
2310.03161v1 |
null |
2023-10-04 |
Federated Fine-Tuning of LLMs on the Very Edge: The Good, the Bad, the Ugly |
Herbert Woisetschläger et.al. |
2310.03150v1 |
null |
2023-10-04 |
MetaTool Benchmark: Deciding Whether to Use Tools and Which to Use |
Yue Huang et.al. |
2310.03128v1 |
link |
2023-10-04 |
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making |
Jeonghye Kim et.al. |
2310.03022v1 |
null |
2023-10-04 |
DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning |
Jiong Xiong et.al. |
2310.02954v1 |
link |
2023-10-04 |
Low Resource Summarization using Pre-trained Language Models |
Mubashir Munaf et.al. |
2310.02790v1 |
null |
2023-10-04 |
SALSA: Semantically-Aware Latent Space Autoencoder |
Kathryn E. Kirchoff et.al. |
2310.02744v1 |
null |
2023-10-04 |
AGIR: Automating Cyber Threat Intelligence Reporting with Natural Language Generation |
Filippo Perrina et.al. |
2310.02655v1 |
link |
2023-10-03 |
Backdoor Adjustment of Confounding by Provenance for Robust Text Classification of Multi-institutional Clinical Notes |
Xiruo Ding et.al. |
2310.02451v1 |
null |
2023-10-03 |
A method to assess trustworthiness of machine coding at scale |
Rebeckah K. Fussell et.al. |
2310.02335v1 |
null |
2023-10-03 |
MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens |
Kaizhi Zheng et.al. |
2310.02239v1 |
link |
2023-10-03 |
Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View |
Jintian Zhang et.al. |
2310.02124v1 |
link |
2023-10-03 |
dFlow: A Domain Specific Language for the Rapid Development of open-source Virtual Assistants |
Nikolaos Malamas et.al. |
2310.02102v1 |
null |
2023-10-03 |
Jury: A Comprehensive Evaluation Toolkit |
Devrim Cavusoglu et.al. |
2310.02040v1 |
link |
2023-10-03 |
Hierarchical Evaluation Framework: Best Practices for Human Evaluation |
Iva Bojic et.al. |
2310.01917v1 |
null |
2023-10-03 |
Effective and Parameter-Efficient Reusing Fine-Tuned Models |
Weisen Jiang et.al. |
2310.01886v1 |
null |
2023-10-03 |
Time-LLM: Time Series Forecasting by Reprogramming Large Language Models |
Ming Jin et.al. |
2310.01728v1 |
link |
2023-10-02 |
Transformers are efficient hierarchical chemical graph learners |
Zihan Pengmei et.al. |
2310.01704v1 |
link |
2023-10-02 |
Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language Models |
Zijun Wu et.al. |
2310.01691v1 |
link |
2023-10-02 |
A Review of Digital Learning Environments for Teaching Natural Language Processing in K-12 Education |
Xiaoyi Tian et.al. |
2310.01603v1 |
null |
2023-09-29 |
A Large Language Model Approach to Educational Survey Feedback Analysis |
Michael J. Parker et.al. |
2309.17447v1 |
null |
2023-09-29 |
Network Memory Footprint Compression Through Jointly Learnable Codebooks and Mappings |
Edouard Yvinec et.al. |
2309.17361v1 |
null |
2023-09-29 |
Overview of the BioLaySumm 2023 Shared Task on Lay Summarization of Biomedical Research Articles |
Tomsa Goldsack et.al. |
2309.17332v1 |
null |
2023-09-29 |
Benchmarking the Abilities of Large Language Models for RDF Knowledge Graph Creation and Comprehension: How Well Do LLMs Speak Turtle? |
Johannes Frey et.al. |
2309.17122v1 |
link |
2023-09-29 |
Interpretable Long-Form Legal Question Answering with Retrieval-Augmented Large Language Models |
Antoine Louis et.al. |
2309.17050v1 |
link |
2023-09-28 |
DeBERTinha: A Multistep Approach to Adapt DebertaV3 XSmall for Brazilian Portuguese Natural Language Processing Task |
Israel Campiotti et.al. |
2309.16844v1 |
null |
2023-09-28 |
How many words does ChatGPT know? The answer is ChatWords |
Gonzalo Martínez et.al. |
2309.16777v1 |
link |
2023-09-28 |
Neural scaling laws for phenotypic drug discovery |
Drew Linsley et.al. |
2309.16773v1 |
null |
2023-09-28 |
Qwen Technical Report |
Jinze Bai et.al. |
2309.16609v1 |
link |
2023-09-28 |
Augmenting LLMs with Knowledge: A survey on hallucination prevention |
Konstantinos Andriopoulos et.al. |
2309.16459v1 |
null |
2023-09-28 |
A Comprehensive Survey of Document-level Relation Extraction (2016-2022) |
Julien Delaunay et.al. |
2309.16396v1 |
null |
2023-09-27 |
ChatGPT-BCI: Word-Level Neural State Classification Using GPT, EEG, and Eye-Tracking Biomarkers in Semantic Inference Reading Comprehension |
Yuhong Zhang et.al. |
2309.15714v1 |
null |
2023-09-27 |
NLPBench: Evaluating Large Language Models on Solving NLP Problems |
Linxin Song et.al. |
2309.15630v1 |
link |
2023-09-27 |
Tackling VQA with Pretrained Foundation Models without Further Training |
Alvin De Jun Tan et.al. |
2309.15487v1 |
null |
2023-09-27 |
A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future |
Zheng Chu et.al. |
2309.15402v1 |
link |
2023-09-26 |
VPA: Fully Test-Time Visual Prompt Adaptation |
Jiachen Sun et.al. |
2309.15251v1 |
null |
2023-09-26 |
Eve Said Yes: AirBone Authentication for Head-Wearable Smart Voice Assistant |
Chenpei Huang et.al. |
2309.15203v1 |
null |
2023-09-26 |
The Role of Document Embedding in Research Paper Recommender Systems: To Breakdown or to Bolster Disciplinary Borders? |
Eoghan Cunningham et.al. |
2309.14984v1 |
null |
2023-09-27 |
Text-to-Image Generation for Abstract Concepts |
Jiayi Liao et.al. |
2309.14623v2 |
null |
2023-09-26 |
Confidence Intervals for the F1 Score: A Comparison of Four Methods |
Kevin Fu Yuan Lam et.al. |
2309.14621v1 |
null |
2023-09-25 |
When Automated Assessment Meets Automated Content Generation: Examining Text Quality in the Era of GPTs |
Marialena Bevilacqua et.al. |
2309.14488v1 |
link |
2023-09-25 |
Urdu Poetry Generated by Using Deep Learning Techniques |
Muhammad Shoaib Farooq et.al. |
2309.14233v1 |
null |
2023-09-25 |
Comprehensive Overview of Named Entity Recognition: Models, Domain-Specific Applications and Challenges |
Kalyani Pakhale et.al. |
2309.14084v1 |
null |
2023-09-25 |
Graph Representation Learning Towards Patents Network Analysis |
Mohammad Heydari et.al. |
2309.13888v1 |
null |
2023-09-24 |
Text Classification: A Perspective of Deep Learning Methods |
Zhongwei Wan et.al. |
2309.13761v1 |
null |
2023-09-24 |
Arabic Sentiment Analysis with Noisy Deep Explainable Model |
Md. Atabuzzaman et.al. |
2309.13731v1 |
null |
2023-09-24 |
Skill Check: Some Considerations on the Evaluation of Gamemastering Models for Role-playing Games |
Santiago Góngora et.al. |
2309.13702v1 |
link |
2023-09-24 |
Accelerating Large Batch Training via Gradient Signal to Noise Ratio (GSNR) |
Guo-qing Jiang et.al. |
2309.13681v1 |
null |
2023-09-23 |
Spanish Resource Grammar version 2023 |
Olga Zamaraeva et.al. |
2309.13318v1 |
null |
2023-09-23 |
Natural Language Processing for Requirements Formalization: How to Derive New Approaches? |
Viju Sudhi et.al. |
2309.13272v1 |
link |
2023-09-23 |
A Survey of Document-Level Information Extraction |
Hanwen Zheng et.al. |
2309.13249v1 |
null |
2023-09-22 |
Decoding Affect in Dyadic Conversations: Leveraging Semantic Similarity through Sentence Embedding |
Chen-Wei Yu et.al. |
2309.12646v1 |
null |
2023-09-22 |
Construction contract risk identification based on knowledge-augmented language model |
Saika Wong et.al. |
2309.12626v1 |
null |
2023-09-21 |
Understanding the language of molecules: Predicting pure component parameters for the PC-SAFT equation of state from SMILES |
Benedikt Winter et.al. |
2309.12404v1 |
null |
2023-09-21 |
Improving VTE Identification through Adaptive NLP Model Selection and Clinical Expert Rule-based Classifier from Radiology Reports |
Jamie Deng et.al. |
2309.12273v1 |
null |
2023-09-22 |
Rethinking the Evaluating Framework for Natural Language Understanding in AI Systems: Language Acquisition as a Core for Future Metrics |
Patricio Vera et.al. |
2309.11981v2 |
null |
2023-09-21 |
Stock Market Sentiment Classification and Backtesting via Fine-tuned BERT |
Jiashu Lou et.al. |
2309.11979v1 |
null |
2023-09-20 |
Transformers versus LSTMs for electronic trading |
Paul Bilokon et.al. |
2309.11400v1 |
link |
2023-09-20 |
Studying Lobby Influence in the European Parliament |
Aswin Suresh et.al. |
2309.11381v1 |
null |
2023-09-20 |
When to Trust AI: Advances and Challenges for Certification of Neural Networks |
Marta Kwiatkowska et.al. |
2309.11196v1 |
null |
2023-09-20 |
Prototype of a robotic system to assist the learning process of English language with text-generation through DNN |
Carlos Morales-Torres et.al. |
2309.11142v1 |
null |
2023-09-20 |
Language-Oriented Communication with Semantic Coding and Knowledge Distillation for Text-to-Image Generation |
Hyelin Nam et.al. |
2309.11127v1 |
null |
2023-09-20 |
AttentionMix: Data augmentation method that relies on BERT attention mechanism |
Dominik Lewy et.al. |
2309.11104v1 |
null |
2023-09-21 |
fakenewsbr: A Fake News Detection Platform for Brazilian Portuguese |
Luiz Giordani et.al. |
2309.11052v2 |
null |
2023-09-20 |
Making Small Language Models Better Multi-task Learners with Mixture-of-Task-Adapters |
Yukang Xie et.al. |
2309.11042v1 |
null |
2023-09-19 |
LMDX: Language Model-based Document Information Extraction and Localization |
Vincent Perot et.al. |
2309.10952v1 |
null |
2023-09-19 |
Artificial Intelligence-Enabled Intelligent Assistant for Personalized and Adaptive Learning in Higher Education |
Ramteja Sajja et.al. |
2309.10892v1 |
null |
2023-09-19 |
FRASIMED: a Clinical French Annotated Resource Produced through Crosslingual BERT-Based Annotation Projection |
Jamil Zaghir et.al. |
2309.10770v1 |
null |
2023-09-19 |
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch |
Juntao Li et.al. |
2309.10706v1 |
link |
2023-09-19 |
NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages |
Samuel Cahyawijaya et.al. |
2309.10661v1 |
link |
2023-09-19 |
CFGPT: Chinese Financial Assistant with Large Language Model |
Jiangtong Li et.al. |
2309.10654v1 |
link |
2023-09-19 |
FRACAS: A FRench Annotated Corpus of Attribution relations in newS |
Ange Richard et.al. |
2309.10604v1 |
null |
2023-09-19 |
Mixed-Distil-BERT: Code-mixed Language Modeling for Bangla, English, and Hindi |
Md Nishat Raihan et.al. |
2309.10272v1 |
null |
2023-09-18 |
Stabilizing RLHF through Advantage Model and Selective Rehearsal |
Baolin Peng et.al. |
2309.10202v1 |
null |
2023-09-18 |
Automated Interviewer or Augmented Survey? Collecting Social Data with Large Language Models |
Alejandro Cuevas Villalba et.al. |
2309.10187v1 |
link |
2023-09-19 |
Watch the Speakers: A Hybrid Continuous Attribution Network for Emotion Recognition in Conversation With Emotion Disentanglement |
Shanglin Lei et.al. |
2309.09799v2 |
null |
2023-09-18 |
FedLALR: Client-Specific Adaptive Learning Rates Achieve Linear Speedup for Non-IID Data |
Hao Sun et.al. |
2309.09719v1 |
null |
2023-09-18 |
Do learned speech symbols follow Zipf's law? |
Shinnosuke Takamichi et.al. |
2309.09690v1 |
null |
2023-09-18 |
FactoFormer: Factorized Hyperspectral Transformers with Self-Supervised Pre-Training |
Shaheer Mohamed et.al. |
2309.09431v1 |
link |
2023-09-17 |
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages |
Thuat Nguyen et.al. |
2309.09400v1 |
null |
2023-09-17 |
OWL: A Large Language Model for IT Operations |
Hongcheng Guo et.al. |
2309.09298v1 |
null |
2023-09-16 |
Constructing a Knowledge Graph for Vietnamese Legal Cases with Heterogeneous Graphs |
Thi-Hai-Yen Vuong et.al. |
2309.09069v1 |
null |
2023-09-16 |
Context-aware Adversarial Attack on Named Entity Recognition |
Shuguang Chen et.al. |
2309.08999v1 |
null |
2023-09-16 |
RMP: A Random Mask Pretrain Framework for Motion Prediction |
Yi Yang et.al. |
2309.08989v1 |
link |
2023-09-16 |
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT) |
Parsa Kavehzadeh et.al. |
2309.08968v1 |
null |
2023-09-15 |
VulnSense: Efficient Vulnerability Detection in Ethereum Smart Contracts by Multimodal Learning with Graph Neural Network and Language Model |
Phan The Duy et.al. |
2309.08474v1 |
null |
2023-09-15 |
Understanding the limitations of self-supervised learning for tabular anomaly detection |
Kimberly T. Mai et.al. |
2309.08374v1 |
null |
2023-09-15 |
Exploring the Potential of ChatGPT in Automated Code Refinement: An Empirical Study |
Qi Guo et.al. |
2309.08221v1 |
null |
2023-09-14 |
An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing |
Sonish Sivarajkumar et.al. |
2309.08008v1 |
null |
2023-09-14 |
A Multi-In and Multi-Out Dendritic Neuron Model and its Optimization |
Yu Ding et.al. |
2309.07791v1 |
null |
2023-09-14 |
Complexity Scaling for Speech Denoising |
Hangting Chen et.al. |
2309.07757v1 |
null |
2023-09-14 |
Generative AI Text Classification using Ensemble LLM Approaches |
Harika Abburi et.al. |
2309.07755v1 |
null |
2023-09-14 |
NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation |
Jiaqi Zhang et.al. |
2309.07705v1 |
link |
2023-09-14 |
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? |
Rishav Hada et.al. |
2309.07462v1 |
null |
2023-09-14 |
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects |
David Ifeoluwa Adelani et.al. |
2309.07445v1 |
link |
2023-09-14 |
Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts |
Dave Van Veen et.al. |
2309.07430v1 |
link |
2023-09-14 |
Multi-Grade Deep Learning for Partial Differential Equations with Applications to the Burgers Equation |
Yuesheng Xu et.al. |
2309.07401v1 |
null |
2023-09-14 |
Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS |
Yifan Yang et.al. |
2309.07377v1 |
link |
2023-09-13 |
Traveling Words: A Geometric Interpretation of Transformers |
Raul Molina et.al. |
2309.07315v1 |
link |
2023-09-13 |
Beyond original Research Articles Categorization via NLP |
Rosanna Turrisi et.al. |
2309.07020v1 |
link |
2023-09-13 |
Comparative Analysis of Contextual Relation Extraction based on Deep Learning Models |
R. Priyadharshini et.al. |
2309.06814v1 |
null |
2023-09-13 |
Electricity Demand Forecasting through Natural Language Processing with Long Short-Term Memory Networks |
Yun Bai et.al. |
2309.06793v1 |
null |
2023-09-13 |
Bias Amplification Enhances Minority Group Performance |
Gaotang Li et.al. |
2309.06717v1 |
link |
2023-09-13 |
Simultaneous Machine Translation with Large Language Models |
Minghan Wang et.al. |
2309.06706v1 |
null |
2023-09-12 |
Narrowing the Gap between Supervised and Unsupervised Sentence Representation Learning with Large Language Model |
Mingxin Li et.al. |
2309.06453v1 |
link |
2023-09-12 |
Grounded Language Acquisition From Object and Action Imagery |
James Robert Kubricht et.al. |
2309.06335v1 |
null |
2023-09-12 |
Improving and Evaluating the Detection of Fragmentation in News Recommendations with the Clustering of News Story Chains |
Alessandra Polimeno et.al. |
2309.06192v1 |
null |
2023-09-13 |
Backdoor Attacks and Countermeasures in Natural Language Processing Models: A Comprehensive Security Review |
Pengzhou Cheng et.al. |
2309.06055v2 |
null |
2023-09-11 |
Black-Box Analysis: GPTs Across Time in Legal Textual Entailment Task |
Ha-Thanh Nguyen et.al. |
2309.05501v1 |
null |
2023-09-11 |
NeCo@ALQAC 2023: Legal Domain Knowledge Acquisition for Low-Resource Languages through Data Enrichment |
Hai-Long Nguyen et.al. |
2309.05500v1 |
null |
2023-09-11 |
LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech |
Titouan Parcollet et.al. |
2309.05472v1 |
null |
2023-09-11 |
Improving Information Extraction on Business Documents with Specific Pre-Training Tasks |
Thibault Douzon et.al. |
2309.05429v1 |
link |
2023-09-11 |
Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach |
Tae Jin Park et.al. |
2309.05248v1 |
null |
2023-09-11 |
DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning |
Zhengxiang Shi et.al. |
2309.05173v1 |
link |
2023-09-10 |
What's Hard in English RST Parsing? Predictive Models for Error Analysis |
Yang Janet Liu et.al. |
2309.04940v1 |
link |
2023-09-10 |
Unsupervised Chunking with Hierarchical RNN |
Zijun Wu et.al. |
2309.04919v1 |
link |
2023-09-09 |
Distributional Data Augmentation Methods for Low Resource Language |
Mosleh Mahamud et.al. |
2309.04862v1 |
link |
2023-09-09 |
Leveraging Large Language Models for Exploiting ASR Uncertainty |
Pranay Dighe et.al. |
2309.04842v1 |
null |
2023-09-08 |
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning |
David Yunis et.al. |
2309.04459v1 |
null |
2023-09-08 |
Active Learning for Classifying 2D Grid-Based Level Completability |
Mahsa Bazzaz et.al. |
2309.04367v1 |
link |
2023-09-08 |
Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts |
Erik Daxberger et.al. |
2309.04354v1 |
null |
2023-09-08 |
Fuzzy Fingerprinting Transformer Language-Models for Emotion Recognition in Conversations |
Patrícia Pereira et.al. |
2309.04292v1 |
null |
2023-09-08 |
LLMCad: Fast and Scalable On-device Large Language Model Inference |
Daliang Xu et.al. |
2309.04255v1 |
null |
2023-09-08 |
Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese |
Haochun Wang et.al. |
2309.04175v1 |
null |
2023-09-07 |
Conformal Autoregressive Generation: Beam Search with Coverage Guarantees |
Nicolas Deutschmann et.al. |
2309.03797v1 |
null |
2023-09-07 |
USA: Universal Sentiment Analysis Model & Construction of Japanese Sentiment Text Classification and Part of Speech Dataset |
Chengguang Gan et.al. |
2309.03787v1 |
link |
2023-09-07 |
Machine Learning for Tangible Effects: Natural Language Processing for Uncovering the Illicit Massage Industry & Computer Vision for Tactile Sensing |
Rui Ouyang et.al. |
2309.03470v1 |
null |
2023-09-06 |
J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News |
Tharindu Kumarage et.al. |
2309.03164v1 |
link |
2023-09-06 |
Leave no Place Behind: Improved Geolocation in Humanitarian Documents |
Enrico M. Belliardo et.al. |
2309.02914v1 |
null |
2023-09-06 |
ViCGCN: Graph Convolutional Network with Contextualized Language Models for Social Media Mining in Vietnamese |
Chau-Thang Phan et.al. |
2309.02902v1 |
link |
2023-09-07 |
Aligning Large Language Models for Clinical Tasks |
Supun Manathunga et.al. |
2309.02884v2 |
link |
2023-09-05 |
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges |
Maryam Zare et.al. |
2309.02473v1 |
null |
2023-09-05 |
Sample Size in Natural Language Processing within Healthcare Research |
Jaya Chaturvedi et.al. |
2309.02237v1 |
null |
2023-09-05 |
Incorporating Dictionaries into a Neural Network Architecture to Extract COVID-19 Medical Concepts From Social Media |
Abul Hasan et.al. |
2309.02188v1 |
null |
2023-09-05 |
Bridging Emotion Role Labeling and Appraisal-based Emotion Analysis |
Roman Klinger et.al. |
2309.02092v1 |
null |
2023-09-05 |
Enhance Multi-domain Sentiment Analysis of Review Texts through Prompting Strategies |
Yajing Wang et.al. |
2309.02045v1 |
null |
2023-09-05 |
Bilevel Scheduled Sampling for Dialogue Generation |
Jiawen Liu et.al. |
2309.01953v1 |
null |
2023-09-04 |
Into the Single Cell Multiverse: an End-to-End Dataset for Procedural Knowledge Extraction in Biomedical Texts |
Ruth Dannenfelser et.al. |
2309.01812v1 |
link |
2023-09-04 |
Prompting or Fine-tuning? A Comparative Study of Large Language Models for Taxonomy Construction |
Boqi Chen et.al. |
2309.01715v1 |
link |
2023-09-04 |
ChatRule: Mining Logical Rules with Large Language Models for Knowledge Graph Reasoning |
Linhao Luo et.al. |
2309.01538v1 |
link |
2023-09-03 |
A Visual Interpretation-Based Self-Improved Classification System Using Virtual Adversarial Training |
Shuai Jiang et.al. |
2309.01196v1 |
null |
2023-09-03 |
Large Language Models for Generative Recommendation: A Survey and Visionary Discussions |
Lei Li et.al. |
2309.01157v1 |
null |
2023-09-01 |
When Do Discourse Markers Affect Computational Sentence Understanding? |
Ruiqi Li et.al. |
2309.00368v1 |
null |
2023-09-01 |
Comparative Topic Modeling for Determinants of Divergent Report Results Applied to Macular Degeneration Studies |
Lucas Cassiel Jacaruso et.al. |
2309.00312v1 |
null |
2023-09-01 |
FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Automated Fact-Checking |
Tsun-Hin Cheung et.al. |
2309.00240v1 |
null |
2023-09-01 |
ALJP: An Arabic Legal Judgment Prediction in Personal Status Cases Using Machine Learning Models |
Salwa Abbara et.al. |
2309.00238v1 |
null |
2023-08-31 |
Predicting Financial Market Trends using Time Series Analysis and Natural Language Processing |
Ali Asgarov et.al. |
2309.00136v1 |
null |
2023-08-31 |
PointLLM: Empowering Large Language Models to Understand Point Clouds |
Runsen Xu et.al. |
2308.16911v1 |
link |
2023-08-31 |
Using Large Language Models to Automate Category and Trend Analysis of Scientific Articles: An Application in Ophthalmology |
Hina Raja et.al. |
2308.16688v1 |
null |
2023-08-31 |
High Accuracy Location Information Extraction from Social Network Texts Using Natural Language Processing |
Lossan Bonde et.al. |
2308.16615v1 |
null |
2023-08-31 |
Link Prediction for Wikipedia Articles as a Natural Language Inference Task |
Chau-Thang Phan et.al. |
2308.16469v1 |
link |
2023-08-30 |
Debunking Disinformation: Revolutionizing Truth with NLP in Fake News Detection |
Li He et.al. |
2308.16328v1 |
null |
2023-08-30 |
Materials Informatics Transformer: A Language Model for Interpretable Materials Properties Prediction |
Hongshuo Huang et.al. |
2308.16259v1 |
link |
2023-08-30 |
Automatic assessment of text-based responses in post-secondary education: A systematic review |
Rujun Gao et.al. |
2308.16151v1 |
null |
2023-08-30 |
Conti Inc.: Understanding the Internal Discussions of a large Ransomware-as-a-Service Operator with Machine Learning |
Estelle Ruellan et.al. |
2308.16061v1 |
null |
2023-08-30 |
DTrOCR: Decoder-only Transformer for Optical Character Recognition |
Masato Fujitake et.al. |
2308.15996v1 |
null |
2023-08-30 |
AI-powered Fraud Detection in Decentralized Finance: A Project Life Cycle Perspective |
Bingqiao Luo et.al. |
2308.15992v1 |
null |
2023-08-30 |
WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model |
Tianyu Wang et.al. |
2308.15962v1 |
null |
2023-08-30 |
Benchmarking Multilabel Topic Classification in the Kyrgyz Language |
Anton Alekseev et.al. |
2308.15952v1 |
link |
2023-08-30 |
The Janus System: Multi-paradigm Programming in Prolog and Python |
Theresa Swift et.al. |
2308.15893v1 |
null |
2023-08-30 |
HAlf-MAsked Model for Named Entity Sentiment analysis |
Anton Kabaev et.al. |
2308.15793v1 |
null |
2023-08-29 |
Document AI: A Comparative Study of Transformer-Based, Graph-Based Models, and Convolutional Neural Networks For Document Layout Analysis |
Sotirios Kastanas et.al. |
2308.15517v1 |
link |
2023-08-29 |
Vulgar Remarks Detection in Chittagonian Dialect of Bangla |
Tanjim Mahmud et.al. |
2308.15448v1 |
null |
2023-08-29 |
Historical patterns of rice farming explain modern-day language use in China and Japan more than modernization and urbanization |
Sharath Chandra Guntuku et.al. |
2308.15352v1 |
null |
2023-08-29 |
A Framework for Responsible Development of Automated Student Feedback with Generative AI |
Euan D Lindsay et.al. |
2308.15334v1 |
null |
2023-08-29 |
CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs |
Hiroyuki Ootomo et.al. |
2308.15136v1 |
link |
2023-08-29 |
Large Language Models on the Chessboard: A Study on ChatGPT's Formal Language Comprehension and Complex Reasoning Skills |
Mu-Tien Kuo et.al. |
2308.15118v1 |
null |
2023-08-29 |
Serving MoE Models on Resource-constrained Edge Devices via Dynamic Expert Swapping |
Rui Kong et.al. |
2308.15030v1 |
null |
2023-08-29 |
TransPrompt v2: A Transferable Prompting Framework for Cross-task Text Classification |
Jianing Wang et.al. |
2308.15010v1 |
null |
2023-08-29 |
CEFHRI: A Communication Efficient Federated Learning Framework for Recognizing Industrial Human-Robot Interaction |
Umar Khalid et.al. |
2308.14965v1 |
link |
2023-08-28 |
Diversified Ensemble of Independent Sub-Networks for Robust Self-Supervised Representation Learning |
Amirhossein Vahidi et.al. |
2308.14705v1 |
null |
2023-08-28 |
ANER: Arabic and Arabizi Named Entity Recognition using Transformer-Based Approach |
Abdelrahman "Boda" Sadallah et.al. |
2308.14669v1 |
null |
2023-08-28 |
Large Graph Models: A Perspective |
Ziwei Zhang et.al. |
2308.14522v1 |
link |
2023-08-28 |
Biomedical Entity Linking with Triple-aware Pre-Training |
Xi Yan et.al. |
2308.14429v1 |
null |
2023-08-28 |
Rethinking Mobile AI Ecosystem in the LLM Era |
Jinliang Yuan et.al. |
2308.14363v1 |
link |
2023-08-28 |
Can Transformer and GNN Help Each Other? |
Peiyan Zhang et.al. |
2308.14355v1 |
null |
2023-08-28 |
FonMTL: Towards Multitask Learning for the Fon Language |
Bonaventure F. P. Dossou et.al. |
2308.14280v1 |
link |
2023-08-28 |
Goodhart's Law Applies to NLP's Explanation Benchmarks |
Jennifer Hsia et.al. |
2308.14272v1 |
null |
2023-08-27 |
Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models |
Kaiyuan Gao et.al. |
2308.14149v1 |
link |
2023-08-27 |
Detecting Language Model Attacks with Perplexity |
Gabriel Alon et.al. |
2308.14132v1 |
null |
2023-08-25 |
ChatGPT as Data Augmentation for Compositional Generalization: A Case Study in Open Intent Detection |
Yihao Fang et.al. |
2308.13517v1 |
link |
2023-08-25 |
Ngambay-French Neural Machine Translation (sba-Fr) |
Sakayo Toadoum Sari et.al. |
2308.13497v1 |
link |
2023-08-25 |
Leveraging Knowledge and Reinforcement Learning for Enhanced Reliability of Language Models |
Nancy Tyagi et.al. |
2308.13467v1 |
null |
2023-08-25 |
ARTIST: ARTificial Intelligence for Simplified Text |
Lorenzo Corti et.al. |
2308.13458v1 |
link |
2023-08-25 |
QKSAN: A Quantum Kernel Self-Attention Network |
Ren-Xin Zhao et.al. |
2308.13422v1 |
null |
2023-08-25 |
In-context learning for model-free system identification |
Marco Forgione et.al. |
2308.13380v1 |
link |
2023-08-25 |
Construction Grammar and Language Models |
Harish Tayyar Madabushi et.al. |
2308.13315v1 |
null |
2023-08-25 |
LLM2KB: Constructing Knowledge Bases using instruction tuned context aware Large Language Models |
Anmol Nayak et.al. |
2308.13207v1 |
link |
2023-08-25 |
Chunk, Align, Select: A Simple Long-sequence Processing Method for Transformers |
Jiawen Xie et.al. |
2308.13191v1 |
null |
2023-08-25 |
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models |
Wenqi Shao et.al. |
2308.13137v1 |
link |
2023-08-24 |
Text Similarity from Image Contents using Statistical and Semantic Analysis Techniques |
Sagar Kulkarni et.al. |
2308.12842v1 |
null |
2023-08-24 |
Sparks of Large Audio Models: A Survey and Outlook |
Siddique Latif et.al. |
2308.12792v1 |
link |
2023-08-24 |
Pre-training Code Representation with Semantic Flow Graph for Effective Bug Localization |
Yali Du et.al. |
2308.12773v1 |
link |
2023-08-23 |
Simple is Better and Large is Not Enough: Towards Ensembling of Foundational Language Models |
Nancy Tyagi et.al. |
2308.12272v1 |
null |
2023-08-23 |
Curriculum Learning with Adam: The Devil Is in the Wrong Details |
Lucas Weber et.al. |
2308.12202v1 |
null |
2023-08-23 |
Out of the Cage: How Stochastic Parrots Win in Cyber Security Environments |
Maria Rigaki et.al. |
2308.12086v1 |
link |
2023-08-23 |
Bridging the Gap: Deciphering Tabular Data Using Large Language Model |
Hengyuan Zhang et.al. |
2308.11891v1 |
null |
2023-08-22 |
Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model |
Yuezhou Zhang et.al. |
2308.11773v1 |
null |
2023-08-24 |
Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models |
Mohamed Elaraby et.al. |
2308.11764v2 |
link |
2023-08-22 |
Uncertainty Estimation of Transformers' Predictions via Topological Analysis of the Attention Matrices |
Elizaveta Kostenok et.al. |
2308.11295v1 |
null |
2023-08-22 |
The Software Heritage License Dataset (2022 Edition) |
Jesús M. González-Barahona et.al. |
2308.11258v1 |
null |
2023-08-22 |
ConcatPlexer: Additional Dim1 Batching for Faster ViTs |
Donghoon Han et.al. |
2308.11199v1 |
null |
2023-08-22 |
ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation |
Jianghao Lin et.al. |
2308.11131v1 |
link |
2023-08-21 |
Unlocking Hardware Security Assurance: The Potential of LLMs |
Xingyu Meng et.al. |
2308.11042v1 |
null |
2023-08-21 |
Practical Parallel Algorithms for Non-Monotone Submodular Maximization |
Shuang Cui et.al. |
2308.10656v1 |
null |
2023-08-21 |
Exploring Equation as a Better Intermediate Meaning Representation for Numerical Reasoning |
Dingzirui Wang et.al. |
2308.10585v1 |
link |
2023-08-22 |
An Effective Method using Phrase Mechanism in Neural Machine Translation |
Phuong Minh Nguyen et.al. |
2308.10482v2 |
link |
2023-08-21 |
Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP Concepts |
Fan Gao et.al. |
2308.10410v1 |
link |
2023-08-20 |
How Good Are Large Language Models at Out-of-Distribution Detection? |
Bo Liu et.al. |
2308.10261v1 |
link |
2023-08-20 |
ChatEDA: A Large Language Model Powered Autonomous Agent for EDA |
Zhuolun He et.al. |
2308.10204v1 |
null |
2023-08-19 |
Deep Generative Modeling-based Data Augmentation with Demonstration using the BFBT Benchmark Void Fraction Datasets |
Farah Alsafadi et.al. |
2308.10120v1 |
null |
2023-08-19 |
FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models |
Liwen Zhang et.al. |
2308.09975v1 |
link |
2023-08-19 |
A Transformer-based Framework For Multi-variate Time Series: A Remaining Useful Life Prediction Use Case |
Oluwaseyi Ogunfowora et.al. |
2308.09884v1 |
null |
2023-08-19 |
Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders |
Jie Cheng et.al. |
2308.09882v1 |
link |
2023-08-18 |
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct |
Haipeng Luo et.al. |
2308.09583v1 |
link |
2023-08-18 |
Learnt Contrastive Concept Embeddings for Sign Recognition |
Ryan Wong et.al. |
2308.09515v1 |
null |
2023-08-18 |
Exploring Sampling Techniques for Generating Melodies with a Transformer Language Model |
Mathias Rose Bjare et.al. |
2308.09454v1 |
null |
2023-08-18 |
Differentiable Retrieval Augmentation via Generative Language Modeling for E-commerce Query Intent Classification |
Chenyu Zhao et.al. |
2308.09308v1 |
null |
2023-08-17 |
Characterizing Information Seeking Events in Health-Related Social Discourse |
Omar Sharif et.al. |
2308.09156v1 |
null |
2023-08-17 |
Enhancing API Documentation through BERTopic Modeling and Summarization |
AmirHossein Naghshzan et.al. |
2308.09070v1 |
link |
2023-08-17 |
Don't lose the message while paraphrasing: A study on content preserving style transfer |
Nikolay Babakov et.al. |
2308.09055v1 |
link |
2023-08-17 |
CodeCoT and Beyond: Learning to Program and Test like a Developer |
Dong Huang et.al. |
2308.08784v1 |
null |
2023-08-17 |
Real-Time Construction Algorithm of Co-Occurrence Network Based on Inverted Index |
Jiahao Cheng et.al. |
2308.08756v1 |
null |
2023-08-17 |
Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase Extraction |
Yuanzhen Luo et.al. |
2308.08739v1 |
null |
2023-08-16 |
Can Transformers Learn Optimal Filtering for Unknown Systems? |
Haldun Balim et.al. |
2308.08536v1 |
link |
2023-08-16 |
LLM4TS: Two-Stage Fine-Tuning for Time-Series Forecasting with Pre-Trained LLMs |
Ching Chang et.al. |
2308.08469v1 |
null |
2023-08-16 |
Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey |
Lovre Torbarina et.al. |
2308.08234v1 |
null |
2023-08-16 |
Fast Training of NMT Model with Data Sorting |
Daniela N. Rim et.al. |
2308.08153v1 |
null |
2023-08-15 |
Using Artificial Populations to Study Psychological Phenomena in Neural Models |
Jesse Roberts et.al. |
2308.08032v1 |
link |
2023-08-15 |
Through the Lens of Core Competency: Survey on Evaluation of Large Language Models |
Ziyu Zhuang et.al. |
2308.07902v1 |
null |
2023-08-15 |
Emotion Embeddings $\unicode{x2014}$ Learning Stable and Homogeneous Abstractions from Heterogeneous Affective Datasets |
Sven Buechel et.al. |
2308.07871v1 |
null |
2023-08-15 |
Attention Is Not All You Need Anymore |
Zhe Chen et.al. |
2308.07661v1 |
null |
2023-08-15 |
A Survey on Model Compression for Large Language Models |
Xunyu Zhu et.al. |
2308.07633v1 |
null |
2023-08-15 |
A User-Centered Evaluation of Spanish Text Simplification |
Adrian de Wynter et.al. |
2308.07556v1 |
link |
2023-08-14 |
Cross-Attribute Matrix Factorization Model with Shared User Embedding |
Wen Liang et.al. |
2308.07284v1 |
null |
2023-08-14 |
Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt Optimization for Few-shot Learning |
Chengzhengxu Li et.al. |
2308.07272v1 |
link |
2023-08-14 |
Human-centered NLP Fact-checking: Co-Designing with Fact-checkers using Matchmaking for AI |
Houjiang Liu et.al. |
2308.07213v1 |
null |
2023-08-14 |
Natural Language is All a Graph Needs |
Ruosong Ye et.al. |
2308.07134v1 |
link |
2023-08-15 |
Large Language Models for Information Retrieval: A Survey |
Yutao Zhu et.al. |
2308.07107v2 |
link |
2023-08-14 |
EcomGPT: Instruction-tuning Large Language Model with Chain-of-Task Tasks for E-commerce |
Yangning Li et.al. |
2308.06966v1 |
link |
2023-08-14 |
GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text |
Pengfei Liu et.al. |
2308.06911v1 |
link |
2023-08-13 |
An Ensemble Approach to Question Classification: Integrating Electra Transformer, GloVe, and LSTM |
Sanad Aburass et.al. |
2308.06828v1 |
null |
2023-08-13 |
AerialVLN: Vision-and-Language Navigation for UAVs |
Shubo Liu et.al. |
2308.06735v1 |
link |
2023-08-12 |
Copilot Security: A User Study |
Owura Asare et.al. |
2308.06587v1 |
link |
2023-08-11 |
KETM:A Knowledge-Enhanced Text Matching method |
Kexin Jiang et.al. |
2308.06235v1 |
link |
2023-08-11 |
Large Language Models for Telecom: Forthcoming Impact on the Industry |
Ali Maatouk et.al. |
2308.06013v1 |
null |
2023-08-10 |
LASIGE and UNICAGE solution to the NASA LitCoin NLP Competition |
Pedro Ruas et.al. |
2308.05609v1 |
null |
2023-08-10 |
Bringing order into the realm of Transformer-based language models for artificial intelligence and law |
Candida M. Greco et.al. |
2308.05502v1 |
null |
2023-08-11 |
Exploring Machine Learning and Transformer-based Approaches for Deceptive Text Classification: A Comparative Analysis |
Anusuya Krishnan et.al. |
2308.05476v2 |
null |
2023-08-10 |
From CNN to Transformer: A Review of Medical Image Segmentation Models |
Wenjian Yao et.al. |
2308.05305v1 |
null |
2023-08-09 |
A Novel Method for improving accuracy in neural network by reinstating traditional back propagation technique |
Gokulprasath R et.al. |
2308.05059v1 |
null |
2023-08-09 |
Performance Analysis of Transformer Based Models (BERT, ALBERT and RoBERTa) in Fake News Detection |
Shafna Fitria Nur Azizah et.al. |
2308.04950v1 |
link |
2023-08-09 |
An Empirical Study on Using Large Language Models to Analyze Software Supply Chain Security Failures |
Tanmay Singla et.al. |
2308.04898v1 |
null |
2023-08-09 |
No Need to Lift a Finger Anymore? Assessing the Quality of Code Generation by ChatGPT |
Zhijie Liu et.al. |
2308.04838v1 |
null |
2023-08-09 |
TSSR: A Truncated and Signed Square Root Activation Function for Neural Networks |
Yuanhao Gong et.al. |
2308.04832v1 |
null |
2023-08-09 |
Optimizing a Transformer-based network for a deep learning seismic processing workflow |
Randy Harsuko et.al. |
2308.04739v1 |
null |
2023-08-09 |
A Comparative Study of Open-Source Large Language Models, GPT-4 and Claude 2: Multiple-Choice Test Taking in Nephrology |
Sean Wu et.al. |
2308.04709v1 |
null |
2023-08-09 |
Cross-Lingual Constituency Parsing for Middle High German: A Delexicalized Approach |
Ercong Nie et.al. |
2308.04645v1 |
null |
2023-08-08 |
Unmasking Nationality Bias: A Study of Human Perception of Nationalities in AI-Generated Articles |
Pranav Narayanan Venkit et.al. |
2308.04346v1 |
null |
2023-08-08 |
Deep Learning-Based Knowledge Injection for Metaphor Detection: A Comprehensive Review |
Cheng Yang et.al. |
2308.04306v1 |
null |
2023-08-08 |
CLASSLA-Stanza: The Next Step for Linguistic Processing of South Slavic Languages |
Luka Terčon et.al. |
2308.04255v1 |
link |
2023-08-08 |
Assistive Chatbots for healthcare: a succinct review |
Basabdatta Sen Bhattacharya et.al. |
2308.04178v1 |
null |
2023-08-08 |
I-WAS: a Data Augmentation Method with GPT-2 for Simile Detection |
Yongzhu Chang et.al. |
2308.04109v1 |
null |
2023-08-08 |
Portrayal: Leveraging NLP and Visualization for Analyzing Fictional Characters |
Md Naimul Hoque et.al. |
2308.04056v1 |
null |
2023-08-08 |
A Comparative Study on TF-IDF feature Weighting Method and its Analysis using Unstructured Dataset |
Mamata Das et.al. |
2308.04037v1 |
null |
2023-08-08 |
AI Chatbots as Multi-Role Pedagogical Agents: Transforming Engagement in CS Education |
Cassie Chen Cao et.al. |
2308.03992v1 |
null |
2023-08-07 |
Extracting detailed oncologic history and treatment plan from medical oncology notes with large language models |
Madhumita Sushil et.al. |
2308.03853v1 |
link |
2023-08-07 |
"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models |
Xinyue Shen et.al. |
2308.03825v1 |
link |
2023-08-07 |
RCMHA: Relative Convolutional Multi-Head Attention for Natural Language Modelling |
Herman Sugiharto et.al. |
2308.03429v1 |
link |
2023-08-07 |
TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents |
Jingqing Ruan et.al. |
2308.03427v1 |
null |
2023-08-07 |
Symmetry-Preserving Program Representations for Learning Code Semantics |
Kexin Pei et.al. |
2308.03312v1 |
null |
2023-08-07 |
From Ambiguity to Explicitness: NLP-Assisted 5G Specification Abstraction for Formal Analysis |
Shiyu Yuan et.al. |
2308.03277v1 |
null |
2023-08-07 |
Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion Mining |
Nour Eddine Zekaoui et.al. |
2308.03235v1 |
link |
2023-08-06 |
Average-Hard Attention Transformers are Constant-Depth Uniform Threshold Circuits |
Lena Strobl et.al. |
2308.03212v1 |
null |
2023-08-04 |
Meta-Tsallis-Entropy Minimization: A New Self-Training Approach for Domain Adaptation on Text Classification |
Menglong Lu et.al. |
2308.02746v1 |
null |
2023-08-04 |
Universal Approximation of Linear Time-Invariant (LTI) Systems through RNNs: Power of Randomness in Reservoir Computing |
Shashank Jere et.al. |
2308.02464v1 |
null |
2023-08-04 |
Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from Text |
Nandana Mihindukulasooriya et.al. |
2308.02357v1 |
link |
2023-08-04 |
Sinhala-English Parallel Word Dictionary Dataset |
Kasun Wickramasinghe et.al. |
2308.02234v1 |
link |
2023-08-04 |
Explaining Relation Classification Models with Semantic Extents |
Lars Klöser et.al. |
2308.02193v1 |
link |
2023-08-04 |
From Fake to Hyperpartisan News Detection Using Domain Adaptation |
Răzvan-Alexandru Smădu et.al. |
2308.02185v1 |
null |
2023-08-04 |
ParaFuzz: An Interpretability-Driven Technique for Detecting Poisoned Samples in NLP |
Lu Yan et.al. |
2308.02122v1 |
null |
2023-08-04 |
Model Provenance via Model DNA |
Xin Mu et.al. |
2308.02121v1 |
null |
2023-08-03 |
Causality Guided Disentanglement for Cross-Platform Hate Speech Detection |
Paras Sheth et.al. |
2308.02080v1 |
link |
2023-08-03 |
Accurate Neural Network Pruning Requires Rethinking Sparse Optimization |
Denis Kuznedelev et.al. |
2308.02060v1 |
null |
2023-08-03 |
Seasonality Based Reranking of E-commerce Autocomplete Using Natural Language Queries |
Prateek Verma et.al. |
2308.02055v1 |
null |
2023-08-03 |
Tag Prediction of Competitive Programming Problems using Deep Learning Techniques |
Taha Lokat et.al. |
2308.01863v1 |
null |
2023-08-03 |
XNLP: An Interactive Demonstration System for Universal Structured NLP |
Hao Fei et.al. |
2308.01846v1 |
null |
2023-08-03 |
Lexicon and Rule-based Word Lemmatization Approach for the Somali Language |
Shafie Abdi Mohamed et.al. |
2308.01785v1 |
link |
2023-08-03 |
Does Correction Remain An Problem For Large Language Models? |
Xiaowu Zhang et.al. |
2308.01776v1 |
null |
2023-08-03 |
NBIAS: A Natural Language Processing Framework for Bias Identification in Text |
Shaina Razaa et.al. |
2308.01681v1 |
null |
2023-08-03 |
Holy Grail 2.0: From Natural Language to Constraint Models |
Dimos Tsouros et.al. |
2308.01589v1 |
null |
2023-08-03 |
Large Language Model Displays Emergent Ability to Interpret Novel Literary Metaphors |
Nicholas Ichien et.al. |
2308.01497v1 |
null |
2023-08-02 |
Manual Tests Do Smell! Cataloging and Identifying Natural Language Test Smells |
Elvys Soares et.al. |
2308.01386v1 |
link |
2023-08-02 |
Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification |
Laurin Wagner et.al. |
2308.01327v1 |
null |
2023-08-02 |
ADS-Cap: A Framework for Accurate and Diverse Stylized Captioning with Unpaired Stylistic Corpora |
Kanzhi Cheng et.al. |
2308.01143v1 |
link |
2023-08-02 |
Feature-aware conditional GAN for category text generation |
Xinze Li et.al. |
2308.00939v1 |
null |
2023-07-31 |
Predicting masked tokens in stochastic locations improves masked image modeling |
Amir Bar et.al. |
2308.00566v1 |
null |
2023-08-01 |
Discourse-Aware Text Simplification: From Complex Sentences to Linked Propositions |
Christina Niklaus et.al. |
2308.00425v1 |
null |
2023-08-01 |
LimeAttack: Local Explainable Method for Textual Hard-Label Adversarial Attack |
Hai Zhu et.al. |
2308.00319v1 |
link |
2023-08-01 |
LGViT: Dynamic Early Exiting for Accelerating Vision Transformer |
Guanyu Xu et.al. |
2308.00255v1 |
null |
2023-07-31 |
Adversarially Robust Neural Legal Judgement Systems |
Rohit Raj et.al. |
2308.00165v1 |
null |
2023-07-31 |
Structural Transfer Learning in NL-to-Bash Semantic Parsers |
Kyle Duffy et.al. |
2307.16795v1 |
null |
2023-08-02 |
LLMs4OL: Large Language Models for Ontology Learning |
Hamed Babaei Giglou et.al. |
2307.16648v2 |
link |
2023-07-31 |
Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks |
Xinyu Zhang et.al. |
2307.16630v1 |
null |
2023-07-31 |
Toward Quantum Machine Translation of Syntactically Distinct Languages |
Mina Abbaszade et.al. |
2307.16576v1 |
null |
2023-07-31 |
AMOE: a Tool to Automatically Extract and Assess Organizational Evidence for Continuous Cloud Audit |
Franz Deimling et.al. |
2307.16541v1 |
null |
2023-07-31 |
A Benchmark for Understanding Dialogue Safety in Mental Health Support |
Huachuan Qiu et.al. |
2307.16457v1 |
link |
2023-07-31 |
Camoscio: an Italian Instruction-tuned LLaMA |
Andrea Santilli et.al. |
2307.16456v1 |
link |
2023-07-31 |
LP-MusicCaps: LLM-Based Pseudo Music Captioning |
SeungHeon Doh et.al. |
2307.16372v1 |
link |
2023-07-30 |
Self-Supervised Learning of Gait-Based Biomarkers |
R. James Cotton et.al. |
2307.16321v1 |
null |
2023-07-30 |
Text Analysis Using Deep Neural Networks in Digital Humanities and Information Science |
Omri Suissa et.al. |
2307.16217v1 |
null |
2023-07-28 |
Universal Recurrent Event Memories for Streaming Data |
Ran Dou et.al. |
2307.15694v1 |
null |
2023-07-28 |
BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering |
Khiem Vinh Tran et.al. |
2307.15335v1 |
null |
2023-07-28 |
TrafficSafetyGPT: Tuning a Pre-trained Large Language Model to a Domain-Specific Expert in Transportation Safety |
Ou Zheng et.al. |
2307.15311v1 |
link |
2023-07-27 |
f-Divergence Minimization for Sequence-Level Knowledge Distillation |
Yuqiao Wen et.al. |
2307.15190v1 |
link |
2023-07-27 |
Text-guided Foundation Model Adaptation for Pathological Image Classification |
Yunkun Zhang et.al. |
2307.14901v1 |
link |
2023-07-27 |
Improving Natural Language Inference in Arabic using Transformer Models and Linguistically Informed Pre-Training |
Mohammad Majd Saad Al Deen et.al. |
2307.14666v1 |
link |
2023-07-27 |
Metric-Based In-context Learning: A Case Study in Text Simplification |
Subha Vadlamannati et.al. |
2307.14632v1 |
link |
2023-07-27 |
Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models |
Yuchi Qiu et.al. |
2307.14587v1 |
null |
2023-07-26 |
Words That Stick: Predicting Decision Making and Synonym Engagement Using Cognitive Biases and Computational Linguistics |
Nimrod Dvir et.al. |
2307.14511v1 |
null |
2023-07-26 |
A Predictive Model of Digital Information Engagement: Forecasting User Engagement With English Words by Incorporating Cognitive Biases, Computational Linguistics and Natural Language Processing |
Nimrod Dvir et.al. |
2307.14500v1 |
null |
2023-07-26 |
TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning |
Yury Gorishniy et.al. |
2307.14338v1 |
link |
2023-07-26 |
Comparative Analysis of Libraries for the Sentimental Analysis |
Wendy Ccoya et.al. |
2307.14311v1 |
null |
2023-07-26 |
Mining Reddit Data to Elicit Students' Requirements During COVID-19 Pandemic |
Shadikur Rahman et.al. |
2307.14212v1 |
null |
2023-07-26 |
A semantics-driven methodology for high-quality image annotation |
Fausto Giunchiglia et.al. |
2307.14119v1 |
null |
2023-07-26 |
Decoding ChatGPT: A Taxonomy of Existing Research, Current Challenges, and Possible Future Directions |
Shahab Saquib Sohail et.al. |
2307.14107v1 |
null |
2023-07-25 |
Evaluating Large Language Models for Radiology Natural Language Processing |
Zhengliang Liu et.al. |
2307.13693v1 |
link |
2023-07-25 |
Multilevel Large Language Models for Everyone |
Yuanhao Gong et.al. |
2307.13221v1 |
null |
2023-07-24 |
Explaining Math Word Problem Solvers |
Abby Newcomb et.al. |
2307.13128v1 |
null |
2023-07-24 |
Making Metadata More FAIR Using Large Language Models |
Sowmya S. Sundaram et.al. |
2307.13085v1 |
null |
2023-07-24 |
A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models |
Jindong Gu et.al. |
2307.12980v1 |
link |
2023-07-24 |
Aligning Large Language Models with Human: A Survey |
Yufei Wang et.al. |
2307.12966v1 |
link |
2023-07-24 |
Concept-based explainability for an EEG transformer model |
Anders Gjølbye Madsen et.al. |
2307.12745v1 |
link |
2023-07-23 |
Transformer-based Joint Source Channel Coding for Textual Semantic Communication |
Shicong Liu et.al. |
2307.12266v1 |
null |
2023-07-22 |
A Zero-shot and Few-shot Study of Instruction-Finetuned Large Language Models Applied to Clinical and Biomedical Tasks |
Yanis Labrak et.al. |
2307.12114v1 |
null |
2023-07-22 |
Sparse then Prune: Toward Efficient Vision Transformers |
Yogi Prasetyo et.al. |
2307.11988v1 |
link |
2023-07-22 |
HIQL: Offline Goal-Conditioned RL with Latent States as Actions |
Seohong Park et.al. |
2307.11949v1 |
link |
2023-07-21 |
Multimodal Document Analytics for Banking Process Automation |
Christopher Gerling et.al. |
2307.11845v1 |
null |
2023-07-21 |
Advancing Visual Grounding with Scene Knowledge: Benchmark and Method |
Zhihong Chen et.al. |
2307.11558v1 |
link |
2023-07-21 |
YOLOPose V2: Understanding and Improving Transformer-based 6D Pose Estimation |
Arul Selvam Periyasamy et.al. |
2307.11550v1 |
null |
2023-07-21 |
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation |
Zunnan Xu et.al. |
2307.11545v1 |
link |
2023-07-20 |
A Systematic Evaluation of Federated Learning on Biomedical Natural Language Processing |
Le Peng et.al. |
2307.11254v1 |
link |
2023-07-20 |
Extreme Multi-Label Skill Extraction Training using Large Language Models |
Jens-Joris Decorte et.al. |
2307.10778v1 |
null |
2023-07-20 |
A Dataset and Strong Baselines for Classification of Czech News Texts |
Hynek Kydlíček et.al. |
2307.10666v1 |
link |
2023-07-20 |
Exploring the Landscape of Natural Language Processing Research |
Tim Schopf et.al. |
2307.10652v1 |
link |
2023-07-20 |
Instruction-following Evaluation through Verbalizer Manipulation |
Shiyang Li et.al. |
2307.10558v1 |
null |
2023-07-19 |
Mood Classification of Bangla Songs Based on Lyrics |
Maliha Mahajebin et.al. |
2307.10314v1 |
null |
2023-07-19 |
Alzheimer's Disease Detection from Spontaneous Speech and Text: A review |
Vrindha M. K. et.al. |
2307.10005v1 |
null |
2023-07-19 |
Large Language Models can accomplish Business Process Management Tasks |
Michael Grohs et.al. |
2307.09923v1 |
null |
2023-07-19 |
Chit-Chat or Deep Talk: Prompt Engineering for Process Mining |
Urszula Jessen et.al. |
2307.09909v1 |
null |
2023-07-19 |
Test-takers have a say: understanding the implications of the use of AI in language tests |
Dawen Zhang et.al. |
2307.09885v1 |
null |
2023-07-19 |
Enhancing conversational quality in language learning chatbots: An evaluation of GPT4 for ASR error correction |
Long Mai et.al. |
2307.09744v1 |
null |
2023-07-19 |
Improving Domain Generalization for Sound Classification with Sparse Frequency-Regularized Transformer |
Honglin Mu et.al. |
2307.09723v1 |
link |
2023-07-19 |
Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation |
Hao Peng et.al. |
2307.09701v1 |
null |
2023-07-18 |
Can Model Fusing Help Transformers in Long Document Classification? An Empirical Study |
Damith Premasiri et.al. |
2307.09532v1 |
link |
2023-07-18 |
Scaling Laws for Imitation Learning in NetHack |
Jens Tuyls et.al. |
2307.09423v1 |
null |
2023-07-18 |
UniTabE: Pretraining a Unified Tabular Encoder for Heterogeneous Tabular Data |
Yazheng Yang et.al. |
2307.09249v1 |
null |
2023-07-18 |
Mitigating masked pixels in climate-critical datasets |
Angelina Agabin et.al. |
2307.09227v1 |
null |
2023-07-18 |
Automated Ableism: An Exploration of Explicit Disability Biases in Sentiment and Toxicity Analysis Models |
Pranav Narayanan Venkit et.al. |
2307.09209v1 |
null |
2023-07-18 |
Unveiling Gender Bias in Terms of Profession Across LLMs: Analyzing and Addressing Sociological Implications |
Vishesh Thakur et.al. |
2307.09162v1 |
null |
2023-07-18 |
R-Cut: Enhancing Explainability in Vision Transformers with Relationship Weighted Out and Cut |
Yingjie Niu et.al. |
2307.09050v1 |
null |
2023-07-18 |
On the (In)Effectiveness of Large Language Models for Chinese Text Correction |
Yinghui Li et.al. |
2307.09007v1 |
null |
2023-07-18 |
NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning |
Tianxin Wei et.al. |
2307.08941v1 |
link |
2023-07-18 |
Teach model to answer questions after comprehending the document |
Ruiqing Sun et.al. |
2307.08931v1 |
null |
2023-07-17 |
Harnessing the Power of AI based Image Generation Model DALLE 2 in Agricultural Settings |
Ranjan Sapkota et.al. |
2307.08789v1 |
null |
2023-07-17 |
COLLIE: Systematic Construction of Constrained Text Generation Tasks |
Shunyu Yao et.al. |
2307.08689v1 |
link |
2023-07-17 |
Utilization of Pre-trained Language Model for Adapter-based Knowledge Transfer in Software Engineering |
Iman Saberi et.al. |
2307.08540v1 |
null |
2023-07-17 |
BUS:Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization |
Chaoya Jiang et.al. |
2307.08504v1 |
null |
2023-07-17 |
On the application of Large Language Models for language teaching and assessment technology |
Andrew Caines et.al. |
2307.08393v1 |
null |
2023-07-16 |
Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling |
Longyue Wang et.al. |
2307.08074v1 |
null |
2023-07-16 |
Fast Quantum Algorithm for Attention Computation |
Yeqi Gao et.al. |
2307.08045v1 |
null |
2023-07-16 |
A Survey of Techniques for Optimizing Transformer Inference |
Krishna Teja Chitty-Venkata et.al. |
2307.07982v1 |
null |
2023-07-15 |
AspectCSE: Sentence Embeddings for Aspect-based Semantic Textual Similarity using Contrastive Learning and Structured Knowledge |
Tim Schopf et.al. |
2307.07851v1 |
null |
2023-07-15 |
Improving Trace Link Recommendation by Using Non-Isotropic Distances and Combinations |
Christof Tinnes et.al. |
2307.07781v1 |
null |
2023-07-15 |
Leveraging Large Language Models to Generate Answer Set Programs |
Adam Ishay et.al. |
2307.07699v1 |
link |
2023-07-14 |
Investigating ChatGPT's Potential to Assist in Requirements Elicitation Processes |
Krishna Ronanki et.al. |
2307.07381v1 |
null |
2023-07-14 |
AIC-AB NET: A Neural Network for Image Captioning with Spatial Attention and Text Attributes |
Guoyun Tu et.al. |
2307.07370v1 |
null |
2023-07-14 |
A scoping review on multimodal deep learning in biomedical images and texts |
Zhaoyi Sun et.al. |
2307.07362v1 |
null |
2023-07-14 |
MaxSR: Image Super-Resolution Using Improved MaxViT |
Bincheng Yang et.al. |
2307.07240v1 |
null |
2023-07-14 |
Software Testing with Large Language Model: Survey, Landscape, and Vision |
Junjie Wang et.al. |
2307.07221v1 |
null |
2023-07-13 |
Making the Most Out of the Limited Context Length: Predictive Power Varies with Clinical Note Type and Note Section |
Hongyi Zheng et.al. |
2307.07051v1 |
null |
2023-07-13 |
Parmesan: mathematical concept extraction for education |
Jacob Collard et.al. |
2307.06699v1 |
null |
2023-07-13 |
Going Beyond Local: Global Graph-Enhanced Personalized News Recommendations |
Boming Yang et.al. |
2307.06576v1 |
link |
2023-07-13 |
Convolutional Neural Networks for Sentiment Analysis on Weibo Data: A Natural Language Processing Approach |
Yufei Xie et.al. |
2307.06540v1 |
null |
2023-07-13 |
Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study |
Zeping Min et.al. |
2307.06530v1 |
null |
2023-07-12 |
Transformers in Reinforcement Learning: A Survey |
Pranav Agarwal et.al. |
2307.05979v1 |
null |
2023-07-11 |
Machine Learning Study of the Extended Drug-target Interaction Network informed by Pain Related Voltage-Gated Sodium Channels |
Long Chen et.al. |
2307.05794v1 |
link |
2023-07-10 |
Exploring Large Language Model for Graph Data Understanding in Online Job Recommendations |
Likang Wu et.al. |
2307.05722v1 |
link |
2023-07-11 |
Objaverse-XL: A Universe of 10M+ 3D Objects |
Matt Deitke et.al. |
2307.05663v1 |
null |
2023-07-10 |
Hate Speech Detection via Dual Contrastive Learning |
Junyu Lu et.al. |
2307.05578v1 |
null |
2023-07-11 |
GujiBERT and GujiGPT: Construction of Intelligent Information Processing Foundation Language Models for Ancient Texts |
Dongbo Wang et.al. |
2307.05354v1 |
null |
2023-07-11 |
On the Effectiveness of Speech Self-supervised Learning for Music |
Yinghao Ma et.al. |
2307.05161v1 |
null |
2023-07-11 |
Hybrid hidden Markov LSTM for short-term traffic flow prediction |
Agnimitra Sengupta et.al. |
2307.04954v1 |
null |
2023-07-10 |
Entity Identifier: A Natural Text Parsing-based Framework For Entity Relation Extraction |
El Mehdi Chouham et.al. |
2307.04892v1 |
null |
2023-07-10 |
COMEX: A Tool for Generating Customized Source Code Representations |
Debeshee Das et.al. |
2307.04693v1 |
link |
2023-07-10 |
Search-time Efficient Device Constraints-Aware Neural Architecture Search |
Oshin Dutta et.al. |
2307.04443v1 |
null |
2023-07-10 |
Privacy-Preserving Graph Machine Learning from Data to Computation: A Survey |
Dongqi Fu et.al. |
2307.04338v1 |
null |
2023-07-10 |
CT-BERT: Learning Better Tabular Representations Through Cross-Table Pre-training |
Chao Ye et.al. |
2307.04308v1 |
link |
2023-07-09 |
ChatGPT in the Age of Generative AI and Large Language Models: A Concise Survey |
Salman Mohamadi et.al. |
2307.04251v1 |
link |
2023-07-09 |
A Novel Pipeline for Improving Optical Character Recognition through Post-processing Using Natural Language Processing |
Aishik Rakshit et.al. |
2307.04245v1 |
null |
2023-07-09 |
Can Generative Large Language Models Perform ASR Error Correction? |
Rao Ma et.al. |
2307.04172v1 |
null |
2023-07-09 |
Dream Content Discovery from Reddit with an Unsupervised Mixed-Method Approach |
Anubhab Das et.al. |
2307.04167v1 |
null |
2023-07-09 |
DebateKG: Automatic Policy Debate Case Creation with Semantic Knowledge Graphs |
Allen Roush et.al. |
2307.04090v1 |
link |
2023-07-08 |
Evaluating the Capability of Large-scale Language Models on Chinese Grammatical Error Correction Task |
Fanyi Qu et.al. |
2307.03972v1 |
null |
2023-07-07 |
ITA: An Energy-Efficient Attention and Softmax Accelerator for Quantized Transformers |
Gamze İslamoğlu et.al. |
2307.03493v1 |
null |
2023-07-06 |
Vision Language Transformers: A Survey |
Clayton Fields et.al. |
2307.03254v1 |
null |
2023-07-06 |
BrickPal: Augmented Reality-based Assembly Instructions for Brick Models |
Yao Shi et.al. |
2307.03162v1 |
null |
2023-07-06 |
A Survey on Evaluation of Large Language Models |
Yupeng Chang et.al. |
2307.03109v1 |
link |
2023-07-06 |
Efficient Domain Adaptation of Sentence Embeddings using Adapters |
Tim Schopf et.al. |
2307.03104v1 |
link |
2023-07-06 |
Efficient Semiring-Weighted Earley Parsing |
Andreas Opedal et.al. |
2307.02982v1 |
link |
2023-07-06 |
UIT-Saviors at MEDVQA-GI 2023: Improving Multimodal Learning with Image Enhancement for Gastrointestinal Visual Question Answering |
Triet M. Thai et.al. |
2307.02783v1 |
null |
2023-07-05 |
Unsupervised Sentiment Analysis of Plastic Surgery Social Media Posts |
Alexandrea K. Ramnarine et.al. |
2307.02640v1 |
null |
2023-07-05 |
ODD: A Benchmark Dataset for the NLP-based Opioid Related Aberrant Behavior Detection |
Sunjae Kwon et.al. |
2307.02591v1 |
link |
2023-07-05 |
Sumformer: Universal Approximation for Efficient Transformers |
Silas Alberti et.al. |
2307.02301v1 |
null |
2023-07-05 |
Make A Long Image Short: Adaptive Token Length for Vision Transformers |
Qiqi Zhou et.al. |
2307.02092v1 |
null |
2023-07-05 |
Emoji Prediction using Transformer Models |
Muhammad Osama Nusrat et.al. |
2307.02054v1 |
link |
2023-07-05 |
Recommender Systems in the Era of Large Language Models (LLMs) |
Wenqi Fan et.al. |
2307.02046v1 |
null |
2023-07-04 |
RRCNN: A novel signal decomposition approach based on recurrent residue convolutional neural network |
Feng Zhou et.al. |
2307.01725v1 |
link |
2023-07-04 |
A Language Model for Grammatical Error Correction in L2 Russian |
Nikita Remnev et.al. |
2307.01609v1 |
null |
2023-07-04 |
Learning to Prompt in the Classroom to Understand AI Limits: A pilot study |
Emily Theophilou et.al. |
2307.01540v1 |
null |
2023-07-04 |
All in One: Multi-task Prompting for Graph Neural Networks |
Xiangguo Sun et.al. |
2307.01504v1 |
link |
2023-07-04 |
On Evaluating and Mitigating Gender Biases in Multilingual Settings |
Aniket Vashishtha et.al. |
2307.01503v1 |
null |
2023-07-04 |
SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification |
Junjie Wu et.al. |
2307.01488v1 |
null |
2023-07-03 |
Improving Language Plasticity via Pretraining with Active Forgetting |
Yihong Chen et.al. |
2307.01163v1 |
null |
2023-07-03 |
Exploring the In-context Learning Ability of Large Language Model for Biomedical Concept Linking |
Qinyong Wang et.al. |
2307.01137v1 |
null |
2023-07-03 |
Challenges in Domain-Specific Abstractive Summarization and How to Overcome them |
Anum Afzal et.al. |
2307.00963v1 |
null |
2023-07-03 |
Automatic Design of Semantic Similarity Ensembles Using Grammatical Evolution |
Jorge Martinez-Gil et.al. |
2307.00925v1 |
link |
2023-07-03 |
Contextual Prompt Learning for Vision-Language Understanding |
Koustava Goswami et.al. |
2307.00910v1 |
null |
2023-07-03 |
Element similarity in high-dimensional materials representations |
Anthony Onwuli et.al. |
2307.00784v1 |
null |
2023-07-02 |
Neuro-Symbolic Sudoku Solver |
Ashutosh Hathidara et.al. |
2307.00653v1 |
link |
2023-07-02 |
Text based Large Language Model for Recommendation |
Jianchao Ji et.al. |
2307.00457v1 |
link |
2023-07-02 |
Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal Data |
Xinzhe Li et.al. |
2307.00456v1 |
link |
2023-07-01 |
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency |
Yan Wang et.al. |
2307.00280v1 |
null |
2023-06-30 |
Towards Improving the Performance of Pre-Trained Speech Models for Low-Resource Languages Through Lateral Inhibition |
Andrei-Marius Avram et.al. |
2306.17792v1 |
null |
2023-06-30 |
Augmenting Holistic Review in University Admission using Natural Language Processing for Essays and Recommendation Letters |
Jinsook Lee et.al. |
2306.17575v1 |
null |
2023-06-30 |
A Cost-aware Study of Depression Language on Social Media using Topic and Affect Contextualization |
Andrea Laguna et.al. |
2306.17564v1 |
null |
2023-06-30 |
GPT-FinRE: In-context Learning for Financial Relation Extraction using Large Language Models |
Pawan Kumar Rajpoot et.al. |
2306.17519v1 |
link |
2023-06-29 |
Prediction of COVID-19 Patients' Emergency Room Revisit using Multi-Source Transfer Learning |
Yuelyu Ji et.al. |
2306.17257v1 |
null |
2023-06-29 |
Towards Grammatical Tagging for the Legal Language of Cybersecurity |
Gianpietro Castiglione et.al. |
2306.17042v1 |
null |
2023-06-29 |
Benchmarking Large Language Model Capabilities for Conditional Generation |
Joshua Maynez et.al. |
2306.16793v1 |
null |
2023-06-29 |
Principles and Guidelines for Evaluating Social Robot Navigation Algorithms |
Anthony Francis et.al. |
2306.16740v1 |
null |
2023-06-29 |
Beyond CO2 Emissions: The Overlooked Impact of Water Consumption of Information Retrieval Models |
Guido Zuccon et.al. |
2306.16668v1 |
link |
2023-06-28 |
An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs |
Haihao Shen et.al. |
2306.16601v1 |
link |
2023-06-28 |
Multi-Site Clinical Federated Learning using Recursive and Attentive Models and NVFlare |
Won Joon Yun et.al. |
2306.16367v1 |
null |
2023-06-28 |
cuSLINK: Single-linkage Agglomerative Clustering on the GPU |
Corey J. Nolet et.al. |
2306.16354v1 |
link |
2023-06-28 |
Generative User-Experience Research for Developing Domain-specific Natural Language Processing Applications |
Anastasia Zhukova et.al. |
2306.16143v1 |
null |
2023-06-28 |
ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases |
Jiaxi Cui et.al. |
2306.16092v1 |
link |
2023-06-28 |
Sentence-to-Label Generation Framework for Multi-task Learning of Japanese Sentence Classification and Named Entity Recognition |
Chengguang Gan et.al. |
2306.15978v1 |
link |
2023-06-28 |
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias |
Yue Yu et.al. |
2306.15895v1 |
link |
2023-06-27 |
MAT: Mixed-Strategy Game of Adversarial Training in Fine-tuning |
Zhehua Zhong et.al. |
2306.15826v1 |
null |
2023-06-27 |
To Spike or Not To Spike: A Digital Hardware Perspective on Deep Learning Acceleration |
Fabrizio Ottati et.al. |
2306.15749v1 |
link |
2023-06-27 |
Exploring Durham University Physics exams with Large Language Models |
Will Yeadon et.al. |
2306.15609v1 |
link |
2023-06-27 |
Using Large Language Models to Provide Explanatory Feedback to Human Tutors |
Jionghao Lin et.al. |
2306.15498v1 |
null |
2023-06-27 |
Gender Bias in BERT -- Measuring and Analysing Biases through Sentiment Rating in a Realistic Downstream Classification Task |
Sophie Jentzsch et.al. |
2306.15298v1 |
null |
2023-06-28 |
Investigating Cross-Domain Behaviors of BERT in Review Understanding |
Albert Lu et.al. |
2306.15123v2 |
null |
2023-06-26 |
FeedbackMap: a tool for making sense of open-ended survey responses |
Doug Beeferman et.al. |
2306.15112v1 |
link |
2023-06-26 |
LM4HPC: Towards Effective Language Model Application in High-Performance Computing |
Le Chen et.al. |
2306.14979v1 |
null |
2023-06-26 |
The Art of Embedding Fusion: Optimizing Hate Speech Detection |
Mohammad Aflah Khan et.al. |
2306.14939v1 |
link |
2023-06-26 |
Learning to Modulate pre-trained Models in RL |
Thomas Schmied et.al. |
2306.14884v1 |
link |
2023-06-26 |
Enriching the NArabizi Treebank: A Multifaceted Approach to Supporting an Under-Resourced Language |
Riabi Arij et.al. |
2306.14866v1 |
null |
2023-06-26 |
Inter-Annotator Agreement in the Wild: Uncovering Its Emerging Roles and Considerations in Real-World Scenarios |
NamHyeok Kim et.al. |
2306.14373v1 |
null |
2023-06-25 |
Revolutionizing Cyber Threat Detection with Large Language Models |
Mohamed Amine Ferrag et.al. |
2306.14263v1 |
null |
2023-06-25 |
Towards Trustworthy Explanation: On Causal Rationalization |
Wenbo Zhang et.al. |
2306.14115v1 |
link |
2023-06-25 |
Chinese Fine-Grained Financial Sentiment Analysis with Large Language Models |
Yinyu Lan et.al. |
2306.14096v1 |
link |
2023-06-24 |
On the Uses of Large Language Models to Interpret Ambiguous Cyberattack Descriptions |
Reza Fayyazi et.al. |
2306.14062v1 |
null |
2023-06-24 |
Comparison of Pre-trained Language Models for Turkish Address Parsing |
Muhammed Cihat Ünal et.al. |
2306.13947v1 |
null |
2023-06-24 |
Large Sequence Models for Sequential Decision-Making: A Survey |
Muning Wen et.al. |
2306.13945v1 |
null |
2023-06-24 |
Spatio-temporal Storytelling? Leveraging Generative Models for Semantic Trajectory Analysis |
Shreya Ghosh et.al. |
2306.13905v1 |
null |
2023-06-23 |
Knowledge-Infused Self Attention Transformers |
Kaushik Roy et.al. |
2306.13501v1 |
null |
2023-06-23 |
Abstractive Text Summarization for Resumes With Cutting Edge NLP Transformers and LSTM |
Öykü Berfin Mercan et.al. |
2306.13315v1 |
null |
2023-06-22 |
Prompt to GPT-3: Step-by-Step Thinking Instructions for Humor Generation |
Yuetian Chen et.al. |
2306.13195v1 |
link |
2023-06-22 |
On Hate Scaling Laws For Data-Swamps |
Abeba Birhane et.al. |
2306.13141v1 |
link |
2023-06-22 |
Named entity recognition in resumes |
Ege Kesim et.al. |
2306.13062v1 |
null |
2023-06-22 |
Tracking public attitudes toward ChatGPT on Twitter using sentiment analysis and topic modeling |
Ratanond Koonchanok et.al. |
2306.12951v1 |
link |
2023-06-22 |
Cross-lingual Cross-temporal Summarization: Dataset, Models, Evaluation |
Ran Zhang et.al. |
2306.12916v1 |
link |
2023-06-22 |
Natural Language Processing in Electronic Health Records in Relation to Healthcare Decision-making: A Systematic Review |
Elias Hossain et.al. |
2306.12834v1 |
null |
2023-06-22 |
Instruct-FinGPT: Financial Sentiment Analysis by Instruction Tuning of General-Purpose Large Language Models |
Boyu Zhang et.al. |
2306.12659v1 |
null |
2023-06-22 |
Identifying and Extracting Rare Disease Phenotypes with Large Language Models |
Cathy Shyr et.al. |
2306.12656v1 |
link |
2023-06-21 |
SIFTER: A Task-specific Alignment Strategy for Enhancing Sentence Embeddings |
Chao Yu et.al. |
2306.12280v1 |
null |
2023-06-21 |
What Constitutes Good Contrastive Learning in Time-Series Forecasting? |
Chiyu Zhang et.al. |
2306.12086v1 |
null |
2023-06-21 |
Task-Robust Pre-Training for Worst-Case Downstream Adaptation |
Jianghui Wang et.al. |
2306.12070v1 |
null |
2023-06-21 |
Sample Attackability in Natural Language Adversarial Attacks |
Vyas Raina et.al. |
2306.12043v1 |
link |
2023-06-21 |
Multimodality Fusion for Smart Healthcare: a Journey from Data, Information, Knowledge to Wisdom |
Thanveer Shaik et.al. |
2306.11963v1 |
null |
2023-06-20 |
Deep Fusion: Efficient Network Training via Pre-trained Initializations |
Hanna Mazzawi et.al. |
2306.11903v1 |
null |
2023-06-20 |
Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications |
Saed Rezayi et.al. |
2306.11892v1 |
null |
2023-06-21 |
Event Stream GPT: A Data Pre-processing and Modeling Library for Generative, Pre-trained Transformers over Continuous-time Sequences of Complex Events |
Matthew B. A. McDermott et.al. |
2306.11547v2 |
link |
2023-06-20 |
One model to rule them all: ranking Slovene summarizers |
Aleš Žagar et.al. |
2306.11518v1 |
null |
2023-06-20 |
TrustGPT: A Benchmark for Trustworthy and Responsible Large Language Models |
Yue Huang et.al. |
2306.11507v1 |
null |
2023-06-20 |
Transforming Graphs for Enhanced Attribute-Based Clustering: An Innovative Graph Transformer Method |
Shuo Han et.al. |
2306.11307v1 |
null |
2023-06-20 |
UVSCAN: Detecting Third-Party Component Usage Violations in IoT Firmware |
Binbin Zhao et.al. |
2306.11206v1 |
null |
2023-06-19 |
BioREx: Improving Biomedical Relation Extraction by Leveraging Heterogeneous Datasets |
Po-Ting Lai et.al. |
2306.11189v1 |
link |
2023-06-18 |
Understanding and Characterizing Cryptocurrency Free Giveaway and Arbitrage Bot Scams In the Wild |
Kai Li et.al. |
2306.10634v1 |
link |
2023-06-17 |
Multilingual Multiword Expression Identification Using Lateral Inhibition and Domain Adaptation |
Andrei-Marius Avram et.al. |
2306.10419v1 |
null |
2023-06-16 |
SSE: A Metric for Evaluating Search System Explainability |
Catherine Chen et.al. |
2306.10175v1 |
link |
2023-06-16 |
Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects |
Kexin Zhang et.al. |
2306.10125v1 |
link |
2023-06-16 |
Rewriting the Script: Adapting Text Instructions for Voice Interaction |
Alyssa Hwang et.al. |
2306.09992v1 |
null |
2023-06-16 |
ClinicalGPT: Large Language Models Finetuned with Diverse Medical Data and Comprehensive Evaluation |
Guangyu Wang et.al. |
2306.09968v1 |
null |
2023-06-16 |
Revealing the impact of social circumstances on the selection of cancer therapy through natural language processing of social work notes |
Shenghuan Sun et.al. |
2306.09877v1 |
null |
2023-06-16 |
Full Parameter Fine-tuning for Large Language Models with Limited Resources |
Kai Lv et.al. |
2306.09782v1 |
link |
2023-06-16 |
Using Natural Language Processing and Networks to Automate Structured Literature Reviews: An Application to Farmers Climate Change Adaptation |
Sofia Gil-Clavel et.al. |
2306.09737v1 |
null |
2023-06-16 |
Reducing Computational Costs in Sentiment Analysis: Tensorized Recurrent Networks vs. Recurrent Networks |
Gabriel Lopez et.al. |
2306.09705v1 |
null |
2023-06-15 |
Building blocks for complex tasks: Robust generative event extraction for radiology reports under domain shifts |
Sitong Zhou et.al. |
2306.09544v1 |
null |
2023-06-15 |
FedMultimodal: A Benchmark For Multimodal Federated Learning |
Tiantian Feng et.al. |
2306.09486v1 |
null |
2023-06-15 |
From BERT to GPT-3 Codex: Harnessing the Potential of Very Large Language Models for Data Management |
Immanuel Trummer et.al. |
2306.09339v1 |
null |
2023-06-15 |
Opportunities for Large Language Models and Discourse in Engineering Design |
Jan Göpfert et.al. |
2306.09169v1 |
null |
2023-06-15 |
Mapping Researcher Activity based on Publication Data by means of Transformers |
Zineddine Bettouche et.al. |
2306.09049v1 |
null |
2023-06-15 |
Voting Booklet Bias: Stance Detection in Swiss Federal Communication |
Eric Egli et.al. |
2306.08999v1 |
link |
2023-06-15 |
Multilingual End to End Entity Linking |
Mikhail Plekhanov et.al. |
2306.08896v1 |
link |
2023-06-15 |
Description-Enhanced Label Embedding Contrastive Learning for Text Classification |
Kun Zhang et.al. |
2306.08817v1 |
link |
2023-06-14 |
Explore In-Context Learning for 3D Point Cloud Understanding |
Zhongbin Fang et.al. |
2306.08659v1 |
link |
2023-06-14 |
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models |
Lingxi Xie et.al. |
2306.08641v1 |
null |
2023-06-14 |
SQL2Circuits: Estimating Metrics for SQL Queries with A Quantum Natural Language Processing Method |
Valter Uotila et.al. |
2306.08529v1 |
link |
2023-06-14 |
AlbMoRe: A Corpus of Movie Reviews for Sentiment Analysis in Albanian |
Erion Çano et.al. |
2306.08526v1 |
link |
2023-06-13 |
Adversarial Capsule Networks for Romanian Satire Detection and Sentiment Analysis |
Sebastian-Vasile Echim et.al. |
2306.07845v1 |
null |
2023-06-13 |
A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews |
Robert Lakatos et.al. |
2306.07786v1 |
null |
2023-06-13 |
Rethink the Effectiveness of Text Data Augmentation: An Empirical Analysis |
Zhengxiang Shi et.al. |
2306.07664v1 |
link |
2023-06-12 |
Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and Siswati |
Andani Madodonga et.al. |
2306.07426v1 |
link |
2023-06-12 |
EriBERTa: A Bilingual Pre-Trained Language Model for Clinical Natural Language Processing |
Iker de la Iglesia et.al. |
2306.07373v1 |
null |
2023-06-11 |
A Comprehensive Survey on Applications of Transformers for Deep Learning Tasks |
Saidul Islam et.al. |
2306.07303v1 |
null |
2023-06-12 |
A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation |
Jeremy Gwinnup et.al. |
2306.07198v1 |
null |
2023-06-12 |
A language-inspired machine learning approach for solving strongly correlated problems with dynamical mean-field theory |
Zelong Zhao et.al. |
2306.06975v1 |
link |
2023-06-12 |
A Brief Review of Hypernetworks in Deep Learning |
Vinod Kumar Chauhan et.al. |
2306.06955v1 |
link |
2023-06-11 |
AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural Language Processing |
Asaad Alghamdi et.al. |
2306.06800v1 |
null |
2023-06-11 |
Adapting to the Impact of AI in Scientific Writing: Balancing Benefits and Drawbacks while Developing Policies and Regulations |
Ahmed S. BaHammam et.al. |
2306.06699v1 |
null |
2023-06-11 |
Computational Language Assessment: Open Brain AI |
Charalambos Themistocleous et.al. |
2306.06693v1 |
null |
2023-06-11 |
EaSyGuide : ESG Issue Identification Framework leveraging Abilities of Generative Large Language Models |
Hanwool Lee et.al. |
2306.06662v1 |
link |
2023-06-11 |
RoBERTweet: A BERT Language Model for Romanian Tweets |
Iulian-Marius Tăiatu et.al. |
2306.06598v1 |
null |
2023-06-10 |
Universal Language Modelling agent |
Anees Aslam et.al. |
2306.06521v1 |
null |
2023-06-10 |
A Comprehensive Review of State-of-The-Art Methods for Java Code Generation from Natural Language Text |
Jessica López Espejel et.al. |
2306.06371v1 |
null |
2023-06-09 |
FinGPT: Open-Source Financial Large Language Models |
Hongyang Yang et.al. |
2306.06031v1 |
link |
2023-06-09 |
HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine |
Rodrigo Agerri et.al. |
2306.06029v1 |
null |
2023-06-09 |
Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect? |
Wissam Antoun et.al. |
2306.05871v1 |
null |
2023-06-09 |
Towards the Exploitation of LLM-based Chatbot for Providing Legal Support to Palestinian Cooperatives |
Rabee Qasem et.al. |
2306.05827v1 |
null |
2023-06-09 |
How Can Recommender Systems Benefit from Large Language Models: A Survey |
Jianghao Lin et.al. |
2306.05817v1 |
link |
2023-06-09 |
Detecting Phishing Sites Using ChatGPT |
Takashi Koide et.al. |
2306.05816v1 |
null |
2023-06-09 |
Exploring Effective Mask Sampling Modeling for Neural Image Compression |
Lin Liu et.al. |
2306.05704v1 |
null |
2023-06-09 |
Customizing General-Purpose Foundation Models for Medical Report Generation |
Bang Yang et.al. |
2306.05642v1 |
null |
2023-06-09 |
Word sense extension |
Lei Yu et.al. |
2306.05609v1 |
link |
2023-06-08 |
Emotion and Sentiment Guided Paraphrasing |
Justin J. Xie et.al. |
2306.05556v1 |
null |
2023-06-08 |
Advancing Italian Biomedical Information Extraction with Large Language Models: Methodological Insights and Multicenter Practical Application |
Claudio Crema et.al. |
2306.05323v1 |
null |
2023-06-08 |
Are fairness metric scores enough to assess discrimination biases in machine learning? |
Fanny Jourdan et.al. |
2306.05307v1 |
null |
2023-06-08 |
Extensive Evaluation of Transformer-based Architectures for Adverse Drug Events Extraction |
Simone Scaboro et.al. |
2306.05276v1 |
link |
2023-06-09 |
Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models |
Tianzhe Chu et.al. |
2306.05272v2 |
link |
2023-06-08 |
M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models |
Wenxuan Zhang et.al. |
2306.05179v1 |
link |
2023-06-09 |
RRWKV: Capturing Long-range Dependencies in RWKV |
Leilei Wang et.al. |
2306.05176v2 |
null |
2023-06-08 |
Learning A Foundation Language Model for Geoscience Knowledge Understanding and Utilization |
Cheng Deng et.al. |
2306.05064v1 |
link |
2023-06-08 |
Knowledge Detection by Relevant Question and Image Attributes in Visual Question Answering |
Param Ahir et.al. |
2306.04938v1 |
null |
2023-06-08 |
covLLM: Large Language Models for COVID-19 Biomedical Literature |
Yousuf A. Khan et.al. |
2306.04926v1 |
null |
2023-06-08 |
Flow-based Network Intrusion Detection Based on BERT Masked Language Model |
Loc Gia Nguyen et.al. |
2306.04920v1 |
null |
2023-06-07 |
Cross-attention learning enables real-time nonuniform rotational distortion correction in OCT |
Haoran Zhang et.al. |
2306.04512v1 |
null |
2023-06-07 |
How to Find Opinion Leader on the Online Social Network? |
Bailu Jin et.al. |
2306.04452v1 |
null |
2023-06-07 |
Multilingual Clinical NER: Translation or Cross-lingual Transfer? |
Xavier Fontaine et.al. |
2306.04384v1 |
null |
2023-06-07 |
IUTEAM1 at MEDIQA-Chat 2023: Is simple fine tuning effective for multilayer summarization of clinical conversations? |
Dhananjay Srivastava et.al. |
2306.04328v1 |
link |
2023-06-07 |
Leveraging Knowledge Graph Embeddings to Enhance Contextual Representations for Relation Extraction |
Fréjus A. A. Laleye et.al. |
2306.04203v1 |
null |
2023-06-07 |
A Survey on Generative Diffusion Models for Structured Data |
Heejoon Koo et.al. |
2306.04139v1 |
null |
2023-06-06 |
GEO-Bench: Toward Foundation Models for Earth Monitoring |
Alexandre Lacoste et.al. |
2306.03831v1 |
link |
2023-06-06 |
Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models |
Fobo Shi et.al. |
2306.03799v1 |
link |
2023-06-06 |
On the Difference of BERT-style and CLIP-style Text Encoders |
Zhihong Chen et.al. |
2306.03678v1 |
link |
2023-06-06 |
Take the Hint: Improving Arabic Diacritization with Partially-Diacritized Text |
Parnia Bahar et.al. |
2306.03557v1 |
link |
2023-06-06 |
SciLit: A Platform for Joint Scientific Literature Discovery, Summarization and Citation Generation |
Nianlong Gu et.al. |
2306.03535v1 |
link |
2023-06-06 |
Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning |
Peggy Tang et.al. |
2306.03415v1 |
link |
2023-06-06 |
Stabilizing Contrastive RL: Techniques for Offline Goal Reaching |
Chongyi Zheng et.al. |
2306.03346v1 |
link |
2023-06-05 |
A Scalable and Adaptive System to Infer the Industry Sectors of Companies: Prompt + Model Tuning of Generative Language Models |
Lele Cao et.al. |
2306.03313v1 |
null |
2023-06-05 |
Easy-to-Read in Germany: A Survey on its Current State and Available Resources |
Margot Madina et.al. |
2306.03189v1 |
null |
2023-06-05 |
Machine Learning and Statistical Approaches to Measuring Similarity of Political Parties |
Daria Boratyn et.al. |
2306.03079v1 |
null |
2023-06-05 |
Using Sequences of Life-events to Predict Human Lives |
Germans Savcisens et.al. |
2306.03009v1 |
link |
2023-06-05 |
Gen-IR @ SIGIR 2023: The First Workshop on Generative Information Retrieval |
Gabriel Bénédict et.al. |
2306.02887v1 |
null |
2023-06-05 |
COMET: Learning Cardinality Constrained Mixture of Experts with Trees and Local Search |
Shibal Ibrahim et.al. |
2306.02824v1 |
link |
2023-06-05 |
Enhancing Language Representation with Constructional Information for Natural Language Understanding |
Lvxiaowei Xu et.al. |
2306.02819v1 |
link |
2023-06-05 |
Cheap-fake Detection with LLM using Prompt Engineering |
Guangyang Wu et.al. |
2306.02776v1 |
null |
2023-06-05 |
Colexifications for Bootstrapping Cross-lingual Datasets: The Case of Phonology, Concreteness, and Affectiveness |
Yiyi Chen et.al. |
2306.02646v1 |
null |
2023-06-04 |
Adversary for Social Good: Leveraging Adversarial Attacks to Protect Personal Attribute Privacy |
Xiaoting Li et.al. |
2306.02488v1 |
null |
2023-06-04 |
Modeling Cross-Cultural Pragmatic Inference with Codenames Duet |
Omar Shaikh et.al. |
2306.02475v1 |
link |
2023-06-04 |
Taught by the Internet, Exploring Bias in OpenAIs GPT3 |
Ali Ayaz et.al. |
2306.02428v1 |
null |
2023-06-02 |
Towards In-context Scene Understanding |
Ivana Balažević et.al. |
2306.01667v1 |
null |
2023-06-02 |
Analyzing Credit Risk Model Problems through NLP-Based Clustering and Machine Learning: Insights from Validation Reports |
Szymon Lis et.al. |
2306.01618v1 |
null |
2023-06-02 |
Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today |
Zhuo Wang et.al. |
2306.01499v1 |
null |
2023-06-02 |
Syntax-aware Hybrid prompt model for Few-shot multi-modal sentiment analysis |
Zikai Zhou et.al. |
2306.01312v1 |
null |
2023-06-02 |
Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation |
Hanbyul Kim et.al. |
2306.01296v1 |
null |
2023-06-02 |
Egocentric Planning for Scalable Embodied Task Achievement |
Xiaotian Liu et.al. |
2306.01295v1 |
null |
2023-06-02 |
Active Code Learning: Benchmarking Sample-Efficient Training of Code Models |
Qiang Hu et.al. |
2306.01250v1 |
null |
2023-06-02 |
Transforming ECG Diagnosis:An In-depth Review of Transformer-based DeepLearning Models in Cardiovascular Disease Detection |
Zibin Zhao et.al. |
2306.01249v1 |
null |
2023-06-01 |
Hybrid Long Document Summarization using C2F-FAR and ChatGPT: A Practical Study |
Guang Lu et.al. |
2306.01169v1 |
null |
2023-06-01 |
Leveraging Natural Language Processing For Public Health Screening On YouTube: A COVID-19 Case Study |
Ahrar Bin Aslam et.al. |
2306.01164v1 |
null |
2023-06-01 |
Effective Structured Prompting by Meta-Learning and Representative Verbalizer |
Weisen Jiang et.al. |
2306.00618v1 |
link |
2023-06-01 |
Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior |
Shashank Subramanian et.al. |
2306.00258v1 |
null |
2023-05-31 |
Measuring the Robustness of Natural Language Processing Models to Domain Shifts |
Nitay Calderon et.al. |
2306.00168v1 |
link |
2023-05-31 |
Multilingual Multi-Figurative Language Detection |
Huiyuan Lai et.al. |
2306.00121v1 |
link |
2023-05-31 |
Findings of the VarDial Evaluation Campaign 2023 |
Noëmi Aepli et.al. |
2305.20080v1 |
null |
2023-05-31 |
Computational Language Assessment in patients with speech, language, and communication impairments |
Charalambos Themistocleous et.al. |
2305.20046v1 |
null |
2023-05-31 |
ActiveAED: A Human in the Loop Improves Annotation Error Detection |
Leon Weber et.al. |
2305.20045v1 |
link |
2023-06-01 |
A Survey on Large Language Models for Recommendation |
Likang Wu et.al. |
2305.19860v2 |
link |
2023-05-31 |
UKP-SQuARE: An Interactive Tool for Teaching Question Answering |
Haishuo Fang et.al. |
2305.19748v1 |
link |
2023-05-31 |
Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation |
Joonhyuk Yang et.al. |
2305.19666v1 |
link |
2023-05-31 |
Large Language Models Are Not Abstract Reasoners |
Gaël Gendron et.al. |
2305.19555v1 |
link |
2023-05-31 |
Ethical Considerations for Machine Translation of Indigenous Languages: Giving a Voice to the Speakers |
Manuel Mager et.al. |
2305.19474v1 |
null |
2023-05-30 |
Examining risks of racial biases in NLP tools for child protective services |
Anjalie Field et.al. |
2305.19409v1 |
null |
2023-05-30 |
Quantum Natural Language Processing based Sentiment Analysis using lambeq Toolkit |
Srinjoy Ganguly et.al. |
2305.19383v1 |
null |
2023-05-30 |
Jointly Reparametrized Multi-Layer Adaptation for Efficient and Private Tuning |
Umang Gupta et.al. |
2305.19264v1 |
link |
2023-05-30 |
Grokking of Hierarchical Structure in Vanilla Transformers |
Shikhar Murty et.al. |
2305.18741v1 |
link |
2023-05-30 |
LonXplain: Lonesomeness as a Consequence of Mental Disturbance in Reddit Posts |
Muskan Garg et.al. |
2305.18736v1 |
null |
2023-05-30 |
An Annotated Dataset for Explainable Interpersonal Risk Factors of Mental Disturbance in Social Media Posts |
Muskan Garg et.al. |
2305.18727v1 |
link |
2023-05-31 |
Beyond One-Model-Fits-All: A Survey of Domain Specialization for Large Language Models |
Chen Ling et.al. |
2305.18703v2 |
null |
2023-05-30 |
Approximation and Estimation Ability of Transformers for Sequence-to-Sequence Functions with Infinite Dimensional Input |
Shokichi Takakura et.al. |
2305.18699v1 |
null |
2023-05-29 |
SlimFit: Memory-Efficient Fine-Tuning of Transformer-based Models Using Training Dynamics |
Arash Ardakani et.al. |
2305.18513v1 |
null |
2023-05-29 |
Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning |
Zhanming Jie et.al. |
2305.18170v1 |
link |
2023-05-29 |
Exploring Effectiveness of GPT-3 in Grammatical Error Correction: A Study on Performance and Controllability in Prompt-Based Methods |
Mengsay Loem et.al. |
2305.18156v1 |
null |
2023-05-30 |
Do Large Language Models Know What They Don't Know? |
Zhangyue Yin et.al. |
2305.18153v2 |
link |
2023-05-29 |
The Utility of Large Language Models and Generative AI for Education Research |
Andrew Katz et.al. |
2305.18125v1 |
null |
2023-05-29 |
Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning |
Xuankai Chang et.al. |
2305.18108v1 |
null |
2023-05-29 |
Semantic Role Labeling Guided Out-of-distribution Detection |
Jinan Zou et.al. |
2305.18026v1 |
link |
2023-05-29 |
Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition |
Xiaoliang Wu et.al. |
2305.18011v1 |
null |
2023-05-28 |
Transfer Learning for Power Outage Detection Task with Limited Training Data |
Olukunle Owolabi et.al. |
2305.17817v1 |
null |
2023-05-28 |
Tab-CoT: Zero-shot Tabular Chain of Thought |
Ziqi Jin et.al. |
2305.17812v1 |
link |
2023-05-28 |
ConvGenVisMo: Evaluation of Conversational Generative Vision Models |
Narjes Nikzad Khasmakhi et.al. |
2305.17784v1 |
link |
2023-05-26 |
Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model |
David Soong et.al. |
2305.17116v1 |
null |
2023-05-26 |
NeuroX Library for Neuron Analysis of Deep NLP Models |
Fahim Dalvi et.al. |
2305.17073v1 |
link |
2023-05-26 |
Counterfactuals of Counterfactuals: a back-translation-inspired approach to analyse counterfactual editors |
Giorgos Filandrianos et.al. |
2305.17055v1 |
link |
2023-05-26 |
Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation |
David Brandfonbrener et.al. |
2305.16985v1 |
link |
2023-05-26 |
Theoretical and Practical Perspectives on what Influence Functions Do |
Andrea Schioppa et.al. |
2305.16971v1 |
null |
2023-05-26 |
RankCSE: Unsupervised Sentence Representations Learning via Learning to Rank |
Jiduan Liu et.al. |
2305.16726v1 |
null |
2023-05-26 |
TADA: Task-Agnostic Dialect Adapters for English |
Will Held et.al. |
2305.16651v1 |
link |
2023-05-26 |
Are Fairy Tales Fair? Analyzing Gender Bias in Temporal Narrative Event Chains of Children's Fairy Tales |
Paulina Toro Isaza et.al. |
2305.16641v1 |
null |
2023-05-26 |
Zero is Not Hero Yet: Benchmarking Zero-Shot Performance of LLMs for Financial Tasks |
Agam Shah et.al. |
2305.16633v1 |
link |
2023-05-26 |
ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-Translation |
Kuan-Hao Huang et.al. |
2305.16585v1 |
link |
2023-05-25 |
Landmark Attention: Random-Access Infinite Context Length for Transformers |
Amirkeivan Mohtashami et.al. |
2305.16300v1 |
link |
2023-05-25 |
Understanding Idea Creation in Collaborative Discourse through Networks: The Joint Attention-Interaction-Creation (AIC) Framework |
Xinran Zhu et.al. |
2305.16262v1 |
null |
2023-05-25 |
Neural Natural Language Processing for Long Texts: A Survey of the State-of-the-Art |
Dimitrios Tsirmpas et.al. |
2305.16259v1 |
null |
2023-05-25 |
Persistent Laplacian-enhanced Algorithm for Scarcely Labeled Data Classification |
Gokul Bhusal et.al. |
2305.16239v1 |
null |
2023-05-25 |
More than Words: Twitter Chatter and Financial Market Sentiment |
Travis Adams et.al. |
2305.16164v1 |
null |
2023-05-25 |
Training Data Extraction From Pre-trained Language Models: A Survey |
Shotaro Ishihara et.al. |
2305.16157v1 |
null |
2023-05-25 |
On Influence Functions, Classification Influence, Relative Influence, Memorization and Generalization |
Michael Kounavis et.al. |
2305.16094v1 |
null |
2023-05-25 |
Efficient Document Embeddings via Self-Contrastive Bregman Divergence Learning |
Daniel Saggau et.al. |
2305.16031v1 |
null |
2023-05-25 |
SING: A Plug-and-Play DNN Learning Technique |
Adrien Courtois et.al. |
2305.15997v1 |
link |
2023-05-25 |
Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data |
Aryan Patil et.al. |
2305.15722v1 |
null |
2023-05-24 |
READ: Recurrent Adaptation of Large Transformers |
Sid Wang et.al. |
2305.15348v1 |
null |
2023-05-24 |
EvEval: A Comprehensive Evaluation of Event Semantics for Large Language Models |
Zhengwei Tao et.al. |
2305.15268v1 |
null |
2023-05-24 |
SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation |
Tetsu Kasanishi et.al. |
2305.15186v1 |
link |
2023-05-24 |
A Mini Review on the utilization of Reinforcement Learning with OPC UA |
Simon Schindler et.al. |
2305.15113v1 |
null |
2023-05-24 |
GPT4Graph: Can Large Language Models Understand Graph Structured Data ? An Empirical Evaluation and Benchmarking |
Jiayan Guo et.al. |
2305.15066v1 |
null |
2023-05-24 |
Exploring Adapter-based Transfer Learning for Recommender Systems: Empirical Studies and Practical Insights |
Junchen Fu et.al. |
2305.15036v1 |
link |
2023-05-24 |
Unlocking Temporal Question Answering for Large Language Models Using Code Execution |
Xingxuan Li et.al. |
2305.15014v1 |
link |
2023-05-24 |
Bactrian-X : A Multilingual Replicable Instruction-Following Model with Low-Rank Adaptation |
Haonan Li et.al. |
2305.15011v1 |
link |
2023-05-24 |
Sentiment Analysis in the Era of Large Language Models: A Reality Check |
Wenxuan Zhang et.al. |
2305.15005v1 |
link |
2023-05-24 |
Frugal Prompting for Dialog Models |
Bishal Santra et.al. |
2305.14919v1 |
link |
2023-05-23 |
RET-LLM: Towards a General Read-Write Memory for Large Language Models |
Ali Modarressi et.al. |
2305.14322v1 |
link |
2023-05-23 |
VIP5: Towards Multimodal Foundation Models for Recommendation |
Shijie Geng et.al. |
2305.14302v1 |
link |
2023-05-23 |
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages |
Milind Agarwal et.al. |
2305.14263v1 |
link |
2023-05-23 |
TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale |
Ziyun Zeng et.al. |
2305.14173v1 |
link |
2023-05-23 |
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future |
Linyi Yang et.al. |
2305.14104v1 |
null |
2023-05-23 |
Predicting Survey Response with Quotation-based Modeling: A Case Study on Favorability towards the United States |
Alireza Amirshahi et.al. |
2305.14086v1 |
null |
2023-05-23 |
Robust Instruction Optimization for Large Language Models with Distribution Shifts |
Moxin Li et.al. |
2305.13954v1 |
null |
2023-05-23 |
Parameterized Complexity Classification for Interval Constraints |
Konrad K. Dabrowski et.al. |
2305.13889v1 |
null |
2023-05-23 |
PaD: Program-aided Distillation Specializes Large Models in Reasoning |
Xuekai Zhu et.al. |
2305.13888v1 |
link |
2023-05-23 |
A Trip Towards Fairness: Bias and De-Biasing in Large Language Models |
Leonardo Ranaldi et.al. |
2305.13862v1 |
null |
2023-05-22 |
Parallel Attention and Feed-Forward Net Design for Pre-training and Inference on Transformers |
Shashank Sonkar et.al. |
2305.13297v1 |
null |
2023-05-22 |
VideoLLM: Modeling Video Sequence with Large Language Models |
Guo Chen et.al. |
2305.13292v1 |
link |
2023-05-22 |
Watermarking Text Data on Large Language Models for Dataset Copyright Protection |
Yixin Liu et.al. |
2305.13257v1 |
null |
2023-05-22 |
Interactive Natural Language Processing |
Zekun Wang et.al. |
2305.13246v1 |
null |
2023-05-22 |
Should We Attend More or Less? Modulating Attention for Fairness |
Abdelrahman Zayed et.al. |
2305.13088v1 |
null |
2023-05-22 |
Biomedical Named Entity Recognition via Dictionary-based Synonym Generalization |
Zihao Fu et.al. |
2305.13066v1 |
link |
2023-05-22 |
RWKV: Reinventing RNNs for the Transformer Era |
Bo Peng et.al. |
2305.13048v1 |
link |
2023-05-22 |
Rethinking Semi-supervised Learning with Language Models |
Zhengxiang Shi et.al. |
2305.13002v1 |
link |
2023-05-22 |
VanillaNet: the Power of Minimalism in Deep Learning |
Hanting Chen et.al. |
2305.12972v1 |
link |
2023-05-22 |
A Diachronic Analysis of the NLP Research Paradigm Shift: When, How, and Why? |
Aniket Pramanick et.al. |
2305.12920v1 |
null |
2023-05-19 |
Recent progress in the JARVIS infrastructure for next-generation data-driven materials design |
Daniel Wines et.al. |
2305.11842v1 |
null |
2023-05-19 |
Marginalized Beam Search Algorithms for Hierarchical HMMs |
Xuechun Xu et.al. |
2305.11752v1 |
link |
2023-05-19 |
Introspective Tips: Large Language Model for In-Context Decision Making |
Liting Chen et.al. |
2305.11598v1 |
null |
2023-05-19 |
Diving into the Inter-Consistency of Large Language Models: An Insightful Analysis through Debate |
Kai Xiong et.al. |
2305.11595v1 |
link |
2023-05-19 |
Speech-Text Dialog Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment |
Tianshu Yu et.al. |
2305.11579v1 |
link |
2023-05-19 |
Constructing Word-Context-Coupled Space Aligned with Associative Knowledge Relations for Interpretable Language Modeling |
Fanyu Wang et.al. |
2305.11543v1 |
link |
2023-05-19 |
A Sequence-to-Sequence Approach for Arabic Pronoun Resolution |
Hanan S. Murayshid et.al. |
2305.11529v1 |
null |
2023-05-19 |
AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation |
Sara Papi et.al. |
2305.11408v1 |
link |
2023-05-18 |
Comparing Biases and the Impact of Multilingual Training across Multiple Languages |
Sharon Levy et.al. |
2305.11242v1 |
null |
2023-05-18 |
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model |
Siyuan Huang et.al. |
2305.11176v1 |
link |
2023-05-18 |
Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature |
Ana Cláudia Akemi Matsuki de Faria et.al. |
2305.11033v1 |
null |
2023-05-18 |
How Deep Learning Sees the World: A Survey on Adversarial Attacks & Defenses |
Joana C. Costa et.al. |
2305.10862v1 |
null |
2023-05-18 |
Deep Learning Methods for Extracting Metaphorical Names of Flowers and Plants |
Amal Haddad Haddad et.al. |
2305.10833v1 |
null |
2023-05-18 |
Expanding the Role of Affective Phenomena in Multimodal Interaction Research |
Leena Mathur et.al. |
2305.10827v1 |
null |
2023-05-18 |
A Survey on Time-Series Pre-Trained Models |
Qianli Ma et.al. |
2305.10716v1 |
link |
2023-05-18 |
Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding |
Taolin Zhang et.al. |
2305.10714v1 |
null |
2023-05-18 |
NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in Natural Language Processing |
Tingting Wu et.al. |
2305.10709v1 |
link |
2023-05-18 |
MolXPT: Wrapping Molecules with Text for Generative Pre-training |
Zequn Liu et.al. |
2305.10688v1 |
null |
2023-05-17 |
Incorporating Attribution Importance for Improving Faithfulness Metrics |
Zhixue Zhao et.al. |
2305.10496v1 |
link |
2023-05-17 |
G-Adapter: Towards Structure-Aware Parameter-Efficient Transfer Learning for Graph Transformer Networks |
Anchun Gui et.al. |
2305.10329v1 |
null |
2023-05-17 |
Towards More Robust NLP System Evaluation: Handling Missing Scores in Benchmarks |
Anas Himmi et.al. |
2305.10284v1 |
null |
2023-05-17 |
A quantitative study of NLP approaches to question difficulty estimation |
Luca Benedetto et.al. |
2305.10236v1 |
link |
2023-05-17 |
Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection |
Shadi Iskander et.al. |
2305.10204v1 |
link |
2023-05-17 |
Qualifying Chinese Medical Licensing Examination with Knowledge Enhanced Generative Pre-training Model |
Jiageng Wu et.al. |
2305.10163v1 |
null |
2023-05-17 |
Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark |
Wenjun Peng et.al. |
2305.10036v1 |
link |
2023-05-17 |
When Gradient Descent Meets Derivative-Free Optimization: A Match Made in Black-Box Scenario |
Chengcheng Han et.al. |
2305.10013v1 |
null |
2023-05-17 |
Semantic Similarity Measure of Natural Language Text through Machine Learning and a Keyword-Aware Cross-Encoder-Ranking Summarizer -- A Case Study Using UCGIS GIS&T Body of Knowledge |
Yuanyuan Tian et.al. |
2305.09877v1 |
null |
2023-05-17 |
Knowledge Graph Completion Models are Few-shot Learners: An Empirical Study of Relation Labeling in E-commerce with LLMs |
Jiao Chen et.al. |
2305.09858v1 |
null |
2023-05-16 |
Mirages: On Anthropomorphism in Dialogue Systems |
Gavin Abercrombie et.al. |
2305.09800v1 |
null |
2023-05-16 |
Adapting Sentence Transformers for the Aviation Domain |
Liya Wang et.al. |
2305.09556v1 |
null |
2023-05-16 |
Life of PII -- A PII Obfuscation Transformer |
Ajinkya Deshmukh et.al. |
2305.09550v1 |
null |
2023-05-16 |
MetaSRL++: A Uniform Scheme for Modelling Deeper Semantics |
Fritz Hohl et.al. |
2305.09534v1 |
null |
2023-05-16 |
On the Origins of Bias in NLP through the Lens of the Jim Code |
Fatma Elsafoury et.al. |
2305.09281v1 |
null |
2023-05-16 |
Progressive Translation: Improving Domain Robustness of Neural Machine Translation with Intermediate Sequences |
Chaojun Wang et.al. |
2305.09154v1 |
link |
2023-05-15 |
An assessment of measuring local levels of homelessness through proxy social media signals |
Yoshi Meke Bird et.al. |
2305.08978v1 |
null |
2023-05-15 |
Sentence Level Curriculum Learning for Improved Neural Conversational Models |
Sean Paulsen et.al. |
2305.08818v1 |
null |
2023-05-15 |
Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text |
Hanieh Khorashadizadeh et.al. |
2305.08804v1 |
null |
2023-05-15 |
Question-Answering System Extracts Information on Injection Drug Use from Clinical Progress Notes |
Maria Mahbub et.al. |
2305.08777v1 |
link |
2023-05-15 |
Measuring Consistency in Text-based Financial Forecasting Models |
Linyi Yang et.al. |
2305.08524v1 |
link |
2023-05-15 |
Beqi: Revitalize the Senegalese Wolof Language with a Robust Spelling Corrector |
Derguene Mbaye et.al. |
2305.08518v1 |
null |
2023-05-15 |
Taxi1500: A Multilingual Dataset for Text Classification in 1500 Languages |
Chunlan Ma et.al. |
2305.08487v1 |
null |
2023-05-15 |
What's the Meaning of Superhuman Performance in Today's NLU? |
Simone Tedeschi et.al. |
2305.08414v1 |
null |
2023-05-14 |
MatSci-NLP: Evaluating Scientific Language Models on Materials Science Language Tasks Using Text-to-Schema Modeling |
Yu Song et.al. |
2305.08264v1 |
link |
2023-05-14 |
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity |
Raman Dutt et.al. |
2305.08252v1 |
null |
2023-05-14 |
Learning to Generalize for Cross-domain QA |
Yingjie Niu et.al. |
2305.08208v1 |
link |
2023-05-12 |
PALR: Personalization Aware LLMs for Recommendation |
Zheng Chen et.al. |
2305.07622v1 |
null |
2023-05-12 |
Retrospective End-User Walkthrough: A Method for Assessing How People Combine Multiple AI Models in Decision-Making Systems |
Vagner Figueredo de Santana et.al. |
2305.07530v1 |
null |
2023-05-12 |
ArtGPT-4: Artistic Vision-Language Understanding with Adapter-enhanced MiniGPT-4 |
Zhengqing Yuan et.al. |
2305.07490v1 |
link |
2023-05-12 |
Implications of Deep Circuits in Improving Quality of Quantum Question Answering |
Pragya Katyayan et.al. |
2305.07374v1 |
null |
2023-05-12 |
Gaussian Prior Reinforcement Learning for Nested Named Entity Recognition |
Yawen Yang et.al. |
2305.07266v1 |
null |
2023-05-12 |
T-former: An Efficient Transformer for Image Inpainting |
Ye Deng et.al. |
2305.07239v1 |
link |
2023-05-12 |
When Giant Language Brains Just Aren't Enough! Domain Pizzazz with Knowledge Sparkle Dust |
Minh-Tien Nguyen et.al. |
2305.07230v1 |
null |
2023-05-12 |
Asymmetric feature interaction for interpreting model predictions |
Xiaolei Lu et.al. |
2305.07224v1 |
link |
2023-05-11 |
Automated Smell Detection and Recommendation in Natural Language Requirements |
Alvaro Veizaga et.al. |
2305.07097v1 |
null |
2023-05-11 |
Cost-efficient Crowdsourcing for Span-based Sequence Labeling: Worker Selection and Data Augmentation |
Yujie Wang et.al. |
2305.06683v1 |
null |
2023-05-11 |
When the Majority is Wrong: Leveraging Annotator Disagreement for Subjective Tasks |
Eve Fleisig et.al. |
2305.06626v1 |
null |
2023-05-11 |
GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark |
Dongyang Li et.al. |
2305.06545v1 |
null |
2023-05-11 |
How Good are Commercial Large Language Models on African Languages? |
Jessica Ojo et.al. |
2305.06530v1 |
null |
2023-05-10 |
Exploring the Landscape of Machine Unlearning: A Survey and Taxonomy |
Thanveer Shaik et.al. |
2305.06360v1 |
null |
2023-05-10 |
CADGE: Context-Aware Dialogue Generation Enhanced with Graph-Structured Knowledge Aggregation |
Hongbo Zhanga et.al. |
2305.06294v1 |
link |
2023-05-09 |
Alleviating Over-smoothing for Unsupervised Sentence Representation |
Nuo Chen et.al. |
2305.06154v1 |
link |
2023-05-10 |
CrudeBERT: Applying Economic Theory towards fine-tuning Transformer-based Sentiment Analysis Models to the Crude Oil Market |
Himmet Kaplan et.al. |
2305.06140v1 |
null |
2023-05-10 |
Transformer-based model for monocular visual odometry: a video understanding approach |
André O. Françani et.al. |
2305.06121v1 |
link |
2023-05-10 |
XTab: Cross-table Pretraining for Tabular Transformers |
Bingzhao Zhu et.al. |
2305.06090v1 |
link |
2023-05-10 |
FedSOV: Federated Model Secure Ownership Verification with Unforgeable Signature |
Wenyuan Yang et.al. |
2305.06085v1 |
null |
2023-05-09 |
Learning to Parallelize with OpenMP by Augmented Heterogeneous AST Representation |
Le Chen et.al. |
2305.05779v1 |
null |
2023-05-09 |
Beyond Good Intentions: Reporting the Research Landscape of NLP for Social Good |
Fernando Gonzalez et.al. |
2305.05471v1 |
link |
2023-05-09 |
Estimating related words computationally using language model from the Mahabharata -- an Indian epic |
Vrunda Gadesha et.al. |
2305.05420v1 |
null |
2023-05-08 |
Knowledge-enhanced Agents for Interactive Text Games |
Prateek Chhikara et.al. |
2305.05091v1 |
null |
2023-05-08 |
A Unified Evaluation Framework for Novelty Detection and Accommodation in NLP with an Instantiation in Authorship Attribution |
Neeraj Varshney et.al. |
2305.05079v1 |
null |
2023-05-08 |
Dreams Are More "Predictable'' Than You Think |
Lorenzo Bertolini et.al. |
2305.05054v1 |
link |
2023-05-08 |
Knowledge Graph Guided Semantic Evaluation of Language Models For User Trust |
Kaushik Roy et.al. |
2305.04989v1 |
null |
2023-05-08 |
Towards Understanding Machine Learning Testing in Practise |
Arumoy Shome et.al. |
2305.04988v1 |
null |
2023-05-08 |
The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification |
Anastasiia Grishina et.al. |
2305.04940v1 |
link |
2023-05-08 |
Augmented Large Language Models with Parametric Knowledge Guiding |
Ziyang Luo et.al. |
2305.04757v1 |
null |
2023-05-08 |
Toeplitz Neural Network for Sequence Modeling |
Zhen Qin et.al. |
2305.04749v1 |
link |
2023-05-08 |
Differentially Private Attention Computation |
Yeqi Gao et.al. |
2305.04701v1 |
null |
2023-05-08 |
Putting Natural in Natural Language Processing |
Grzegorz Chrupała et.al. |
2305.04572v1 |
null |
2023-05-08 |
Multi-source Education Knowledge Graph Construction and Fusion for College Curricula |
Zeju Li et.al. |
2305.04567v1 |
null |
2023-05-08 |
Flex-SFU: Accelerating DNN Activation Functions by Non-Uniform Piecewise Approximation |
Enrico Reggiani et.al. |
2305.04546v1 |
null |
2023-05-08 |
A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues |
Yunxin Li et.al. |
2305.04530v1 |
link |
2023-05-08 |
Token-level Fitting Issues of Seq2seq Models |
Guangsheng Bao et.al. |
2305.04493v1 |
null |
2023-05-08 |
SmartState: A Protocol-Driven Human Interface |
Samuel E. Armstrong et.al. |
2305.04411v1 |
link |
2023-05-07 |
LatinCy: Synthetic Trained Pipelines for Latin NLP |
Patrick J. Burns et.al. |
2305.04365v1 |
null |
2023-05-05 |
How Segment Anything Model (SAM) Boost Medical Image Segmentation? |
Yichi Zhang et.al. |
2305.03678v1 |
link |
2023-05-05 |
Now It Sounds Like You: Learning Personalized Vocabulary On Device |
Sid Wang et.al. |
2305.03584v1 |
null |
2023-05-05 |
Context-Aware Semantic Similarity Measurement for Unsupervised Word Sense Disambiguation |
Jorge Martinez-Gil et.al. |
2305.03520v1 |
link |
2023-05-05 |
T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Large Language Model Signals for Science Question Answering |
Lei Wang et.al. |
2305.03453v1 |
link |
2023-05-05 |
Online Gesture Recognition using Transformer and Natural Language Processing |
G. C. M. Silvestre et.al. |
2305.03407v1 |
null |
2023-05-05 |
Visualization in the Era of Artificial Intelligence: Experiments for Creating Structural Visualizations by Prompting Large Language Models |
Hans-Georg Fill et.al. |
2305.03380v1 |
null |
2023-05-05 |
The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions, Cross-Cultural Humour, and Personalisation |
Lukas Christ et.al. |
2305.03369v1 |
link |
2023-05-05 |
MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic |
Damien Sileo et.al. |
2305.03353v1 |
link |
2023-05-05 |
HiPool: Modeling Long Documents Using Graph Neural Networks |
Irene Li et.al. |
2305.03319v1 |
link |
2023-05-05 |
A Survey on Out-of-Distribution Detection in NLP |
Hao Lang et.al. |
2305.03236v1 |
null |
2023-05-04 |
Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Recover the Whole Sentence |
Haoran Li et.al. |
2305.03010v1 |
link |
2023-05-04 |
Simple Noisy Environment Augmentation for Reinforcement Learning |
Raad Khraishi et.al. |
2305.02882v1 |
link |
2023-05-04 |
Interpretable Sentence Representation with Variational Autoencoders and Attention |
Ghazi Felhi et.al. |
2305.02810v1 |
null |
2023-05-04 |
The Elephant in the Room: Analyzing the Presence of Big Tech in Natural Language Processing Research |
Mohamed Abdalla et.al. |
2305.02797v1 |
link |
2023-05-04 |
DN at SemEval-2023 Task 12: Low-Resource Language Text Classification via Multilingual Pretrained Language Model Fine-tuning |
Daniil Homskiy et.al. |
2305.02607v1 |
null |
2023-05-04 |
AutoML-GPT: Automatic Machine Learning with GPT |
Shujian Zhang et.al. |
2305.02499v1 |
null |
2023-05-03 |
Quantifying the Dissimilarity of Texts |
Benjamin Shade et.al. |
2305.02457v1 |
link |
2023-05-03 |
Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs |
Deepak Narayanan et.al. |
2305.02440v1 |
null |
2023-05-03 |
Uncovering ChatGPT's Capabilities in Recommender Systems |
Sunhao Dai et.al. |
2305.02182v1 |
link |
2023-05-03 |
Natural language processing on customer note data |
Andrew Hilditch et.al. |
2305.02029v1 |
null |
2023-05-03 |
Exploring the Protein Sequence Space with Global Generative Models |
Sergio Romero-Romero et.al. |
2305.01941v1 |
null |
2023-05-03 |
Can Large Language Models Be an Alternative to Human Evaluations? |
Cheng-Han Chiang et.al. |
2305.01937v1 |
null |
2023-05-03 |
Improving Contrastive Learning of Sentence Embeddings from AI Feedback |
Qinyuan Cheng et.al. |
2305.01918v1 |
link |
2023-05-02 |
Post-Abstention: Towards Reliably Re-Attempting the Abstained Instances in QA |
Neeraj Varshney et.al. |
2305.01812v1 |
null |
2023-05-02 |
Don't Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner |
Zhengxiang Shi et.al. |
2305.01711v1 |
link |
2023-05-02 |
BrainNPT: Pre-training of Transformer networks for brain network classification |
Jinlong Hu et.al. |
2305.01666v1 |
null |
2023-05-02 |
The Benefits of Bad Advice: Autocontrastive Decoding across Model Layers |
Ariel Gera et.al. |
2305.01628v1 |
link |
2023-05-02 |
MultiLegalSBD: A Multilingual Legal Sentence Boundary Detection Dataset |
Tobias Brugger et.al. |
2305.01211v1 |
link |
2023-05-02 |
Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding |
Juan Zuluaga-Gomez et.al. |
2305.01155v1 |
null |
2023-05-02 |
RadAdapt: Radiology Report Summarization via Lightweight Domain Adaptation of Large Language Models |
Dave Van Veen et.al. |
2305.01146v1 |
link |
2023-05-01 |
Company classification using zero-shot learning |
Maryan Rizinski et.al. |
2305.01028v1 |
null |
2023-05-01 |
Attack-SAM: Towards Evaluating Adversarial Robustness of Segment Anything Model |
Chenshuang Zhang et.al. |
2305.00866v1 |
null |
2023-05-01 |
Performance and Energy Consumption of Parallel Machine Learning Algorithms |
Xidong Wu et.al. |
2305.00798v1 |
null |
2023-05-01 |
An Iterative Algorithm for Rescaled Hyperbolic Functions Regression |
Yeqi Gao et.al. |
2305.00660v1 |
null |
2023-05-01 |
Low-Resourced Machine Translation for Senegalese Wolof Language |
Derguene Mbaye et.al. |
2305.00606v1 |
null |
2023-04-30 |
Graph Global Attention Network with Memory for Fake News Detection |
Qian Chang et.al. |
2305.00456v1 |
null |
2023-04-29 |
Patent Mining by Extracting Functional Analysis Information Modelled As Graph Structure: A Patent Knowledge-base Collaborative Building Approach |
Manal E. Helal et.al. |
2305.00309v1 |
null |
2023-04-29 |
When Deep Learning Meets Polyhedral Theory: A Survey |
Joey Huchette et.al. |
2305.00241v1 |
null |
2023-04-29 |
Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework |
David Alonso del Barrio et.al. |
2305.00182v1 |
null |
2023-04-28 |
NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis |
Mingyang Wang et.al. |
2305.00090v1 |
null |
2023-04-28 |
Prompt Engineering for Healthcare: Methodologies and Applications |
Jiaqi Wang et.al. |
2304.14670v1 |
null |
2023-04-27 |
pyBibX -- A Python Library for Bibliometric and Scientometric Analysis Powered with Artificial Intelligence Tools |
Valdecy Pereira et.al. |
2304.14516v1 |
link |
2023-04-27 |
Framing the News:From Human Perception to Large Language Model Inferences |
David Alonso del Barrio et.al. |
2304.14456v1 |
null |
2023-04-27 |
string2string: A Modern Python Library for String-to-String Algorithms |
Mirac Suzgun et.al. |
2304.14395v1 |
link |
2023-04-26 |
Fine Tuning with Abnormal Examples |
Will Rieger et.al. |
2304.13783v1 |
null |
2023-04-27 |
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond |
Jingfeng Yang et.al. |
2304.13712v2 |
link |
2023-04-26 |
FVP: Fourier Visual Prompting for Source-Free Unsupervised Domain Adaptation of Medical Image Segmentation |
Yan Wang et.al. |
2304.13672v1 |
null |
2023-04-26 |
Using Implicit Feedback to Improve Question Generation |
Hugo Rodrigues et.al. |
2304.13664v1 |
null |
2023-04-26 |
Impact of Position Bias on Language Models in Token Classification |
Mehdi Ben Amor et.al. |
2304.13567v1 |
link |
2023-04-26 |
Tensor Decomposition for Model Reduction in Neural Networks: A Review |
Xingyi Liu et.al. |
2304.13539v1 |
null |
2023-04-26 |
The Closeness of In-Context Learning and Weight Shifting for Softmax Regression |
Shuai Li et.al. |
2304.13276v1 |
null |
2023-04-25 |
Representing and extracting knowledge from single cell data |
Ionut Sebastian Mihai et.al. |
2304.13084v1 |
null |
2023-04-25 |
Optimizing Deep Learning Models For Raspberry Pi |
Salem Ameen et.al. |
2304.13039v1 |
link |
2023-04-25 |
The Potential of Visual ChatGPT For Remote Sensing |
Lucas Prado Osco et.al. |
2304.13009v1 |
null |
2023-04-24 |
Topological properties and organizing principles of semantic networks |
Gabriel Budel et.al. |
2304.12940v1 |
null |
2023-04-25 |
Lessons Learned from a Citizen Science Project for Natural Language Processing |
Jan-Christoph Klie et.al. |
2304.12836v1 |
link |
2023-04-25 |
What does BERT learn about prosody? |
Sofoklis Kakouros et.al. |
2304.12706v1 |
null |
2023-04-25 |
A Preliminary Evaluation of ChatGPT in Requirements Information Retrieval |
Jianzhang Zhang et.al. |
2304.12562v1 |
link |
2023-04-24 |
Understanding and Predicting Human Label Variation in Natural Language Inference through Explanation |
Nan-Jiang Jiang et.al. |
2304.12443v1 |
null |
2023-04-24 |
Semantic Tokenizer for Enhanced Natural Language Processing |
Sandeep Mehta et.al. |
2304.12404v1 |
null |
2023-04-24 |
ThreatCrawl: A BERT-based Focused Crawler for the Cybersecurity Domain |
Philipp Kuehn et.al. |
2304.11960v1 |
null |
2023-04-23 |
Graph Neural Networks for Text Classification: A Survey |
Kunze Wang et.al. |
2304.11534v1 |
null |
2023-04-22 |
Understanding Lexical Biases when Identifying Gang-related Social Media Communications |
Dhiraj Murthy et.al. |
2304.11485v1 |
null |
2023-04-22 |
A Review of Deep Learning for Video Captioning |
Moloud Abdar et.al. |
2304.11431v1 |
null |
2023-04-22 |
Romanian Multiword Expression Detection Using Multilingual Adversarial Training and Lateral Inhibition |
Andrei-Marius Avram et.al. |
2304.11350v1 |
null |
2023-04-21 |
The Role of AI in Human-AI Creative Writing for Hong Kong Secondary Students |
Hengky Susanto et.al. |
2304.11276v1 |
null |
2023-04-20 |
Backpropagation-free Training of Deep Physical Neural Networks |
Ali Momeni et.al. |
2304.11042v1 |
null |
2023-04-21 |
BERT Based Clinical Knowledge Extraction for Biomedical Knowledge Graph Construction and Analysis |
Ayoub Harnoune et.al. |
2304.10996v1 |
null |
2023-04-21 |
Information Extraction from Documents: Question Answering vs Token Classification in real-world setups |
Laurent Lam et.al. |
2304.10994v1 |
null |
2023-04-24 |
Text2Time: Transformer-based Article Time Period Prediction |
Karthick Prasad Gunasekaran et.al. |
2304.10859v2 |
null |
2023-04-21 |
Hyperbolic Geometry in Computer Vision: A Survey |
Pengfei Fang et.al. |
2304.10764v1 |
null |
2023-04-21 |
Improving Grounded Language Understanding in a Collaborative Environment by Interacting with Agents Through Help Feedback |
Nikhil Mehta et.al. |
2304.10750v1 |
null |
2023-04-20 |
IXA/Cogcomp at SemEval-2023 Task 2: Context-enriched Multilingual Named Entity Recognition using Knowledge Bases |
Iker García-Ferrero et.al. |
2304.10637v1 |
link |
2023-04-20 |
An Introduction to Transformers |
Richard E. Turner et.al. |
2304.10557v1 |
null |
2023-04-20 |
Multidimensional Uncertainty Quantification for Deep Neural Networks |
Xujiang Zhao et.al. |
2304.10527v1 |
null |
2023-04-20 |
Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health |
Shaoxiong Ji et.al. |
2304.10447v1 |
null |
2023-04-20 |
OptoGPT: A Foundation Model for Inverse Design in Optical Multilayer Thin Film Structures |
Taigao Ma et.al. |
2304.10294v1 |
null |
2023-04-20 |
Is augmentation effective to improve prediction in imbalanced text datasets? |
Gabriel O. Assunção et.al. |
2304.10283v1 |
null |
2023-04-20 |
Replication and Verifiability in Requirements Engineering: the NLP for RE Case |
Sallam Abualhaija et.al. |
2304.10265v1 |
null |
2023-04-20 |
Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning? |
Haoyang Peng et.al. |
2304.10224v1 |
null |
2023-04-19 |
Radar de Parité: An NLP system to measure gender representation in French news stories |
Valentin-Gabriel Soumah et.al. |
2304.09982v1 |
link |
2023-04-19 |
SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery |
Lalithkumar Seenivasan et.al. |
2304.09974v1 |
link |
2023-04-19 |
Catch Me If You Can: Identifying Fraudulent Physician Reviews with Large Language Models Using Generative Pre-Trained Transformers |
Aishwarya Deep Shukla et.al. |
2304.09948v1 |
null |
2023-04-19 |
Transformer-Based Visual Segmentation: A Survey |
Xiangtai Li et.al. |
2304.09854v1 |
link |
2023-04-19 |
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models |
Pan Lu et.al. |
2304.09842v1 |
link |
2023-04-19 |
A Survey of Corpora for Germanic Low-Resource Languages and Dialects |
Verena Blaschke et.al. |
2304.09805v1 |
link |
2023-04-19 |
Bridging Natural Language Processing and Psycholinguistics: computationally grounded semantic similarity and relatedness datasets for Basque and Spanish |
J. Goikoetxea et.al. |
2304.09616v1 |
null |
2023-04-19 |
NetGPT: Generative Pretrained Transformer for Network Traffic |
Xuying Meng et.al. |
2304.09513v1 |
null |
2023-04-18 |
Revisiting k-NN for Pre-trained Language Models |
Lei Li et.al. |
2304.09058v1 |
link |
2023-04-18 |
From Words to Music: A Study of Subword Tokenization Techniques in Symbolic Music Generation |
Adarsh Kumar et.al. |
2304.08953v1 |
null |
2023-04-18 |
Along the Margins: Marginalized Communities' Ethical Concerns about Social Platforms |
Lauren Olson et.al. |
2304.08882v1 |
null |
2023-04-18 |
A Survey on Biomedical Text Summarization with Pre-trained Language Model |
Qianqian Xie et.al. |
2304.08763v1 |
null |
2023-04-17 |
Classification of US Supreme Court Cases using BERT-Based Techniques |
Shubham Vatsal et.al. |
2304.08649v1 |
link |
2023-04-17 |
Improving Autoregressive NLP Tasks via Modular Linearized Attention |
Victor Agostinelli et.al. |
2304.08453v1 |
null |
2023-04-17 |
Physics-inspired Neuroacoustic Computing Based on Tunable Nonlinear Multiple-scattering |
Ali Momeni et.al. |
2304.08380v1 |
null |
2023-04-17 |
Use of social media and Natural Language Processing (NLP) in natural hazard research |
José Augusto Proença Maia Devienne et.al. |
2304.08341v1 |
null |
2023-04-17 |
Thorny Roses: Investigating the Dual Use Dilemma in Natural Language Processing |
Lucie-Aimée Kaffee et.al. |
2304.08315v1 |
link |
2023-04-17 |
Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca |
Yiming Cui et.al. |
2304.08177v1 |
link |
2023-04-17 |
A Survey on Few-Shot Class-Incremental Learning |
Songsong Tian et.al. |
2304.08130v1 |
null |
2023-04-18 |
A Comparative Study between Full-Parameter and LoRA-based Fine-Tuning on Chinese Instruction Data for Instruction Following Large Language Model |
Xianghui Sun et.al. |
2304.08109v2 |
link |
2023-04-16 |
Chain of Thought Prompt Tuning in Vision Language Models |
Jiaxin Ge et.al. |
2304.07919v1 |
null |
2023-04-16 |
It's All in the Embedding! Fake News Detection Using Document Embeddings |
Ciprian-Octavian Truică et.al. |
2304.07781v1 |
link |
2023-04-16 |
Syntactic Complexity Identification, Measurement, and Reduction Through Controlled Syntactic Simplification |
Muhammad Salman et.al. |
2304.07774v1 |
link |
2023-04-14 |
Optimal inference of a generalised Potts model by single-layer transformers with factored attention |
Riccardo Rende et.al. |
2304.07235v1 |
null |
2023-04-14 |
DINOv2: Learning Robust Visual Features without Supervision |
Maxime Oquab et.al. |
2304.07193v1 |
link |
2023-04-14 |
Just Tell Me: Prompt Engineering in Business Process Management |
Kiran Busch et.al. |
2304.07183v1 |
null |
2023-04-14 |
Radio Galaxy Zoo EMU: Towards a Semantic Radio Galaxy Morphology Taxonomy |
Micah Bowles et.al. |
2304.07171v1 |
link |
2023-04-14 |
HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge |
Haochun Wang et.al. |
2304.06975v1 |
link |
2023-04-14 |
Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding |
Yu-Qi Yang et.al. |
2304.06906v1 |
link |
2023-04-14 |
Tempo vs. Pitch: understanding self-supervised tempo estimation |
Giovana Morais et.al. |
2304.06868v1 |
link |
2023-04-14 |
Exploring the State of the Art in Legal QA Systems |
Abdelrahman Abdallah et.al. |
2304.06623v2 |
link |
2023-04-13 |
Solving Tensor Low Cycle Rank Approximation |
Yichuan Deng et.al. |
2304.06594v1 |
null |
2023-04-13 |
Efficient Multimodal Fusion via Interactive Prompting |
Yaowei Li et.al. |
2304.06306v1 |
null |
2023-04-12 |
AGI for Agriculture |
Guoyu Lu et.al. |
2304.06136v1 |
null |
2023-04-12 |
ReDWINE: A Clinical Datamart with Text Analytical Capabilities to Facilitate Rehabilitation Research |
David Oniani et.al. |
2304.05929v1 |
null |
2023-04-12 |
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning |
Viet Dac Lai et.al. |
2304.05613v1 |
null |
2023-04-11 |
A Survey of Resources and Methods for Natural Language Processing of Serbian Language |
Ulfeta A. Marovac et.al. |
2304.05468v1 |
null |
2023-04-10 |
SAM.MD: Zero-shot medical image segmentation capabilities of the Segment Anything Model |
Saikat Roy et.al. |
2304.05396v1 |
null |
2023-04-10 |
The Wall Street Neophyte: A Zero-Shot Analysis of ChatGPT Over MultiModal Stock Movement Prediction Challenges |
Qianqian Xie et.al. |
2304.05351v1 |
null |
2023-04-11 |
Toxicity in ChatGPT: Analyzing Persona-assigned Language Models |
Ameet Deshpande et.al. |
2304.05335v1 |
null |
2023-04-11 |
Prompt Learning for News Recommendation |
Zizhuo Zhang et.al. |
2304.05263v1 |
link |
2023-04-12 |
r-softmax: Generalized Softmax with Controllable Sparsity Rate |
Klaudia Bałazy et.al. |
2304.05243v2 |
link |
2023-04-11 |
What Food Do We Tweet about on a Rainy Day? |
Maija Kāle et.al. |
2304.05041v1 |
null |
2023-04-10 |
SELFormer: Molecular Representation Learning via SELFIES Language Models |
Atakan Yüksel et.al. |
2304.04662v1 |
link |
2023-04-10 |
On Evaluation of Bangla Word Analogies |
Mousumi Akter et.al. |
2304.04613v1 |
null |
2023-04-10 |
Two Steps Forward and One Behind: Rethinking Time Series Forecasting with Deep Learning |
Riccardo Ughi et.al. |
2304.04553v1 |
null |
2023-04-09 |
Extractive Summarization via ChatGPT for Faithful Summary Generation |
Haopeng Zhang et.al. |
2304.04193v1 |
null |
2023-04-08 |
MphayaNER: Named Entity Recognition for Tshivenda |
Rendani Mbuvha et.al. |
2304.03952v1 |
link |
2023-04-08 |
GPT4Rec: A Generative Framework for Personalized Recommendation and User Interests Interpretation |
Jinming Li et.al. |
2304.03879v1 |
null |
2023-04-07 |
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review |
Li Shen et.al. |
2304.03589v1 |
null |
2023-04-07 |
HyperTab: Hypernetwork Approach for Deep Learning on Small Tabular Datasets |
Witold Wydmański et.al. |
2304.03543v1 |
link |
2023-04-06 |
Using LSTM and GRU With a New Dataset for Named Entity Recognition in the Arabic Language |
Alaa Shaker et.al. |
2304.03399v1 |
null |
2023-04-06 |
Deep Learning for Opinion Mining and Topic Classification of Course Reviews |
Anna Koufakou et.al. |
2304.03394v1 |
null |
2023-04-06 |
Entity Graphs for Exploring Online Discourse |
Nicholas Botzer et.al. |
2304.03351v1 |
null |
2023-04-06 |
On the Evaluations of ChatGPT and Emotion-enhanced Prompting for Mental Health Analysis |
Kailai Yang et.al. |
2304.03347v1 |
link |
2023-04-06 |
Locate: Low-Power Viterbi Decoder Exploration using Approximate Adders |
Rajat Bhattacharjya et.al. |
2304.03257v1 |
null |
2023-04-06 |
Bridging the Language Gap: Knowledge Injected Multilingual Question Answering |
Zhichao Duan et.al. |
2304.03159v1 |
null |
2023-04-06 |
Zero-Shot Next-Item Recommendation using Large Pretrained Language Models |
Lei Wang et.al. |
2304.03153v1 |
null |
2023-04-06 |
BotTriNet: A Unified and Efficient Embedding for Social Bots Detection via Metric Learning |
Jun Wu et.al. |
2304.03144v1 |
null |
2023-04-06 |
Static Fuzzy Bag-of-Words: a lightweight sentence embedding algorithm |
Matteo Muffo et.al. |
2304.03098v1 |
null |
2023-04-06 |
PointCAT: Cross-Attention Transformer for point cloud |
Xincheng Yang et.al. |
2304.03012v1 |
link |
2023-04-06 |
Can Large Language Models Play Text Games Well? Current State-of-the-Art and Open Questions |
Chen Feng Tsai et.al. |
2304.02868v1 |
null |
2023-04-06 |
Opportunities and challenges of ChatGPT for design knowledge management |
Xin Hu et.al. |
2304.02796v1 |
null |
2023-04-05 |
Application of Transformers based methods in Electronic Medical Records: A Systematic Literature Review |
Vitor Alcantara Batista et.al. |
2304.02768v1 |
link |
2023-04-05 |
The Saudi Privacy Policy Dataset |
Hend Al-Khalifa et.al. |
2304.02757v1 |
link |
2023-04-06 |
ParroT: Translating During Chat Using Large Language Models |
Wenxiang Jiao et.al. |
2304.02426v2 |
link |
2023-04-05 |
Machine Learning of Public Sentiments toward Wind Energy in Norway |
Oskar Vågerö et.al. |
2304.02388v1 |
null |
2023-04-05 |
Document-Level Machine Translation with Large Language Models |
Longyue Wang et.al. |
2304.02210v1 |
link |
2023-04-05 |
Unleashing the Power of ChatGPT for Translation: An Empirical Study |
Yuan Gao et.al. |
2304.02182v1 |
null |
2023-04-04 |
PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models |
Aditi Mishra et.al. |
2304.01964v1 |
null |
2023-04-04 |
Summary of ChatGPT/GPT-4 Research and Perspective Towards the Future of Large Language Models |
Yiheng Liu et.al. |
2304.01852v1 |
null |
2023-04-04 |
Is ChatGPT a Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation |
Tao Fang et.al. |
2304.01746v1 |
null |
2023-04-04 |
Rumour Detection and Analysis on Twitter |
Yaohou Fan et.al. |
2304.01712v1 |
null |
2023-04-04 |
A Survey on Contextualised Semantic Shift Detection |
Stefano Montanelli et.al. |
2304.01666v1 |
null |
2023-04-04 |
Neural Comprehension: Language Models with Compiled Neural Networks |
Yixuan Weng et.al. |
2304.01665v1 |
link |
2023-04-04 |
EDeR: A Dataset for Exploring Dependency Relations Between Events |
Ruiqi Li et.al. |
2304.01612v1 |
link |
2023-04-04 |
G2PTL: A Pre-trained Model for Delivery Address and its Applications in Logistics System |
Lixia Wu et.al. |
2304.01559v1 |
null |
2023-04-04 |
RARE: Robust Masked Graph Autoencoder |
Wenxuan Tu et.al. |
2304.01507v1 |
null |
2023-04-04 |
Unsupervised Brain Tumor Segmentation with Image-based Prompts |
Xinru Zhang et.al. |
2304.01472v1 |
null |
2023-04-03 |
Changes to Captions: An Attentive Network for Remote Sensing Change Captioning |
Shizhen Chang et.al. |
2304.01091v1 |
link |
2023-04-03 |
DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains |
Yanis Labrak et.al. |
2304.00958v1 |
null |
2023-04-03 |
ScandEval: A Benchmark for Scandinavian Natural Language Processing |
Dan Saattrup Nielsen et.al. |
2304.00906v1 |
link |
2023-04-03 |
GreekBART: The First Pretrained Greek Sequence-to-Sequence Model |
Iakovos Evdaimon et.al. |
2304.00869v1 |
link |
2023-04-03 |
Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: A Preliminary Empirical Study |
Yi Chen et.al. |
2304.00723v1 |
null |
2023-04-03 |
MiniRBT: A Two-stage Distilled Small Chinese Pre-trained Model |
Xin Yao et.al. |
2304.00717v1 |
link |
2023-04-03 |
DiffuRec: A Diffusion Model for Sequential Recommendation |
Zihao Li et.al. |
2304.00686v1 |
link |
2023-04-02 |
Classifying COVID-19 Related Tweets for Fake News Detection and Sentiment Analysis with BERT-based Models |
Rabia Bounaama et.al. |
2304.00636v1 |
null |
2023-04-02 |
MMT: A Multilingual and Multi-Topic Indian Social Media Dataset |
Dwip Dalal et.al. |
2304.00634v1 |
null |
2023-04-02 |
Sequence-aware item recommendations for multiply repeated user-item interactions |
Juan Pablo Equihua et.al. |
2304.00578v1 |
null |
2023-03-31 |
A Closer Look at Parameter-Efficient Tuning in Diffusion Models |
Chendong Xiang et.al. |
2303.18181v1 |
link |
2023-03-31 |
BERTino: an Italian DistilBERT model |
Matteo Muffo et.al. |
2303.18121v1 |
link |
2023-03-31 |
Dataset and Baseline System for Multi-lingual Extraction and Normalization of Temporal and Numerical Expressions |
Sanxing Chen et.al. |
2303.18103v1 |
link |
2023-03-31 |
ConceptEVA: Concept-Based Interactive Exploration and Customization of Document Summaries |
Xiaoyu Zhang et.al. |
2303.17826v1 |
null |
2023-03-31 |
Attention is Not Always What You Need: Towards Efficient Classification of Domain-Specific Text |
Yasmen Wahba et.al. |
2303.17786v1 |
null |
2023-03-30 |
A CI-based Auditing Framework for Data Collection Practices |
Athina Markopoulou et.al. |
2303.17740v1 |
null |
2023-03-30 |
Evaluation of GPT and BERT-based models on identifying protein-protein interactions in biomedical text |
Hasin Rehana et.al. |
2303.17728v1 |
null |
2023-03-30 |
BOLT: An Automated Deep Learning Framework for Training and Deploying Large-Scale Neural Networks on Commodity CPU Hardware |
Nicholas Meisburger et.al. |
2303.17727v1 |
link |
2023-03-30 |
Whether and When does Endoscopy Domain Pretraining Make Sense? |
Dominik Batić et.al. |
2303.17636v1 |
null |
2023-03-30 |
A BERT-based Unsupervised Grammatical Error Correction Framework |
Nankai Lin et.al. |
2303.17367v1 |
null |
2023-03-30 |
Topics in the Haystack: Extracting and Evaluating Topics beyond Coherence |
Anton Thielmann et.al. |
2303.17324v1 |
null |
2023-03-29 |
Adapting to the Low-Resource Double-Bind: Investigating Low-Compute Methods on Low-Resource African Languages |
Colin Leong et.al. |
2303.16985v1 |
null |
2023-03-29 |
AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators |
Xingwei He et.al. |
2303.16854v1 |
link |
2023-03-27 |
ACO-tagger: A Novel Method for Part-of-Speech Tagging using Ant Colony Optimization |
Amirhossein Mohammadi et.al. |
2303.16760v1 |
null |
2023-03-28 |
How can Deep Learning Retrieve the Write-Missing Additional Diagnosis from Chinese Electronic Medical Record For DRG |
Shaohui Liu et.al. |
2303.16757v1 |
null |
2023-03-29 |
LMExplainer: a Knowledge-Enhanced Explainer for Language Models |
Zichen Chen et.al. |
2303.16537v1 |
null |
2023-03-28 |
Exploring Natural Language Processing Methods for Interactive Behaviour Modelling |
Guanhua Zhang et.al. |
2303.16039v1 |
null |
2023-03-28 |
Soft-prompt tuning to predict lung cancer using primary care free-text Dutch medical notes |
Auke Elfrink et.al. |
2303.15846v1 |
link |
2023-03-28 |
Evaluation of ChatGPT for NLP-based Mental Health Applications |
Bishal Lamichhane et.al. |
2303.15727v1 |
null |
2023-03-28 |
Explicit Planning Helps Language Models in Logical Reasoning |
Hongyu Zhao et.al. |
2303.15714v1 |
link |
2023-03-27 |
Typhoon: Towards an Effective Task-Specific Masking Strategy for Pre-trained Language Models |
Muhammed Shahir Abdurrahman et.al. |
2303.15619v1 |
null |
2023-03-27 |
Evaluating self-attention interpretability through human-grounded experimental protocol |
Milan Bhan et.al. |
2303.15190v1 |
null |
2023-03-27 |
unarXive 2022: All arXiv Publications Pre-Processed for NLP, Including Structured Full-Text and Citation Network |
Tarek Saier et.al. |
2303.14957v1 |
link |
2023-03-27 |
Unified Text Structuralization with Instruction-tuned Language Models |
Xuanfan Ni et.al. |
2303.14956v1 |
null |
2023-03-27 |
Improving Contextualized Topic Models with Negative Sampling |
Suman Adhya et.al. |
2303.14951v1 |
link |
2023-03-27 |
Coupling Artificial Neurons in BERT and Biological Neurons in the Human Brain |
Xu Liu et.al. |
2303.14871v1 |
null |
2023-03-26 |
MGTBench: Benchmarking Machine-Generated Text Detection |
Xinlei He et.al. |
2303.14822v1 |
link |
2023-03-26 |
Nature Language Reasoning, A Survey |
Fei Yu et.al. |
2303.14725v1 |
link |
2023-03-25 |
Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities |
Atnafu Lambebo Tonja et.al. |
2303.14406v1 |
link |
2023-03-25 |
An Analysis of GPT-3's Performance in Grammatical Error Correction |
Steven Coyne et.al. |
2303.14342v1 |
null |
2023-03-25 |
Backdoor Attacks with Input-unique Triggers in NLP |
Xukun Zhou et.al. |
2303.14325v1 |
null |
2023-03-24 |
The crime of being poor |
Georgina Curto et.al. |
2303.14128v1 |
null |
2023-03-24 |
Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods |
Thilo Hagendorff et.al. |
2303.13988v1 |
null |
2023-03-24 |
Unleasing ChatGPT on the Metaverse: Savior or Destroyer? |
Pengyuan Zhou et.al. |
2303.13856v1 |
null |
2023-03-24 |
Where to Go Next for Recommender Systems? ID- vs. Modality-based recommender models revisited |
Zheng Yuan et.al. |
2303.13835v1 |
link |
2023-03-24 |
Natural language processing to automatically extract the presence and severity of esophagitis in notes of patients undergoing radiotherapy |
Shan Chen et.al. |
2303.13722v1 |
link |
2023-03-23 |
Primer: Fast Private Transformer Inference on Encrypted Data |
Mengxin Zheng et.al. |
2303.13679v1 |
null |
2023-03-23 |
Prompting Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages |
Zheng-Xin Yong et.al. |
2303.13592v1 |
null |
2023-03-23 |
Return of the RNN: Residual Recurrent Networks for Invertible Sentence Embeddings |
Jeremy Wilkerson et.al. |
2303.13570v1 |
null |
2023-03-22 |
Extracting Physical Rehabilitation Exercise Information from Clinical Notes: a Comparison of Rule-Based and Machine Learning Natural Language Processing Techniques |
Stephen W. Shaffran et.al. |
2303.13466v1 |
null |
2023-03-23 |
Human Behavior in the Time of COVID-19: Learning from Big Data |
Hanjia Lyu et.al. |
2303.13452v1 |
null |
2023-03-23 |
Development and validation of a natural language processing algorithm to pseudonymize documents in the context of a clinical data warehouse |
Xavier Tannier et.al. |
2303.13451v1 |
null |
2023-03-23 |
Parameter-Efficient Sparse Retrievers and Rerankers using Adapters |
Vaishali Pal et.al. |
2303.13220v1 |
link |
2023-03-22 |
Analyzing the Generalizability of Deep Contextualized Language Representations For Text Classification |
Berfu Buyukoz et.al. |
2303.12936v1 |
null |
2023-03-22 |
TRON: Transformer Neural Network Acceleration with Non-Coherent Silicon Photonics |
Salma Afifi et.al. |
2303.12914v1 |
null |
2023-03-22 |
A Small-Scale Switch Transformer and NLP-based Model for Clinical Narratives Classification |
Thanh-Dung Le et.al. |
2303.12892v1 |
null |
2023-03-22 |
MEGA: Multilingual Evaluation of Generative AI |
Kabir Ahuja et.al. |
2303.12528v1 |
null |
2023-03-22 |
System and Design Technology Co-optimization of SOT-MRAM for High-Performance AI Accelerator Memory System |
Kaniz Mishty et.al. |
2303.12310v1 |
null |
2023-03-21 |
Machine Learning for Brain Disorders: Transformers and Visual Transformers |
Robin Courant et.al. |
2303.12068v1 |
null |
2023-03-21 |
Transformers in Speech Processing: A Survey |
Siddique Latif et.al. |
2303.11607v1 |
null |
2023-03-21 |
Difficulty in learning chirality for Transformer fed with SMILES |
Yasuhiro Yoshikai et.al. |
2303.11593v1 |
link |
2023-03-21 |
SIFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency |
Shreyas Saxena et.al. |
2303.11525v1 |
link |
2023-03-20 |
Investigating Topological Order using Recurrent Neural Networks |
Mohamed Hibat-Allah et.al. |
2303.11207v1 |
null |
2023-03-20 |
On the Educational Impact of ChatGPT: Is Artificial Intelligence Ready to Obtain a University Degree? |
Kamil Malinka et.al. |
2303.11146v1 |
null |
2023-03-20 |
Controllable Ancient Chinese Lyrics Generation Based on Phrase Prototype Retrieving |
Li Yi et.al. |
2303.11005v1 |
null |
2023-03-20 |
Translate your gibberish: black-box adversarial attack on machine translation systems |
Andrei Chertkov et.al. |
2303.10974v1 |
link |
2023-03-20 |
Self-Improving-Leaderboard(SIL): A Call for Real-World Centric Natural Language Processing Leaderboards |
Chanjun Park et.al. |
2303.10888v1 |
null |
2023-03-20 |
NASA Science Mission Directorate Knowledge Graph Discovery |
Roelien C. Timmer et.al. |
2303.10871v1 |
null |
2023-03-20 |
Multi-task Transformer with Relation-attention and Type-attention for Named Entity Recognition |
Ying Mo et.al. |
2303.10870v1 |
null |
2023-03-18 |
Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning |
Qingru Zhang et.al. |
2303.10512v1 |
link |
2023-03-18 |
A Deep Learning System for Domain-specific speech Recognition |
Yanan Jia et.al. |
2303.10510v1 |
null |
2023-03-18 |
Is Prompt All You Need? No. A Comprehensive and Broader View of Instruction Learning |
Renze Lou et.al. |
2303.10475v1 |
link |
2023-03-17 |
IRGen: Generative Modeling for Image Retrieval |
Yidan Zhang et.al. |
2303.10126v1 |
link |
2023-03-17 |
STIXnet: A Novel and Modular Solution for Extracting All STIX Objects in CTI Reports |
Francesco Marchiori et.al. |
2303.09999v1 |
link |
2023-03-17 |
CoLT5: Faster Long-Range Transformers with Conditional Computation |
Joshua Ainslie et.al. |
2303.09752v1 |
null |
2023-03-16 |
Measuring Improvement of F $_1$ -Scores in Detection of Self-Admitted Technical Debt |
William Aiken et.al. |
2303.09617v1 |
null |
2023-03-17 |
BanglaCoNER: Towards Robust Bangla Complex Named Entity Recognition |
HAZ Sameen Shahgir et.al. |
2303.09306v2 |
link |
2023-03-16 |
Block-wise Bit-Compression of Transformer-based Models |
Gaochen Dong et.al. |
2303.09184v1 |
null |
2023-03-16 |
A Short Survey of Viewing Large Language Models in Legal Aspect |
Zhongxiang Sun et.al. |
2303.09136v1 |
link |
2023-03-15 |
Cross-domain Sentiment Classification in Spanish |
Lautaro Estienne et.al. |
2303.08985v1 |
null |
2023-03-17 |
Automated Interactive Domain-Specific Conversational Agents that Understand Human Dialogs |
Yankai Zeng et.al. |
2303.08941v2 |
null |
2023-03-15 |
Applying unsupervised keyphrase methods on concepts extracted from discharge sheets |
Hoda Memarzadeh et.al. |
2303.08928v1 |
null |
2023-03-15 |
ROSE: A Neurocomputational Architecture for Syntax |
Elliot Murphy et.al. |
2303.08877v1 |
null |
2023-03-15 |
Building an Effective Email Spam Classification Model with spaCy |
Kazem Taghandiki et.al. |
2303.08792v1 |
null |
2023-03-14 |
Clinical Concept and Relation Extraction Using Prompt-based Machine Reading Comprehension |
Cheng Peng et.al. |
2303.08262v1 |
null |
2023-03-14 |
Contextualized Medication Information Extraction Using Transformer-based Deep Learning Architectures |
Aokun Chen et.al. |
2303.08259v1 |
null |
2023-03-14 |
Progress Note Understanding -- Assessment and Plan Reasoning: Overview of the 2022 N2C2 Track 3 Shared Task |
Yanjun Gao et.al. |
2303.08038v1 |
null |
2023-03-14 |
Geolocation Predicting of Tweets Using BERT-Based Models |
Kateryna Lutsai et.al. |
2303.07865v1 |
link |
2023-03-14 |
Input-length-shortening and text generation via attention values |
Neşet Özkan Tan et.al. |
2303.07585v1 |
null |
2023-03-14 |
Diffusion Models in NLP: A Survey |
Yuansong Zhu et.al. |
2303.07576v1 |
null |
2023-03-13 |
Automated Vulnerability Detection in Source Code Using Quantum Natural Language Processing |
Mst Shapna Akter et.al. |
2303.07525v1 |
null |
2023-03-13 |
X-Former: In-Memory Acceleration of Transformers |
Shrihari Sridharan et.al. |
2303.07470v1 |
null |
2023-03-13 |
Learning the language of QCD jets with transformers |
Thorben Finke et.al. |
2303.07364v1 |
null |
2023-03-13 |
Scaling Vision-Language Models with Sparse Mixture of Experts |
Sheng Shen et.al. |
2303.07226v1 |
null |
2023-03-13 |
A Comprehensive Empirical Evaluation of Existing Word Embedding Approaches |
Obaidullah Zaland et.al. |
2303.07196v1 |
null |
2023-03-13 |
$\nabla$ SD: Differentiable Programming for Sparse Tensors |
Amir Shaikhha et.al. |
2303.07030v1 |
null |
2023-03-13 |
Roadmap towards Meta-being |
Tianyi Huang et.al. |
2303.06795v1 |
null |
2023-03-12 |
AidUI: Toward Automated Recognition of Dark Patterns in User Interfaces |
SM Hasan Mansur et.al. |
2303.06782v1 |
link |
2023-03-12 |
Diffusion Models for Non-autoregressive Text Generation: A Survey |
Yifan Li et.al. |
2303.06574v1 |
link |
2023-03-11 |
Graph Neural Network contextual embedding for Deep Learning on Tabular Data |
Mario Villaizán-Vallelado et.al. |
2303.06455v1 |
link |
2023-03-11 |
Explainable AI for Time Series via Virtual Inspection Layers |
Johanna Vielhaben et.al. |
2303.06365v1 |
null |
2023-03-10 |
Generating Query Focused Summaries without Fine-tuning the Transformer-based Pre-trained Models |
Deen Abdullah et.al. |
2303.06230v1 |
null |
2023-03-10 |
Towards MoE Deployment: Mitigating Inefficiencies in Mixture-of-Expert (MoE) Inference |
Haiyang Huang et.al. |
2303.06182v1 |
null |
2023-03-10 |
Distributionally Robust Optimization with Probabilistic Group |
Soumya Suvra Ghosal et.al. |
2303.05809v1 |
link |
2023-03-10 |
An Overview on Language Models: Recent Developments and Outlook |
Chengwei Wei et.al. |
2303.05759v1 |
null |
2023-03-10 |
Research on CPI Prediction Based on Natural Language Processing |
Xiaobin Tang et.al. |
2303.05666v1 |
null |
2023-03-09 |
Open World Classification with Adaptive Negative Samples |
Ke Bai et.al. |
2303.05581v1 |
null |
2023-03-08 |
Automatic Detection of Industry Sectors in Legal Articles Using Machine Learning Approaches |
Hui Yang et.al. |
2303.05387v1 |
null |
2023-03-09 |
Dynamic Stashing Quantization for Efficient Transformer Training |
Guo Yang et.al. |
2303.05295v1 |
null |
2023-03-09 |
Can a Frozen Pretrained Language Model be used for Zero-shot Neural Retrieval on Entity-centric Questions? |
Yasuto Hoshi et.al. |
2303.05153v1 |
null |
2023-03-09 |
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction |
Jiabang He et.al. |
2303.05063v1 |
link |
2023-03-09 |
Rethinking Visual Prompt Learning as Masked Visual Token Modeling |
Ning Liao et.al. |
2303.04998v1 |
null |
2023-03-08 |
DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks |
Zohreh Aghababaeyan et.al. |
2303.04878v1 |
link |
2023-03-08 |
Non-Binary Gender Expression in Online Interactions |
Rebecca Dorn et.al. |
2303.04837v1 |
null |
2023-03-08 |
Comprehensive Event Representations using Event Knowledge Graphs and Natural Language Processing |
Tin Kuculo et.al. |
2303.04794v1 |
null |
2023-03-08 |
Student's t-Distribution: On Measuring the Inter-Rater Reliability When the Observations are Scarce |
Serge Gladkoff et.al. |
2303.04526v1 |
null |
2023-03-08 |
An Annexure to the Paper "Driving the Technology Value Stream by Analyzing App Reviews" |
Souvick Das et.al. |
2303.04519v1 |
null |
2023-03-07 |
A Challenging Benchmark for Low-Resource Learning |
Yudong Wang et.al. |
2303.03840v1 |
link |
2023-03-07 |
Exploring the Feasibility of ChatGPT for Event Extraction |
Jun Gao et.al. |
2303.03836v1 |
null |
2023-03-06 |
Multi-resolution Interpretation and Diagnostics Tool for Natural Language Classifiers |
Peyman Jalali et.al. |
2303.03542v1 |
null |
2023-03-06 |
Guilt Detection in Text: A Step Towards Understanding Complex Emotions |
Abdul Gafar Manuel Meque et.al. |
2303.03510v1 |
null |
2023-03-06 |
On the Visualisation of Argumentation Graphs to Support Text Interpretation |
Hanadi Mardah et.al. |
2303.03235v1 |
null |
2023-03-03 |
Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT |
Mostafa M. Amin et.al. |
2303.03186v1 |
null |
2023-03-03 |
Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis |
Vikramjit Mitra et.al. |
2303.03177v1 |
null |
2023-03-06 |
GlobalNER: Incorporating Non-local Information into Named Entity Recognition |
Chiao-Wei Hsu et.al. |
2303.02915v1 |
null |
2023-03-06 |
Artificial Intelligence: 70 Years Down the Road |
Lin Zhang et.al. |
2303.02819v1 |
null |
2023-03-05 |
Swim: A General-Purpose, High-Performing, and Efficient Activation Function for Locomotion Control Tasks |
Maryam Abdool et.al. |
2303.02640v1 |
link |
2023-03-04 |
Variational Quantum Classifiers for Natural-Language Text |
Daniel T. Chang et.al. |
2303.02469v1 |
null |
2023-03-04 |
Calibrating Transformers via Sparse Gaussian Processes |
Wenlong Chen et.al. |
2303.02444v1 |
link |
2023-03-03 |
TrojText: Test-time Invisible Textual Trojan Insertion |
Yepeng Liu et.al. |
2303.02242v1 |
link |
2023-03-03 |
Exploring Data Augmentation Methods on Social Media Corpora |
Isabel Garcia Pietri et.al. |
2303.02198v1 |
null |
2023-03-02 |
DeepLens: Interactive Out-of-distribution Data Detection in NLP Models |
Da Song et.al. |
2303.01577v1 |
link |
2023-03-02 |
DeepSeer: Interactive RNN Explanation and Debugging via State Abstraction |
Zhijie Wang et.al. |
2303.01576v1 |
link |
2023-03-02 |
Local data structures |
J. F. Jardine et.al. |
2303.01415v1 |
null |
2023-03-02 |
Letz Translate: Low-Resource Machine Translation for Luxembourgish |
Yewei Song et.al. |
2303.01347v1 |
null |
2023-03-01 |
Frauds Bargain Attack: Generating Adversarial Text Samples via Word Manipulation Process |
Mingze Ni et.al. |
2303.01234v1 |
link |
2023-03-02 |
Can BERT Refrain from Forgetting on Sequential Tasks? A Probing Study |
Mingxu Tao et.al. |
2303.01081v1 |
link |
2023-03-01 |
SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks |
Kai-Wei Chang et.al. |
2303.00733v1 |
null |
2023-03-01 |
Uzbek text summarization based on TF-IDF |
Khabibulla Madatov et.al. |
2303.00461v1 |
null |
2023-03-01 |
How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks |
Xuanting Chen et.al. |
2303.00293v1 |
null |
2023-03-01 |
Machine-learning Repurposing of DrugBank Compounds for Opioid Use Disorder |
Hongsong Feng et.al. |
2303.00240v1 |
link |
2023-02-28 |
Automatic Scoring of Dream Reports' Emotional Content with Large Language Models |
Lorenzo Bertolini et.al. |
2302.14828v1 |
link |
2023-02-28 |
AccelTran: A Sparsity-Aware Accelerator for Dynamic Inference with Transformers |
Shikhar Tuli et.al. |
2302.14705v1 |
link |
2023-02-28 |
Improving Expert Specialization in Mixture of Experts |
Yamuna Krishnamurthy et.al. |
2302.14703v1 |
link |
2023-02-27 |
SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic Speech Processing |
Weidong Chen et.al. |
2302.14638v1 |
link |
2023-02-28 |
H-AES: Towards Automated Essay Scoring for Hindi |
Shubhankar Singh et.al. |
2302.14635v1 |
link |
2023-02-28 |
A Survey on Long Text Modeling with Transformers |
Zican Dong et.al. |
2302.14502v1 |
null |
2023-02-28 |
Text classification dataset and analysis for Uzbek language |
Elmurod Kuriyozov et.al. |
2302.14494v1 |
link |
2023-02-28 |
Efficient Masked Autoencoders with Self-Consistency |
Zhaowen Li et.al. |
2302.14431v1 |
null |
2023-02-28 |
HugNLP: A Unified and Comprehensive Library for Natural Language Processing |
Jianing Wang et.al. |
2302.14286v1 |
link |
2023-02-27 |
Inseq: An Interpretability Toolkit for Sequence Generation Models |
Gabriele Sarti et.al. |
2302.13942v1 |
link |
2023-02-24 |
Adapting Pre-trained Language Models for Quantum Natural Language Processing |
Qiuchi Li et.al. |
2302.13812v1 |
null |
2023-02-26 |
A Survey on Uncertainty Quantification Methods for Deep Neural Networks: An Uncertainty Source Perspective |
Wenchong He et.al. |
2302.13425v1 |
null |
2023-02-26 |
From Audio to Symbolic Encoding |
Shenli Yuan et.al. |
2302.13401v1 |
null |
2023-02-26 |
The blame game: Understanding blame assignment in social media |
Ruijie Xi et.al. |
2302.13352v1 |
null |
2023-02-26 |
Bayesian Networks for Named Entity Prediction in Programming Community Question Answering |
Alexey Gorbatovski et.al. |
2302.13253v1 |
null |
2023-02-25 |
ChatAug: Leveraging ChatGPT for Text Data Augmentation |
Haixing Dai et.al. |
2302.13007v1 |
null |
2023-02-24 |
STA: Self-controlled Text Augmentation for Improving Text Classifications |
Congcong Wang et.al. |
2302.12784v1 |
link |
2023-02-24 |
Time-aware Multiway Adaptive Fusion Network for Temporal Knowledge Graph Question Answering |
Yonghao Liu et.al. |
2302.12529v1 |
null |
2023-02-24 |
SGL-PT: A Strong Graph Learner with Graph Prompt Tuning |
Yun Zhu et.al. |
2302.12449v1 |
null |
2023-02-23 |
What makes a language easy to deep-learn? |
Lukas Galke et.al. |
2302.12239v1 |
link |
2023-02-23 |
Deep learning model for Mongolian Citizens Feedback Analysis using Word Vector Embeddings |
Zolzaya Dashdorj et.al. |
2302.12069v1 |
null |
2023-02-23 |
Natural Language Processing in the Legal Domain |
Daniel Martin Katz et.al. |
2302.12039v1 |
null |
2023-02-23 |
Sentence Simplification via Large Language Models |
Yutao Feng et.al. |
2302.11957v1 |
link |
2023-02-23 |
Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers |
Minsoo Kim et.al. |
2302.11812v1 |
link |
2023-02-23 |
MUTANT: A Multi-sentential Code-mixed Hinglish Dataset |
Rahul Gupta et.al. |
2302.11766v1 |
null |
2023-02-24 |
VLSP2022 EVJVQA Challenge: Multilingual Visual Question Answering |
Ngan Luu-Thuy Nguyen et.al. |
2302.11752v2 |
null |
2023-02-22 |
Scaling Robot Learning with Semantically Imagined Experience |
Tianhe Yu et.al. |
2302.11550v1 |
null |
2023-02-22 |
Data Augmentation for Neural NLP |
Domagoj Pluščec et.al. |
2302.11412v1 |
null |
2023-02-22 |
Learning from Predictions: Fusing Training and Autoregressive Inference for Long-Term Spatiotemporal Forecasts |
Pantelis R. Vlachas et.al. |
2302.11101v1 |
null |
2023-02-22 |
Preventing Catastrophic Forgetting in Continual Learning of New Natural Language Tasks |
Sudipta Kar et.al. |
2302.11074v1 |
null |
2023-02-21 |
Device Tuning for Multi-Task Large Model |
Penghao Jiang et.al. |
2302.10820v1 |
null |
2023-02-21 |
ChatGPT: Jack of all trades, master of none |
Jan Kocoń et.al. |
2302.10724v1 |
link |
2023-02-21 |
NLPLego: Assembling Test Generation for Natural Language Processing Applications |
Pin Ji et.al. |
2302.10499v1 |
null |
2023-02-21 |
Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines |
Min Cen et.al. |
2302.10406v1 |
null |
2023-02-20 |
Exploring the Limits of Transfer Learning with Unified Model in the Cybersecurity Domain |
Kuntal Kumar Pal et.al. |
2302.10346v1 |
null |
2023-02-20 |
Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey |
Xiao Wang et.al. |
2302.10035v1 |
link |
2023-02-20 |
Boosting classification reliability of NLP transformer models in the long run |
Zoltán Kmetty et.al. |
2302.10016v1 |
null |
2023-02-19 |
mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization |
Kayhan Behdin et.al. |
2302.09693v1 |
null |
2023-02-19 |
Optimization Methods in Deep Learning: A Comprehensive Overview |
David Shulman et.al. |
2302.09566v1 |
null |
2023-02-19 |
SanskritShala: A Neural Sanskrit NLP Toolkit with Web-Based Interface for Pedagogical and Annotation Purposes |
Jivnesh Sandhan et.al. |
2302.09527v1 |
link |
2023-02-18 |
BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark |
Dakuan Lu et.al. |
2302.09432v1 |
link |
2023-02-18 |
A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT |
Ce Zhou et.al. |
2302.09419v1 |
null |
2023-02-18 |
Redes Generativas Adversarias (GAN) Fundamentos Teóricos y Aplicaciones |
Jordi de la Torre et.al. |
2302.09346v1 |
null |
2023-02-18 |
Transformadores: Fundamentos teoricos y Aplicaciones |
Jordi de la Torre et.al. |
2302.09327v1 |
null |
2023-02-17 |
Extraction of Constituent Factors of Digestion Efficiency in Information Transfer by Media Composed of Texts and Images |
Koike Hiroaki et.al. |
2302.09189v1 |
null |
2023-02-17 |
Massively Multilingual Shallow Fusion with Large Language Models |
Ke Hu et.al. |
2302.08917v1 |
null |
2023-02-16 |
Role of Bias Terms in Dot-Product Attention |
Mahdi Namazifar et.al. |
2302.08626v1 |
null |
2023-02-16 |
What A Situated Language-Using Agent Must be Able to Do: A Top-Down Analysis |
David Schlangen et.al. |
2302.08590v1 |
null |
2023-02-16 |
Foundation Models for Natural Language Processing -- Pre-trained Language Models Integrating Media |
Gerhard Paaß et.al. |
2302.08575v1 |
null |
2023-02-16 |
THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic Compression |
Minghao Li et.al. |
2302.08545v1 |
link |
2023-02-16 |
Counting Carbon: A Survey of Factors Influencing the Emissions of Machine Learning |
Alexandra Sasha Luccioni et.al. |
2302.08476v1 |
link |
2023-02-17 |
Efficiency 360: Efficient Vision Transformers |
Badri N. Patro et.al. |
2302.08374v2 |
link |
2023-02-16 |
A Survey on Event-based News Narrative Extraction |
Brian Keith Norambuena et.al. |
2302.08351v1 |
null |
2023-02-16 |
Tuning computer vision models with task rewards |
André Susano Pinto et.al. |
2302.08242v1 |
link |
2023-02-16 |
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition |
Minsu Kim et.al. |
2302.08102v1 |
null |
2023-02-16 |
Exploring the Limits of ChatGPT for Query or Aspect-based Text Summarization |
Xianjun Yang et.al. |
2302.08081v1 |
null |
2023-02-16 |
LabelPrompt: Effective Prompt-based Learning for Relation Classification |
Wenjie Zhang et.al. |
2302.08068v1 |
null |
2023-02-16 |
GraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural Networks |
Zemin Liu et.al. |
2302.08043v1 |
null |
2023-02-15 |
Commonsense Reasoning for Conversational AI: A Survey of the State of the Art |
Christopher Richardson et.al. |
2302.07926v1 |
null |
2023-02-15 |
Big Little Transformer Decoder |
Sehoon Kim et.al. |
2302.07863v1 |
link |
2023-02-15 |
Word class representations spontaneously emerge in a deep neural network trained on next word prediction |
Kishore Surendra et.al. |
2302.07588v1 |
null |
2023-02-14 |
Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models |
Shrimai Prabhumoye et.al. |
2302.07388v1 |
null |
2023-02-14 |
Few-shot learning approaches for classifying low resource domain specific software requirements |
Anmol Nayak et.al. |
2302.06951v1 |
null |
2023-02-14 |
SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains |
Koustava Goswami et.al. |
2302.06868v1 |
link |
2023-02-14 |
Language Model Analysis for Ontology Subsumption Inference |
Yuan He et.al. |
2302.06761v1 |
link |
2023-02-13 |
Large Scale Multi-Lingual Multi-Modal Summarization Dataset |
Yash Verma et.al. |
2302.06560v1 |
link |
2023-02-13 |
Visualizing Topic Uncertainty in Topic Modelling |
Peter Winker et.al. |
2302.06482v1 |
null |
2023-02-13 |
Linguistic ambiguity analysis in ChatGPT |
Miguel Ortega-Martín et.al. |
2302.06426v1 |
null |
2023-02-13 |
Dataset of Natural Language Queries for E-Commerce |
Andrea Papenmeier et.al. |
2302.06355v1 |
null |
2023-02-12 |
TextDefense: Adversarial Text Detection based on Word Importance Entropy |
Lujia Shen et.al. |
2302.05892v1 |
null |
2023-02-12 |
"Why is this misleading?": Detecting News Headline Hallucinations with Explanations |
Jiaming Shen et.al. |
2302.05852v1 |
null |
2023-02-11 |
Sequential Embedding-based Attentive (SEA) classifier for malware classification |
Muhammad Ahmed et.al. |
2302.05728v1 |
link |
2023-02-11 |
Synthesizing Human Gaze Feedback for Improved NLP Performance |
Varun Khurana et.al. |
2302.05721v1 |
null |
2023-02-11 |
MatKB: Semantic Search for Polycrystalline Materials Synthesis Procedures |
Xianjun Yang et.al. |
2302.05597v1 |
link |
2023-02-10 |
A Practical Mixed Precision Algorithm for Post-Training Quantization |
Nilesh Prasad Pandey et.al. |
2302.05397v1 |
null |
2023-02-10 |
Translating Natural Language to Planning Goals with Large-Language Models |
Yaqi Xie et.al. |
2302.05128v1 |
link |
2023-02-10 |
Step by Step Loss Goes Very Far: Multi-Step Quantization for Adversarial Text Attacks |
Piotr Gaiński et.al. |
2302.05120v1 |
link |
2023-02-09 |
Flexible, Model-Agnostic Method for Materials Data Extraction from Text Using General Purpose Language Models |
Maciej P. Polak et.al. |
2302.04914v1 |
null |
2023-02-09 |
AI-based Question Answering Assistance for Analyzing Natural-language Requirements |
Saad Ezzini et.al. |
2302.04793v1 |
null |
2023-02-09 |
Massively Multilingual Language Models for Cross Lingual Fact Extraction from Low Resource Indian Languages |
Bhavyajeet Singh et.al. |
2302.04790v1 |
link |
2023-02-09 |
Lightweight Transformers for Clinical Natural Language Processing |
Omid Rohanian et.al. |
2302.04725v1 |
link |
2023-02-09 |
Mixed-order self-paced curriculum learning for universal lesion detection |
Han Li et.al. |
2302.04677v1 |
null |
2023-02-09 |
NLP-based Decision Support System for Examination of Eligibility Criteria from Securities Prospectuses at the German Central Bank |
Christian Hänig et.al. |
2302.04562v1 |
null |
2023-02-09 |
Enhancing E-Commerce Recommendation using Pre-Trained Language Model and Fine-Tuning |
Nuofan Xu et.al. |
2302.04443v1 |
null |
2023-02-08 |
Sentiment analysis and opinion mining on educational data: A survey |
Thanveer Shaik et.al. |
2302.04359v1 |
null |
2023-02-08 |
CRL+: A Novel Semi-Supervised Deep Active Contrastive Representation Learning-Based Text Classification Model for Insurance Data |
Amir Namavar Jahromi et.al. |
2302.04343v1 |
null |
2023-02-08 |
Efficient Joint Learning for Clinical Named Entity Recognition and Relation Extraction Using Fourier Networks: A Use Case in Adverse Drug Events |
Anthony Yazdani et.al. |
2302.04185v1 |
link |
2023-02-08 |
Training-free Lexical Backdoor Attacks on Language Models |
Yujin Huang et.al. |
2302.04116v1 |
link |
2023-02-08 |
An Empirical Study of Uniform-Architecture Knowledge Distillation in Document Ranking |
Xubo Qin et.al. |
2302.04112v1 |
null |
2023-02-08 |
Automating Code-Related Tasks Through Transformers: The Impact of Pre-training |
Rosalia Tufano et.al. |
2302.04048v1 |
link |
2023-02-08 |
Revisiting Offline Compression: Going Beyond Factorization-based Methods for Transformer Language Models |
Mohammadreza Banaei et.al. |
2302.04045v1 |
link |
2023-02-08 |
On the Applicability of Language Models to Block-Based Programs |
Elisabeth Griebl et.al. |
2302.03927v1 |
null |
2023-02-08 |
CRAFT: Criticality-Aware Fault-Tolerance Enhancement Techniques for Emerging Memories-Based Deep Neural Networks |
Thai-Hoang Nguyen et.al. |
2302.03862v1 |
null |
2023-02-07 |
Pre-train, Prompt and Recommendation: A Comprehensive Survey of Language Modelling Paradigm Adaptations in Recommender Systems |
Peng Liu et.al. |
2302.03735v1 |
link |
2023-02-07 |
Characterizing Financial Market Coverage using Artificial Intelligence |
Jean Marie Tshimula et.al. |
2302.03694v1 |
null |
2023-02-08 |
A Survey on Arabic Named Entity Recognition: Past, Recent Advances, and Future Trends |
Xiaoye Qu et.al. |
2302.03512v2 |
null |
2023-02-07 |
Natural Language Processing for Policymaking |
Zhijing Jin et.al. |
2302.03490v1 |
null |
2023-02-06 |
APAM: Adaptive Pre-training and Adaptive Meta Learning in Language Model for Noisy Labels and Long-tailed Learning |
Sunyi Chi et.al. |
2302.03488v1 |
null |
2023-02-07 |
What do Language Models know about word senses? Zero-Shot WSD with Language Models and Domain Inventories |
Oscar Sainz et.al. |
2302.03353v1 |
null |
2023-02-07 |
Continual Learning of Language Models |
Zixuan Ke et.al. |
2302.03241v1 |
link |
2023-02-07 |
Bringing the State-of-the-Art to Customers: A Neural Agent Assistant Framework for Customer Service Support |
Stephen Obadinma et.al. |
2302.03222v1 |
link |
2023-02-06 |
Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design |
Lyle Regenwetter et.al. |
2302.02913v1 |
null |
2023-02-06 |
Findings of the TSAR-2022 Shared Task on Multilingual Lexical Simplification |
Horacio Saggion et.al. |
2302.02888v1 |
null |
2023-02-07 |
Less is More: Understanding Word-level Textual Adversarial Attack via n-gram Frequency Descend |
Ning Lu et.al. |
2302.02568v2 |
null |
2023-02-06 |
Deep Learning for Time Series Classification and Extrinsic Regression: A Current Survey |
Navid Mohammadi Foumani et.al. |
2302.02515v1 |
link |
2023-02-05 |
VuLASTE: Long Sequence Model with Abstract Syntax Tree Embedding for vulnerability Detection |
Botong Zhu et.al. |
2302.02345v1 |
null |
2023-02-05 |
A Semantic Approach to Negation Detection and Word Disambiguation with Natural Language Processing |
Izunna Okpala et.al. |
2302.02291v1 |
null |
2023-02-04 |
Knowledge Distillation in Vision Transformers: A Critical Review |
Gousia Habib et.al. |
2302.02108v1 |
null |
2023-02-03 |
Witscript: A System for Generating Improvised Jokes in a Conversation |
Joe Toplyn et.al. |
2302.02008v1 |
null |
2023-02-06 |
Analyzing the impact of climate change on critical infrastructure from the scientific literature: A weakly supervised NLP approach |
Tanwi Mallick et.al. |
2302.01887v2 |
null |
2023-02-03 |
Lexical Simplification using multi level and modular approach |
Nikita Katyal et.al. |
2302.01823v1 |
null |
2023-02-03 |
Mitigating Data Scarcity for Large Language Models |
Hoang Van et.al. |
2302.01806v1 |
link |
2023-02-03 |
Bioformer: an efficient transformer language model for biomedical text mining |
Li Fang et.al. |
2302.01588v1 |
link |
2023-02-03 |
ResMem: Learn what you can and memorize the rest |
Zitong Yang et.al. |
2302.01576v1 |
null |
2023-02-03 |
Witgenstein's influence on artificial intelligence |
Piero Molino et.al. |
2302.01570v1 |
null |
2023-02-03 |
Using natural language processing and structured medical data to phenotype patients hospitalized due to COVID-19 |
Feier Chang et.al. |
2302.01536v1 |
null |
2023-02-03 |
SPADE: Self-supervised Pretraining for Acoustic DisEntanglement |
John Harvill et.al. |
2302.01483v1 |
null |
2023-02-02 |
Commonsense-Aware Prompting for Controllable Empathetic Dialogue Generation |
Yiren Liu et.al. |
2302.01441v1 |
null |
2023-02-02 |
Mixed Precision Post Training Quantization of Neural Networks with Sensitivity Guided Search |
Clemens JS Schaefer et.al. |
2302.01382v1 |
null |
2023-02-02 |
Modeling opinion polarization on social media: application to Covid-19 vaccination hesitancy in Italy |
Jonathan Franceschi et.al. |
2302.01028v1 |
null |
2023-02-02 |
Resilient Binary Neural Network |
Sheng Xu et.al. |
2302.00956v1 |
link |
2023-02-02 |
How to choose "Good" Samples for Text Data Augmentation |
Xiaotian Lin et.al. |
2302.00894v1 |
null |
2023-02-02 |
idT5: Indonesian Version of Multilingual T5 Transformer |
Mukhlish Fuadi et.al. |
2302.00856v1 |
null |
2023-02-01 |
FAVOR#: Sharp Attention Kernel Approximations via New Classes of Positive Random Features |
Valerii Likhosherstov et.al. |
2302.00787v1 |
null |
2023-02-01 |
User Study for Improving Tools for Bible Translation |
Joel Mathew et.al. |
2302.00778v1 |
null |
2023-02-01 |
Developing Hands-on Labs for Source Code Vulnerability Detection with AI |
Maryam Taeb et.al. |
2302.00750v1 |
null |
2023-02-01 |
Versatile Energy-Based Models for High Energy Physics |
Taoli Cheng et.al. |
2302.00695v1 |
link |
2023-02-01 |
Energy-Based Survival Models for Predictive Maintenance |
Olov Holmer et.al. |
2302.00629v1 |
null |
2023-02-01 |
Feed-Forward Blocks Control Contextualization in Masked Language Models |
Goro Kobayashi et.al. |
2302.00456v1 |
link |
2023-02-01 |
On the Role of Morphological Information for Contextual Lemmatization |
Olia Toporkov et.al. |
2302.00407v1 |
null |
2023-01-31 |
Large Language Models Can Be Easily Distracted by Irrelevant Context |
Freda Shi et.al. |
2302.00093v1 |
link |
2023-01-31 |
PADL: Language-Directed Physics-Based Character Control |
Jordan Juravsky et.al. |
2301.13868v1 |
link |
2023-01-31 |
Partitioning Distributed Compute Jobs with Reinforcement Learning and Graph Neural Networks |
Christopher W. F. Parsonson et.al. |
2301.13799v1 |
null |
2023-01-31 |
Zero-shot cross-lingual transfer language selection using linguistic similarity |
Juuso Eronen et.al. |
2301.13720v1 |
null |
2023-01-31 |
Friend-training: Learning from Models of Different but Related Tasks |
Mian Zhang et.al. |
2301.13683v1 |
null |
2023-02-01 |
What Makes Good Examples for Visual In-Context Learning? |
Yuanhan Zhang et.al. |
2301.13670v2 |
link |
2023-01-30 |
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation |
Minglun Han et.al. |
2301.13003v1 |
link |
2023-01-30 |
Exploring AI Ethics of ChatGPT: A Diagnostic Analysis |
Terry Yue Zhuo et.al. |
2301.12867v1 |
null |
2023-01-30 |
Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features |
Sishuo Chen et.al. |
2301.12715v1 |
link |
2023-01-30 |
UzbekTagger: The rule-based POS tagger for Uzbek language |
Maksud Sharipov et.al. |
2301.12711v1 |
null |
2023-01-29 |
Large Language Models for Biomedical Causal Graph Construction |
Vahan Arsenyan et.al. |
2301.12473v1 |
null |
2023-01-29 |
DocILE 2023 Teaser: Document Information Localization and Extraction |
Štěpán Šimsa et.al. |
2301.12394v1 |
null |
2023-01-28 |
HAT-GAE: Self-Supervised Graph Auto-encoders with Hierarchical Adaptive Masking and Trainable Corruption |
Chengyu Sun et.al. |
2301.12063v1 |
null |
2023-01-27 |
Improved knowledge distillation by utilizing backward pass knowledge in neural networks |
Aref Jafari et.al. |
2301.12006v1 |
null |
2023-01-27 |
Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation |
Jessica Huynh et.al. |
2301.12004v1 |
null |
2023-01-27 |
Gender and Prestige Bias in Coronavirus News Reporting |
Rebecca Dorn et.al. |
2301.11994v1 |
null |
2023-01-27 |
A Comparative Study of Pretrained Language Models for Long Clinical Text |
Yikuan Li et.al. |
2301.11847v1 |
link |
2023-01-27 |
Incorporating Knowledge into Document Summarization: an Application of Prefix-Tuning on GPT-2 |
Chen Chen et.al. |
2301.11719v1 |
null |
2023-01-27 |
SLCNN: Sentence-Level Convolutional Neural Network for Text Classification |
Ali Jarrahi et.al. |
2301.11696v1 |
null |
2023-01-27 |
A rule-free workflow for the automated generation of databases from scientific literature |
Luke P. J. Gilligan et.al. |
2301.11689v1 |
link |
2023-01-27 |
Down the Rabbit Hole: Detecting Online Extremism, Radicalisation, and Politicised Hate Speech |
Jarod Govers et.al. |
2301.11579v1 |
null |
2023-01-26 |
Beyond Arabic: Software for Perso-Arabic Script Manipulation |
Alexander Gutkin et.al. |
2301.11406v1 |
link |
2023-01-24 |
Semi-Automated Construction of Food Composition Knowledge Base |
Jason Youn et.al. |
2301.11322v1 |
link |
2023-01-26 |
LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization |
Laura Nguyen et.al. |
2301.11312v1 |
link |
2023-01-26 |
Box $^2$ EL: Concept and Role Box Embeddings for the Description Logic EL++ |
Mathias Jackermeier et.al. |
2301.11118v1 |
link |
2023-01-26 |
NLP as a Lens for Causal Analysis and Perception Mining to Infer Mental Health on Social Media |
Muskan Garg et.al. |
2301.11004v1 |
null |
2023-01-25 |
Qualitative Analysis of a Graph Transformer Approach to Addressing Hate Speech: Adapting to Dynamically Changing Content |
Liam Hebert et.al. |
2301.10871v1 |
null |
2023-01-25 |
Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement |
Gavin Abercrombie et.al. |
2301.10684v1 |
null |
2023-01-25 |
Understanding and Improving Deep Graph Neural Networks: A Probabilistic Graphical Model Perspective |
Jiayuan Chen et.al. |
2301.10536v1 |
null |
2023-01-25 |
Cross-lingual Argument Mining in the Medical Domain |
Anar Yeginbergenova et.al. |
2301.10527v1 |
link |
2023-01-25 |
Knowledge-augmented Graph Neural Networks with Concept-aware Attention for Adverse Drug Event Detection |
Shaoxiong Ji et.al. |
2301.10451v1 |
null |
2023-01-25 |
BDMMT: Backdoor Sample Detection for Language Models through Model Mutation Testing |
Jiali Wei et.al. |
2301.10412v1 |
null |
2023-01-24 |
A Framework To Improve User Story Sets Through Collaboration |
Salih Göktuğ Köse et.al. |
2301.10070v1 |
null |
2023-01-24 |
Multitask Instruction-based Prompting for Fallacy Recognition |
Tariq Alhindi et.al. |
2301.09992v1 |
null |
2023-01-24 |
Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression |
Jaeyong Song et.al. |
2301.09830v1 |
null |
2023-01-24 |
Transformer-Patcher: One Mistake worth One Neuron |
Zeyu Huang et.al. |
2301.09785v1 |
link |
2023-01-23 |
Noisy Parallel Data Alignment |
Ruoyu Xie et.al. |
2301.09685v1 |
link |
2023-01-22 |
Face Generation from Textual Features using Conditionally Trained Inputs to Generative Adversarial Networks |
Sandeep Shinde et.al. |
2301.09123v1 |
null |
2023-01-22 |
Differentially Private Natural Language Models: Recent Advances and Future Directions |
Lijie Hu et.al. |
2301.09112v1 |
null |
2023-01-22 |
Learning to Reject with a Fixed Predictor: Application to Decontextualization |
Christopher Mohri et.al. |
2301.09044v1 |
null |
2023-01-21 |
A Semantic Modular Framework for Events Topic Modeling in Social Media |
Arya Hadizadeh Moghaddam et.al. |
2301.09009v1 |
null |
2023-01-21 |
Blacks is to Anger as Whites is to Joy? Understanding Latent Affective Bias in Large Pre-trained Neural Language Models |
Anoop Kadan et.al. |
2301.09003v1 |
link |
2023-01-21 |
Exploring Methods for Building Dialects-Mandarin Code-Mixing Corpora: A Case Study in Taiwanese Hokkien |
Sin-En Lu et.al. |
2301.08937v1 |
link |
2023-01-21 |
Rationalization for Explainable NLP: A Survey |
Sai Gurrapu et.al. |
2301.08912v1 |
null |
2023-01-20 |
A Review of the Trends and Challenges in Adopting Natural Language Processing Methods for Education Feedback Analysis |
Thanveer Shaik et.al. |
2301.08826v1 |
null |
2023-01-19 |
Reversing The Twenty Questions Game |
Parth Parikh et.al. |
2301.08718v1 |
null |
2023-01-20 |
Which Features are Learned by CodeBert: An Empirical Study of the BERT-based Source Code Representation Learning |
Lan Zhang et.al. |
2301.08427v1 |
null |
2023-01-23 |
A Survey of research in Deep Learning for Robotics for Undergraduate research interns |
Narayanan PP et.al. |
2301.08283v2 |
null |
2023-01-19 |
Language Embeddings Sometimes Contain Typological Generalizations |
Robert Östling et.al. |
2301.08115v1 |
link |
2023-01-18 |
Automatically Reproducing Android Bug Reports Using Natural Language Processing and Reinforcement Learning |
Zhaoxu Zhang et.al. |
2301.07775v1 |
null |
2023-01-18 |
A Quantitative Exploration of Natural Language Processing Applications for Electricity Demand Analysis |
Yun Bai et.al. |
2301.07535v1 |
null |
2023-01-18 |
Discrete Latent Structure in Neural Networks |
Vlad Niculae et.al. |
2301.07473v1 |
null |
2023-01-17 |
On the State of German (Abstractive) Text Summarization |
Dennis Aumiller et.al. |
2301.07095v1 |
link |
2023-01-17 |
Transformer Based Implementation for Automatic Book Summarization |
Siddhant Porwal et.al. |
2301.07057v1 |
null |
2023-01-17 |
SECOMlint: A linter for Security Commit Messages |
Sofia Reis et.al. |
2301.06959v1 |
null |
2022-12-30 |
TA-DA: Topic-Aware Domain Adaptation for Scientific Keyphrase Identification and Classification (Student Abstract) |
Răzvan-Alexandru Smădu et.al. |
2301.06902v1 |
null |
2023-01-17 |
The Recent Advances in Automatic Term Extraction: A survey |
Hanh Thi Hong Tran et.al. |
2301.06767v1 |
null |
2023-01-17 |
Word Embeddings as Statistical Estimators |
Neil Dey et.al. |
2301.06710v1 |
link |
2023-01-16 |
XNLI 2.0: Improving XNLI dataset and performance on Cross Lingual Understanding (XLU) |
Ankit Kumar Upadhyay et.al. |
2301.06527v1 |
null |
2023-01-13 |
A Survey of Self-Supervised Learning from Multiple Perspectives: Algorithms, Theory, Applications and Future Trends |
Jie Gui et.al. |
2301.05712v1 |
link |
2023-01-13 |
Natural Language Processing of Aviation Occurrence Reports for Safety Management |
Patrick Jonk et.al. |
2301.05663v1 |
link |
2023-01-13 |
The 2022 n2c2/UW Shared Task on Extracting Social Determinants of Health |
Kevin Lybarger et.al. |
2301.05571v1 |
null |
2023-01-12 |
Rock Guitar Tablature Generation via Natural Language Processing |
Josue Casco-Rodriguez et.al. |
2301.05295v1 |
link |
2023-01-12 |
Counterfactual Explanations for Concepts in $\mathcal{ELH}$ |
Leonie Nora Sieger et.al. |
2301.05109v1 |
null |
2023-01-12 |
Improving Inference Performance of Machine Learning with the Divide-and-Conquer Principle |
Alex Kogan et.al. |
2301.05099v1 |
null |
2023-01-12 |
A Dataset of Kurdish (Sorani) Named Entities -- An Amendment to Kurdish-BLARK Named Entities |
Sazan Salar et.al. |
2301.04962v1 |
link |
2023-01-12 |
Machine-learning Analysis of Opioid Use Disorder Informed by MOR, DOR, KOR, NOR and ZOR-Based Interactome Networks |
Hongsong Feng et.al. |
2301.04815v1 |
link |
2023-01-13 |
Much Ado About Gender: Current Practices and Future Recommendations for Appropriate Gender-Aware Information Access |
Christine Pinney et.al. |
2301.04780v2 |
null |
2023-01-11 |
NarrowBERT: Accelerating Masked Language Model Pretraining and Inference |
Haoxin Li et.al. |
2301.04761v1 |
link |
2023-01-11 |
Semantic Web Enabled Geographic Question Answering Framework: GeoTR |
Ceren Ocal Tasar et.al. |
2301.04752v1 |
null |
2023-01-11 |
SensePOLAR: Word sense aware interpretability for pre-trained contextual word embeddings |
Jan Engler et.al. |
2301.04704v1 |
link |
2023-01-11 |
ML-FEED: Machine Learning Framework for Efficient Exploit Detection (Extended version) |
Tanujay Saha et.al. |
2301.04314v1 |
null |
2023-01-17 |
Word-Graph2vec: An efficient word embedding approach on word co-occurrence graph using random walk sampling |
Wenting Li et.al. |
2301.04312v2 |
null |
2023-01-11 |
A Multi-Modal Geographic Pre-Training Method |
Ruixue Ding et.al. |
2301.04283v1 |
link |
2023-01-10 |
User-Centered Security in Natural Language Processing |
Chris Emmery et.al. |
2301.04230v1 |
null |
2023-01-10 |
There is No Big Brother or Small Brother: Knowledge Infusion in Language Models for Link Prediction and Question Answering |
Ankush Agarwal et.al. |
2301.04013v1 |
link |
2023-01-10 |
Language Models sounds the Death Knell of Knowledge Graphs |
Kunal Suri et.al. |
2301.03980v1 |
null |
2023-01-10 |
AI based approach to Trailer Generation for Online Educational Courses |
Prakhar Mishra et.al. |
2301.03957v1 |
null |
2023-01-09 |
Transfer learning for conflict and duplicate detection in software requirement pairs |
Garima Malik et.al. |
2301.03709v1 |
null |
2023-01-10 |
Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review |
Reza Azad et.al. |
2301.03505v2 |
link |
2023-01-09 |
Mining Healthcare Procurement Data Using Text Mining and Natural Language Processing -- Reflection From An Industrial Project |
Ziqi Zhang et.al. |
2301.03458v1 |
null |
2023-01-09 |
Making Sense of Failure Logs in an Industrial DevOps Environment |
Muhammad Abbas et.al. |
2301.03450v1 |
null |
2023-01-09 |
Universal Multimodal Representation for Language Understanding |
Zhuosheng Zhang et.al. |
2301.03344v1 |
null |
2023-01-08 |
The State of Human-centered NLP Technology for Fact-checking |
Anubrata Das et.al. |
2301.03056v1 |
null |
2023-01-08 |
Topic Modelling of Swedish Newspaper Articles about Coronavirus: a Case Study using Latent Dirichlet Allocation Method |
Bernadeta Griciūtė et.al. |
2301.03029v1 |
link |
2023-01-08 |
Semantic rule Web-based Diagnosis and Treatment of Vector-Borne Diseases using SWRL rules |
Ritesh Chandra et.al. |
2301.03013v1 |
null |
2023-01-06 |
Systems for Parallel and Distributed Large-Model Deep Learning Training |
Kabir Nagrecha et.al. |
2301.02691v1 |
null |
2023-01-06 |
CHARM: Composing Heterogeneous Accelerators for Matrix Multiply on Versal ACAP Architecture |
Jinming Zhuang et.al. |
2301.02359v1 |
link |
2023-01-05 |
Sequentially Controlled Text Generation |
Alexander Spangher et.al. |
2301.02299v1 |
null |
2023-01-05 |
A Survey of Code-switching: Linguistic and Social Perspectives for Language Technologies |
A. Seza Doğruöz et.al. |
2301.01967v1 |
null |
2023-01-05 |
Corrupted by Algorithms? How AI-generated and Human-written Advice Shape (Dis)honesty |
Margarita Leib et.al. |
2301.01954v1 |
null |
2023-01-04 |
Parameter-Efficient Fine-Tuning Design Spaces |
Jiaao Chen et.al. |
2301.01821v1 |
null |
2023-01-04 |
MessageNet: Message Classification using Natural Language Processing and Meta-data |
Adar Kahana et.al. |
2301.01808v1 |
null |
2023-01-04 |
Infomaxformer: Maximum Entropy Transformer for Long Time-Series Forecasting Problem |
Peiwang Tang et.al. |
2301.01772v1 |
null |
2023-01-04 |
Anonymous Pattern Molecular Fingerprint and its Applications on Property Identification |
Xue Liu et.al. |
2301.01620v1 |
null |
2023-01-03 |
Linear chain conditional random fields, hidden Markov models, and related classifiers |
Elie Azeraf et.al. |
2301.01293v1 |
null |
2023-01-03 |
Introducing Variational Inference in Undergraduate Statistics and Data Science Curriculum |
Vojtech Kejzlar et.al. |
2301.01251v1 |
link |
2023-01-03 |
A Survey On Few-shot Knowledge Graph Completion with Structural and Commonsense Knowledge |
Haodi Ma et.al. |
2301.01172v1 |
null |
2023-01-03 |
Policy Pre-training for End-to-end Autonomous Driving via Self-supervised Geometric Modeling |
Penghao Wu et.al. |
2301.01006v1 |
link |
2023-01-03 |
Boosting Neural Networks to Decompile Optimized Binaries |
Ying Cao et.al. |
2301.00969v1 |
null |
2022-12-29 |
Ontology-based Context Aware Recommender System Application for Tourism |
Vitor T. Camacho et.al. |
2301.00768v1 |
null |
2023-01-02 |
Tsetlin Machine Embedding: Representing Words Using Logical Expressions |
Bimal Bhattarai et.al. |
2301.00709v1 |
link |
2022-12-30 |
Active Learning for Neural Machine Translation |
Neeraj Vashistha et.al. |
2301.00688v1 |
link |
2022-12-20 |
Addressing the Selection Bias in Voice Assistance: Training Voice Assistance Model in Python with Equal Data Selection |
Kashav Piya et.al. |
2301.00646v1 |
null |
2023-01-02 |
Statistical Machine Translation for Indic Languages |
Sudhansu Bala Das et.al. |
2301.00539v1 |
null |
2023-01-02 |
Adaptive Fine-tuning for Multiclass Classification over Software Requirement Data |
Savas Yildirim et.al. |
2301.00495v1 |
null |
2023-01-01 |
Integrating Semantic Information into Sketchy Reading Module of Retro-Reader for Vietnamese Machine Reading Comprehension |
Hang Thi-Thu Le et.al. |
2301.00429v1 |
null |
2023-01-01 |
CORGI-PM: A Chinese Corpus For Gender Bias Probing and Mitigation |
Ge Zhang et.al. |
2301.00395v1 |
link |
2023-01-01 |
A Functional approach for Two Way Dimension Reduction in Time Series |
Aniruddha Rajendra Rao et.al. |
2301.00357v1 |
null |
2022-12-31 |
Rethinking with Retrieval: Faithful Large Language Model Inference |
Hangfeng He et.al. |
2301.00303v1 |
link |
2022-12-31 |
RECOMMED: A Comprehensive Pharmaceutical Recommendation System |
Mariam Zomorodi et.al. |
2301.00280v1 |
null |
2022-12-31 |
A Survey for In-context Learning |
Qingxiu Dong et.al. |
2301.00234v1 |
link |
2022-12-31 |
Logic Mill -- A Knowledge Navigation System |
Sebastian Erhardt et.al. |
2301.00200v1 |
null |
2023-01-06 |
Examining Political Rhetoric with Epistemic Stance Detection |
Ankita Gupta et.al. |
2212.14486v2 |
link |
2022-12-29 |
On Learning the Structure of Clusters in Graphs |
Peter Macgregor et.al. |
2212.14345v1 |
null |
2022-12-29 |
On Transforming Reinforcement Learning by Transformer: The Development Trajectory |
Shengchao Hu et.al. |
2212.14164v1 |
null |
2022-12-28 |
Towards automating Codenames spymasters with deep reinforcement learning |
Sherman Siu et.al. |
2212.14104v1 |
null |
2022-12-28 |
Automatic Recognition and Classification of Future Work Sentences from Academic Articles in a Specific Domain |
Chengzhi Zhang et.al. |
2212.13860v1 |
link |
2022-12-30 |
Cyber Security and Online Safety Education for Schools in the UK: Looking through the Lens of Twitter Data |
Jamie Knott et.al. |
2212.13742v2 |
null |
2022-12-28 |
Part-guided Relational Transformers for Fine-grained Visual Recognition |
Yifan Zhao et.al. |
2212.13685v1 |
link |
2022-12-27 |
SVSBI: Sequence-based virtual screening of biomolecular interactions |
Li Shen et.al. |
2212.13617v1 |
link |
2022-12-27 |
Nanomaterials for Supercapacitors: Uncovering Research Themes with Unsupervised Machine Learning |
Mridhula Venkatanarayanan et.al. |
2212.13550v1 |
null |
2022-12-27 |
A Survey on Knowledge-Enhanced Pre-trained Language Models |
Chaoqi Zhen et.al. |
2212.13428v1 |
null |
2023-01-11 |
NEEDED: Introducing Hierarchical Transformer to Eye Diseases Diagnosis |
Xu Ye et.al. |
2212.13408v3 |
link |
2022-12-26 |
VQA and Visual Reasoning: An Overview of Recent Datasets, Methods and Challenges |
Rufai Yusuf Zakari et.al. |
2212.13296v1 |
null |
2022-12-24 |
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective |
Ying Wen et.al. |
2212.12669v1 |
link |
2022-12-24 |
STRUDEL: Structured Dialogue Summarization for Dialogue Comprehension |
Borui Wang et.al. |
2212.12652v1 |
null |
2022-12-24 |
Utilizing Priming to Identify Optimal Class Ordering to Alleviate Catastrophic Forgetting |
Gabriel Mantione-Holmes et.al. |
2212.12643v1 |
null |
2022-12-23 |
Content Rating Classification for Fan Fiction |
Yu Qiao et.al. |
2212.12496v1 |
null |
2022-12-23 |
Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media |
Yuting Guo et.al. |
2212.12454v1 |
null |
2022-12-22 |
CAMeMBERT: Cascading Assistant-Mediated Multilingual BERT |
Dan DeGenaro et.al. |
2212.11456v1 |
null |
2022-12-21 |
Automatic Emotion Modelling in Written Stories |
Lukas Christ et.al. |
2212.11382v1 |
link |
2022-12-21 |
Training language models for deeper understanding improves brain alignment |
Khai Loong Aw et.al. |
2212.10898v1 |
link |
2022-12-21 |
A Survey of Mix-based Data Augmentation: Taxonomy, Methods, Applications, and Explainability |
Chengtai Cao et.al. |
2212.10888v1 |
link |
2022-12-21 |
A Portal Dedicated to Higgs Bosons for Experts and the General Public |
Andre Sopczak et.al. |
2212.10857v1 |
null |
2022-12-21 |
End-to-End Automatic Speech Recognition model for the Sudanese Dialect |
Ayman Mansour et.al. |
2212.10826v1 |
null |
2022-12-21 |
MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning |
Zhiyang Xu et.al. |
2212.10773v1 |
link |
2022-12-21 |
Investigation of Network Architecture for Multimodal Head-and-Neck Tumor Segmentation |
Ye Li et.al. |
2212.10724v1 |
null |
2022-12-20 |
KronA: Parameter Efficient Tuning with Kronecker Adapter |
Ali Edalati et.al. |
2212.10650v1 |
null |
2022-12-20 |
A Survey of Deep Learning for Mathematical Reasoning |
Pan Lu et.al. |
2212.10535v1 |
link |
2022-12-20 |
A Measure-Theoretic Characterization of Tight Language Models |
Li Du et.al. |
2212.10502v1 |
null |
2022-12-20 |
Is GPT-3 a Good Data Annotator? |
Bosheng Ding et.al. |
2212.10450v1 |
null |
2022-12-20 |
Towards Reasoning in Large Language Models: A Survey |
Jie Huang et.al. |
2212.10403v1 |
link |
2022-12-20 |
SeqDiffuSeq: Text Diffusion with Encoder-Decoder Transformers |
Hongyi Yuan et.al. |
2212.10325v1 |
link |
2022-12-20 |
CSMPQ:Class Separability Based Mixed-Precision Quantization |
Mingkai Wang et.al. |
2212.10220v1 |
null |
2022-12-20 |
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator |
Jian Yang et.al. |
2212.10218v1 |
link |
2022-12-20 |
Human-Guided Fair Classification for Natural Language Processing |
Florian E. Dorner et.al. |
2212.10154v1 |
link |
2022-12-20 |
When Federated Learning Meets Pre-trained Language Models' Parameter-Efficient Tuning Methods |
Zhuo Zhang et.al. |
2212.10025v1 |
link |
2022-12-19 |
A Comparative Study on Textual Saliency of Styles from Eye Tracking, Annotations, and Language Models |
Karin de Langis et.al. |
2212.09873v1 |
link |
2022-12-19 |
Exploring Hybrid and Ensemble Models for Multiclass Prediction of Mental Health Status on Social Media |
Sourabh Zanwar et.al. |
2212.09839v1 |
null |
2022-12-19 |
Do CoNLL-2003 Named Entity Taggers Still Work Well in 2023? |
Shuheng Liu et.al. |
2212.09747v1 |
link |
2022-12-19 |
MANER: Mask Augmented Named Entity Recognition for Extreme Low-Resource Languages |
Shashank Sonkar et.al. |
2212.09723v1 |
null |