Manner implicatures in large language models
Abstract: In human speakers' daily conversations, what we do not say matters. We not only compute the literal semantics but also go beyond it and draw inferences from what we could have said but chose not to. How well is this pragmatic reasoning process represented in pre-trained large language models (LLMs)? In this study, we address this question through the lens of manner implicature, a pragmatic inference triggered by a violation of the Gricean maxim of Manner. Manner implicature is a central member of the class of context-sensitive phenomena. The current work investigates to what extent pre-trained LLMs are able to identify and tease apart different shades of meaning in manner implicature. We constructed three metrics to explain LLMs' behavior: LLM-surprisals, embedding-vector similarities, and natural language prompting. Results showed no striking evidence that LLMs have explainable representations of meaning. First, the surprisal findings suggest that some LLMs achieved above-chance accuracy in capturing different dimensions of meaning and were able to differentiate neutral relations from entailments or implications, but they did not show consistent, robust sensitivity to more nuanced comparisons, such as entailment versus implication and equivalence versus entailment. Second, the similarity findings suggest that the perceived advantage of contextual over static embeddings was minimal: contextual LLMs did not notably outperform static GloVe embeddings, and the two showed no significant difference, though distinctions between entailment and implication were slightly more observable in LLMs. Third, the prompting findings offered no further evidence of LLMs' competence in fully representing different shades of meaning. Overall, our study suggests that the current dominant pre-training paradigms do not lead to significant competence in manner implicature in our models. Our investigation sheds light on the design of datasets and benchmark metrics driven by formal and distributional linguistic theories.
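The first metric named in the abstract, LLM-surprisal, is the negative log-probability a language model assigns to each token. A minimal sketch of the idea, using a hand-specified toy bigram model rather than a real pre-trained LLM (all probabilities below are made-up illustrative values, not numbers from the study):

```python
import math

# Toy conditional probabilities P(word | previous word) for one sentence.
# These values are invented for illustration; the paper's metric would take
# them from a pre-trained LLM's softmax outputs instead.
cond_prob = {
    ("<s>", "the"): 0.20,
    ("the", "talk"): 0.05,
    ("talk", "was"): 0.30,
    ("was", "not"): 0.10,
    ("not", "uninteresting"): 0.002,  # a Manner-violating double negative is rare
}

def surprisal(prev, word):
    """Surprisal in bits: -log2 P(word | prev)."""
    return -math.log2(cond_prob[(prev, word)])

def sentence_surprisal(tokens):
    """Total surprisal of a token sequence under the toy bigram model."""
    return sum(surprisal(p, w) for p, w in zip(tokens, tokens[1:]))

tokens = ["<s>", "the", "talk", "was", "not", "uninteresting"]
total = sentence_surprisal(tokens)
```

Under this sketch, the rare marked continuation "uninteresting" after "not" carries far more surprisal than the frequent transitions, which is the kind of contrast the study's surprisal comparisons rely on.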
Saved in:
| Main Author: | Yan Cong |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Nature Portfolio, 2024-11-01 |
| Series: | Scientific Reports |
| Subjects: | Explainability; Large language models; Pragmatic reasoning; Semantics; Natural language understanding; Conversational implicatures |
| Online Access: | https://doi.org/10.1038/s41598-024-80571-3 |
| _version_ | 1850216432461152256 |
|---|---|
| author | Yan Cong |
| author_facet | Yan Cong |
| author_sort | Yan Cong |
| collection | DOAJ |
| description | Abstract: In human speakers' daily conversations, what we do not say matters. We not only compute the literal semantics but also go beyond it and draw inferences from what we could have said but chose not to. How well is this pragmatic reasoning process represented in pre-trained large language models (LLMs)? In this study, we address this question through the lens of manner implicature, a pragmatic inference triggered by a violation of the Gricean maxim of Manner. Manner implicature is a central member of the class of context-sensitive phenomena. The current work investigates to what extent pre-trained LLMs are able to identify and tease apart different shades of meaning in manner implicature. We constructed three metrics to explain LLMs' behavior: LLM-surprisals, embedding-vector similarities, and natural language prompting. Results showed no striking evidence that LLMs have explainable representations of meaning. First, the surprisal findings suggest that some LLMs achieved above-chance accuracy in capturing different dimensions of meaning and were able to differentiate neutral relations from entailments or implications, but they did not show consistent, robust sensitivity to more nuanced comparisons, such as entailment versus implication and equivalence versus entailment. Second, the similarity findings suggest that the perceived advantage of contextual over static embeddings was minimal: contextual LLMs did not notably outperform static GloVe embeddings, and the two showed no significant difference, though distinctions between entailment and implication were slightly more observable in LLMs. Third, the prompting findings offered no further evidence of LLMs' competence in fully representing different shades of meaning. Overall, our study suggests that the current dominant pre-training paradigms do not lead to significant competence in manner implicature in our models. Our investigation sheds light on the design of datasets and benchmark metrics driven by formal and distributional linguistic theories. |
| format | Article |
| id | doaj-art-6b84cbdb89e14da0a435442b1b880b41 |
| institution | OA Journals |
| issn | 2045-2322 |
| language | English |
| publishDate | 2024-11-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | Yan Cong (School of Languages and Cultures, Purdue University). Manner implicatures in large language models. Scientific Reports (Nature Portfolio), 2024-11-01. ISSN 2045-2322. https://doi.org/10.1038/s41598-024-80571-3 |
| title | Manner implicatures in large language models |
| topic | Explainability; Large language models; Pragmatic reasoning; Semantics; Natural language understanding; Conversational implicatures |
| url | https://doi.org/10.1038/s41598-024-80571-3 |
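The abstract's second metric compares contextual LLM embeddings with static GloVe embeddings via vector similarity, typically cosine similarity. A minimal sketch of that comparison; the four-dimensional vectors below are made-up toy stand-ins, not real GloVe or LLM embeddings:

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Invented toy vectors standing in for sentence embeddings of, e.g.,
# "The talk was interesting" (plain), "The talk was not uninteresting"
# (marked), and an unrelated control sentence. Real experiments would use
# averaged GloVe vectors or an LLM's hidden states instead.
plain = [0.9, 0.1, 0.3, 0.5]
marked = [0.8, 0.2, 0.4, 0.5]
unrelated = [0.1, 0.9, 0.8, 0.1]

sim_close = cosine(plain, marked)   # near-paraphrases: high similarity
sim_far = cosine(plain, unrelated)  # unrelated pair: lower similarity
```

The study's finding is that such similarity scores from contextual LLMs did not separate the nuanced meaning relations much better than scores from static GloVe vectors.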