Sense Rel: A Sense-Level Benchmark for Denotational and Connotational Meaning Relations

May 30, 2026·
Pierluigi Cassotti
Naomi Baes
Naomi Baes
,
Stefano De Pascale
,
Jader Martins Camboim De Sá
,
Francesco Periti
,
Nick Haslam
,
Dirk Geeraerts
,
Nina Tahmasebi
· 0 min read
Distribution of connotational differences across denotational relation types.
Abstract
Polysemy enables a single word to convey multiple related meanings, reflecting the conceptual and emotional aspects of language evolution. We introduce the first sense-level benchmark for modeling semantic relations between word senses, uniting denotational and connotational aspects of meanings. The benchmark distinguishes denotational relations, such as generalization or metaphor, as well as two connotational dimensions: valence and arousal. We evaluate large language models (LLMs), GPT-4o, Llama 3.1, and DeepSeek, in zero-shot and fine-tuned settings. Results show that GPT-4o best aligns with human affective judgments, while a fine-tuned RoBERTa model excels at classifying denotational relations.
Type
Publication
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics