Sense Rel: A Sense-Level Benchmark for Denotational and Connotational Meaning Relations

May 30, 2026·

Pierluigi Cassotti*

Naomi Baes*

Stefano De Pascale*

Jader Martins Camboim De Sá

Francesco Periti

Nick Haslam

Dirk Geeraerts

Nina Tahmasebi

· 0 min read

PDF Project

Distribution of connotational differences across denotational relation types.

Abstract

Polysemy enables a single word to convey multiple related meanings, reflecting the conceptual and emotional aspects of language evolution. We introduce the first sense-level benchmark for modeling semantic relations between word senses, uniting denotational and connotational aspects of meanings. The benchmark distinguishes denotational relations, such as generalization or metaphor, as well as two connotational dimensions: valence and arousal. We evaluate large language models (LLMs), GPT-4o, Llama 3.1, and DeepSeek, in zero-shot and fine-tuned settings. Results show that GPT-4o best aligns with human affective judgments, while a fine-tuned RoBERTa model excels at classifying denotational relations.

Type

Conference paper

Publication

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics

Last updated on May 30, 2026

Lexical Semantics Semantic Change Word Senses Polysemy Denotation Connotation Valence Arousal Human--LLM Evaluation Benchmarking

A Multidimensional Computational Analysis of Dehumanization in Incel Discourse May 25, 2026 →