Naomi Baes

PhD Candidate - Psychology · Natural Language Processing

About Me

Researcher bridging computational linguistics, psychology, and natural language processing to study how societally relevant concepts change their meanings over time. Using large language models and historical text corpora, I co-developed the theory-driven linguistic framework SIBling (with Professor Nick Haslam and Dr Ekaterina Vylomova), which models semantic change along three dimensions — Sentiment, Intensity, and Breadth. Alongside SIBling, I co-developed LSC-Eval (with Dr Haim Dubossarsky and Raphaël Merx), a benchmarking framework that uses LLM-generated diachronic corpora to evaluate methods for detecting semantic change. Together, these frameworks advance both computational methods and applications, providing new tools to trace cultural and social dynamics across concepts and language domains. My research is supported by an Australian Government Research Training Program Scholarship and I am an active member of the NLP community - contributing to shared tasks (BRIGHTER; BLEnD), co-organizing workshops (LChange'26), and serving on program committees (NLP4Democracy; SEM2025).

Interests

Computational Linguistics
Computational Social Science
Language Change
Natural Language Processing
Psychology

Education

PhD, Psychology/ Natural Language Processing
University of Melbourne (2023-)
Graduate Diploma in Psychology (Advanced) with Honours
University of Melbourne

Research Program

Broadly speaking, I use computational approaches to study how language reflects social and cultural change, treating it as a window into the human mind and society. My work develops theory-driven measures to quantify linguistic, psychological, and social constructs by integrating insights from linguistics and psychology with methods from computational linguistics and Natural Language Processing (a subfield of Artificial Intelligence). Because labeled data are scarce in the social sciences, I primarily use pretrained language models, unsupervised learning, normed lexical resources, and statistical modelling. My current focus is on tracing how societally relevant concepts evolve in meaning over time using large language models and historical text corpora.

With my PhD supervisors, I have developed a linguistic framework (SIBling) and associated measures to model lexical semantic change (LSC) along three major dimensions that are typically overlooked by existing approaches.

Key Contributions:

SIBling: A theoretical model integrating insights from historical linguistics and psychology, reducing six types of LSC to three core dimensions: Sentiment, Intensity, and Breadth (SIB). [Prototype]
SIB Toolkit: A computational implementation of SIBling that quantifies semantic change across SIB, and complementary features (salience and thematic content). Designed for broad application across the social sciences and language domains (scientific, media, everyday).
LSC-Eval: An evaluation framework that uses LLM-generated synthetic corpora to simulate kinds of LSC and validate LSC detection methods, identifying optimal dimension- and domain-specific approaches. [Prototype]
Applications: Applying SIBling to trace the historical semantic evolution of mental health-related concepts (e.g., autism, schizophrenia), my research examines broader cultural dynamics such as concept creep, pathologisation, and stigmatisation.

This program: (1) offers a multidimensional model of semantic change (SIBling), (2) develops computational tools for its application (SIB Toolkit), (3) establishes a principled evaluation framework for LSC detection methods (LSC-Eval), and (4) demonstrates its value through detailed case studies. Together, these efforts lays the groundwork for future extensions across disciplines (e.g., law, humanities), domains, and languages.

Featured Publications

LLM-Generated Synthetic Data

LSC-Eval: A General Framework to Evaluate Methods for Assessing Dimensions of Lexical Semantic Change Using LLM-Generated Synthetic Data

We introduce a scalable, domain-general framework that creates diachronic, LLM-generated synthetic datasets to simulate theory-driven Lexical Semantic Change (LSC) and evaluates various methods for measuring kinds of LSC--using examples from psychology, we apply this framework to assess the sensitivity of a suite of methods in detecting artificially induced changes in dimensions of Sentiment, Intensity, and Breadth (SIB), ultimately identifying the most suitable approach for each dimension.

Mar 11, 2025

Lexical Semantic Change

A Multidimensional Framework for Evaluating Lexical Semantic Change with Social Science Applications

This study proposes a computational framework to evaluate lexical semantic change in a way that economically integrates forms identified by historical linguists and uses it to analyze semantic shifts in mental health and mental illness.

Aug 11, 2024

Relevant Publications

Naomi Baes, Raphaël Merx, Nick Haslam, Ekaterina Vylomova, Haim Dubossarsky (2025). LSC-Eval: A General Framework to Evaluate Methods for Assessing Dimensions of Lexical Semantic Change Using LLM-Generated Synthetic Data. ACL Findings.

PDF Code Dataset Poster

Naomi Baes, Nick Haslam, Ekaterina Vylomova (2024). A Multidimensional Framework for Evaluating Lexical Semantic Change with Social Science Applications. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics.

PDF Code Dataset Poster Video

Naomi Baes, Nick Haslam (2024). What should we call mental ill health? Historical shifts in the popularity of generic terms. PLOS Mental Health.

PDF Code

Nick Haslam, Naomi Baes, Milad Haghani (2024). The structure and evolution of social psychology: a co-citation network analysis. The Journal of Social Psychology.

PDF Code

Melissa a Wheeler, Samuel G Wilson, Naomi Baes, Vlad Demsar (2024). A search for commonalities in defining the common good: Using folk theories to unlock shared conceptions. The British Journal of Social Psychology.

PDF Code

See all publications

Invited Talks

Semantic Shifts in Mental Health-Related Concepts

Presented my work (drawing on findings from 3 PhD studies) on modelling semantic change in mental health-related concepts. Event: Mental Health PhD Program Conference (University of Melbourne)

Oct 3, 2025

Dimensions of Semantic Change: Validation and Application of the SIBling Framework

Invited talk (thanks to Principal Research Scientist Saif M Mohammad) at the National Research Council Canada on two complementary frameworks for studying lexical semantic change: (1) SIBling, which models change along three dimensions (Sentiment, Intensity, Breadth); and (2) LSC-Eval, which generates synthetic benchmarks for evaluating the suitability of methods for their sensitivity to detecting induced change.

Sep 24, 2025

Dimensions of Semantic Change: Applying the SIBling Framework to Mental Health Concepts

Talk at Utrecht University's Natural Language and Text Processing Lab on two complementary frameworks for studying lexical semantic change: (1) SIBling, which models change along three dimensions (Sentiment, Intensity, Breadth); and (2) LSC-Eval, which generates synthetic benchmarks for evaluating methods. Together, they provide tools for tracing socially significant conceptual shifts, with applications to mental health concepts.

Sep 16, 2025

Dimensions of Semantic Change: Validation and Application of the SIBling Framework

Presented my PhD work on modelling conceptual change using two developed frameworks: (1) SIBling, a linguistic model of semantic change; and (2) LSC-Eval, a general-purpose framework for evaluating methods for assessing dimensions of semantic change. Event: Change is Key! Conference (University of Gothenburg, Dept. of Philosophy, Linguistics & Theory of Science)

Sep 12, 2025

Semantic Shifts in Mental Health-Related Concepts

Presented on the 'Semantic Shifts in Mental Health-Related Concepts' at The 4th International Workshop on Computational Approaches to Historical Language Change 2023 (LChange'23), collocated with the EMNLP-2023 conference (Sentosa, Singapore).

Dec 6, 2023

Academic Conferences

Sep 14, 2025

Grateful to have presented and participated at international conferences, workshops, and research events.

Sep 14, 2025

Google Research @ Sydney Event

Feb 18, 2025

Honoured to have been selected to attend the Google Research @ Sydney Event at the first Google research facility in Australia.

Feb 18, 2025

Quick Updates

Delighted to share my PhD research at (1) the Change is Key! conference in Gothenburg (Sweden), (2) University of Utrecht (Netherlands), (3) National Research Council Canada and (4) the Mental Health PhD Program Conference!
5 Aug – 30 Sept 2025 — Interned at Change is Key!. The program develops computational tools to trace how language, society, and culture evolve, applying NLP and corpus methods to study semantic change and variation across linguistics, digital humanities, and the social sciences.
Presented our new method evaluation framework LSC-Eval: A General Evaluation Framework for Assessing Methods for Measuring Lexical Semantic Change with LLM-Generated Synthetic Data, at ACL 2025, Vienna two frameworks for modeling conceptual change — SIBling and LSC-Eval — at IC2S2’25 (Norrköping), the International Conference on Computational Social Science.
New corpus data and scripts publicly available — see Resources tab.