Naomi Baes
Naomi Baes

PhD Candidate - Psychology · Natural Language Processing

About Me

Researcher bridging computational linguistics, psychology, and natural language processing to study how societally relevant concepts change their meanings over time. Using large language models and historical text corpora, I co-developed the theory-driven linguistic framework SIBling (with Professor Nick Haslam and Dr Ekaterina Vylomova), which models semantic change along three dimensions — Sentiment, Intensity, and Breadth. Alongside SIBling, I co-developed LSC-Eval (with Dr Haim Dubossarsky and Raphaël Merx), a benchmarking framework that uses LLM-generated diachronic corpora to evaluate methods for detecting semantic change. Together, these frameworks advance both computational methods and applications, providing new tools to trace cultural and social dynamics across concepts and language domains. My research is supported by an Australian Government Research Training Program Scholarship and I am an active member of the NLP community - contributing to shared tasks (BRIGHTER; BLEnD), co-organizing workshops (LChange'26), and serving on program committees (NLP4Democracy; SEM2025).

CV
Interests
  • Computational Linguistics
  • Computational Social Science
  • Language Change
  • Natural Language Processing
  • Psychology
Education
  • PhD, Psychology/ Natural Language Processing

    University of Melbourne (2023-)

  • Graduate Diploma in Psychology (Advanced) with Honours

    University of Melbourne

Research Program

Broadly speaking, I use computational approaches to study how language reflects social and cultural change, treating it as a window into the human mind and society. My work develops theory-driven measures to quantify linguistic, psychological, and social constructs by integrating insights from linguistics and psychology with methods from computational linguistics and Natural Language Processing (a subfield of Artificial Intelligence). Because labeled data are scarce in the social sciences, I primarily use pretrained language models, unsupervised learning, normed lexical resources, and statistical modelling. My current focus is on tracing how societally relevant concepts evolve in meaning over time using large language models and historical text corpora.

With my PhD supervisors, I have developed a linguistic framework (SIBling) and associated measures to model lexical semantic change (LSC) along three major dimensions that are typically overlooked by existing approaches.

Key Contributions:

  • SIBling: A theoretical model integrating insights from historical linguistics and psychology, reducing six types of LSC to three core dimensions: Sentiment, Intensity, and Breadth (SIB). [Prototype]
  • SIB Toolkit: A computational implementation of SIBling that quantifies semantic change across SIB, and complementary features (salience and thematic content). Designed for broad application across the social sciences and language domains (scientific, media, everyday).
  • LSC-Eval: An evaluation framework that uses LLM-generated synthetic corpora to simulate kinds of LSC and validate LSC detection methods, identifying optimal dimension- and domain-specific approaches. [Prototype]
  • Applications: Applying SIBling to trace the historical semantic evolution of mental health-related concepts (e.g., autism, schizophrenia), my research examines broader cultural dynamics such as concept creep, pathologisation, and stigmatisation.

This program: (1) offers a multidimensional model of semantic change (SIBling), (2) develops computational tools for its application (SIB Toolkit), (3) establishes a principled evaluation framework for LSC detection methods (LSC-Eval), and (4) demonstrates its value through detailed case studies. Together, these efforts lays the groundwork for future extensions across disciplines (e.g., law, humanities), domains, and languages.

Featured Publications
Relevant Publications
(2024). A Multidimensional Framework for Evaluating Lexical Semantic Change with Social Science Applications. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics.
(2024). The structure and evolution of social psychology: a co-citation network analysis. The Journal of Social Psychology.
(2024). A search for commonalities in defining the common good: Using folk theories to unlock shared conceptions. The British Journal of Social Psychology.
Invited Talks
Recent News

Academic Conferences

Grateful to have presented and participated at international conferences, workshops, and research events.

Google Research @ Sydney Event

Honoured to have been selected to attend the Google Research @ Sydney Event at the first Google research facility in Australia.

Quick Updates
  • Delighted to share my PhD research at (1) the Change is Key! conference in Gothenburg (Sweden), (2) University of Utrecht (Netherlands), (3) National Research Council Canada and (4) the Mental Health PhD Program Conference!

  • 5 Aug – 30 Sept 2025Interned at Change is Key!. The program develops computational tools to trace how language, society, and culture evolve, applying NLP and corpus methods to study semantic change and variation across linguistics, digital humanities, and the social sciences.

  • Presented our new method evaluation framework LSC-Eval: A General Evaluation Framework for Assessing Methods for Measuring Lexical Semantic Change with LLM-Generated Synthetic Data, at ACL 2025, Vienna two frameworks for modeling conceptual change — SIBling and LSC-Eval — at IC2S2’25 (Norrköping), the International Conference on Computational Social Science.

  • New corpus data and scripts publicly available — see Resources tab.