Naomi Baes
Naomi Baes

PhD Candidate | Computational Social Science · Psychology · NLP

About Me

I am a computational social scientist working at the intersection of psychology and natural language processing, developing theory-driven methods to study how socially meaningful concepts change over time and across domains. My research uses historical text corpora, contextual embeddings, and large language models to examine conceptual and lexical semantic change, with a particular focus on the changing representations of mental health-related concepts. I co-developed SIBling, a framework for modelling semantic change across the dimensions of Sentiment, Intensity, and Breadth, and LSC-Eval, a benchmarking framework for evaluating methods for detecting lexical semantic change using synthetic historical data. More broadly, I am interested in conceptual change, language change and variation, mental health discourse, and computational approaches to societally significant questions. I am active in the NLP community through workshop organisation, shared tasks, and program committee service, and also contribute through peer review in psychology and computational social science.

CV
Interests
  • Computational Linguistics
  • Computational Social Science
  • Language Change
  • Natural Language Processing
  • Psychology
Education
  • PhD, Psychology/ Natural Language Processing

    University of Melbourne (2023-)

  • Graduate Diploma in Psychology (Advanced) with Honours

    University of Melbourne

Research Program

Broadly, my research uses computational methods to study how socially meaningful concepts change over time, and how those shifts reflect wider cultural and societal dynamics. At the intersection of psychology, linguistics, and natural language processing, I develop theory-driven approaches for tracing conceptual change in historical text corpora using contextual embeddings, large language models, lexical resources, and statistical modelling. My current work focuses especially on mental health-related concepts and how their meanings and representations shift across scientific, media, and everyday language. This research program develops new ways of modelling conceptual change, evaluating computational methods, and applying them to important societal and cultural questions across languages, domains and disciplines.

Key Contributions:

  • SIBling: A multidimensional framework for modelling lexical semantic change across three core dimensions: Sentiment, Intensity, and Breadth (SIB). [Paper]
  • SIB Toolkit: A computational implementation of SIBling for measuring semantic change across SIB and related features such as salience and thematic content across scientific, media, and everyday domains.
  • LSC-Eval: A benchmarking framework that uses LLM-generated synthetic historical data to evaluate methods for detecting lexical semantic change and identify dimension- and domain-sensitive approaches. [Paper]
  • Sense-tracking pipeline, for estimating the prevalence of word senses in historical text corpora, to assists with LSC interpretability. [Paper])
  • Applications: Case studies of mental health-related concepts, such as schizophrenia and autism, examining broader cultural trends including concept creep, pathologisation, and stigmatisation.
Featured Publications
Selected Publications
Invited Talks
Recent News

Academic Conferences

Grateful to have presented and participated at international conferences, workshops, and research events.

Google Research @ Sydney Event

Honoured to have been selected to attend the Google Research @ Sydney Event at the first Google research facility in Australia.

Highlights
  • New sense-tracking pipeline for estimating the prevalence of word senses in historical text corpora: link

  • Delighted to share my PhD research at (1) the Change is Key! conference in Gothenburg (Sweden), (2) University of Utrecht (Netherlands), (3) National Research Council Canada, (4) the Mental Health PhD Program Conference!, and (5) The LChange'26 Workshop, colocated with EACL.

  • 5 Aug – 30 Sept 2025Interned at Change is Key!. The program develops computational tools to trace how language, society, and culture evolve, applying NLP and corpus methods to study semantic change and variation across linguistics, digital humanities, and the social sciences.

  • Corpus data and scripts publicly available — see Resources tab.