
A threshold-calibrated, prototype-based pipeline for estimating word sense prevalence in diachronic text corpora. Applied to schizophrenia in historical U.S. news, the pipeline combines sense inventories, generated prototype usages, target-aware embeddings, human-calibrated similarity thresholds, and sense prevalence estimation over time. The repository includes a sample of expert labeled U.S. news sentences (containing the term schizophrenia annotated for which Oxford English Dictionary sense they express).
Mar 28, 2026

Evaluation framework for LSC detection methods in experimental settings using LLM-generated Data (Synthetic datasets to evaluate key dimensions of LSC, generated using LLMs and WordNet).
Apr 22, 2025

Evaluate dimensions of LSC concurrently, and sociocultural drivers, to study conceptual change.
Aug 10, 2024