Modeling the semantics of emotion: A culturally grounded approach to Flemish narratives
DOI:
https://doi.org/10.3765/plsa.v11i1.6113Keywords:
daily narratives, emotion words, topic modeling, large language models, embeddings, under resourced languagesAbstract
Expression of emotions through natural language is deeply embedded in culturally specific contexts and extends beyond simple lexical labels. A central difficulty lies in extracting structured semantic knowledge from unstructured daily narratives, a task particularly challenging for under-resourced language varieties such as Flemish (Belgian Dutch), which have historically received minimal computational attention. This study evaluates transformer-based models, specifically BERTopic, against traditional co-occurrence-based models (LDA) and clustering baselines (KMeans) on a uniquely large corpus of 24,854 daily narratives collected from 102 Dutch speakers over 70 days in Belgium, using both automated coherence metrics and a human evaluation. Our findings demonstrate that moving beyond frequency-based models is essential for semantically and culturally accurate analysis of naturalistic emotional language in under-resourced varieties.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Ratna Kandala, Katie Hoemann

This work is licensed under a Creative Commons Attribution 4.0 International License.
Published by the LSA with permission of the author(s) under a CC BY 4.0 license.
