Word formation is aware of morpheme family size

PLoS One. 2014 Apr 4;9(4):e93978. doi: 10.1371/journal.pone.0093978. eCollection 2014.

Abstract

Words are built from smaller meaning bearing parts, called morphemes. As one word can contain multiple morphemes, one morpheme can be present in different words. The number of distinct words a morpheme can be found in is its family size. Here we used Birth-Death-Innovation Models (BDIMs) to analyze the distribution of morpheme family sizes in English and German vocabulary over the last 200 years. Rather than just fitting to a probability distribution, these mechanistic models allow for the direct interpretation of identified parameters. Despite the complexity of language change, we indeed found that a specific variant of this pure stochastic model, the second order linear balanced BDIM, significantly fitted the observed distributions. In this model, birth and death rates are increased for smaller morpheme families. This finding indicates an influence of morpheme family sizes on vocabulary changes. This could be an effect of word formation, perception or both. On a more general level, we give an example on how mechanistic models can enable the identification of statistical trends in language change usually hidden by cultural influences.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Awareness
  • Humans
  • Language*
  • Models, Theoretical
  • Vocabulary*

Grants and funding

DBK was financed by the BMBF project 01UA0815C ‘Interaction between linguistic and bioinformatic procedures, methods and algorithms.’ This publication was funded by the German Research Foundation (DFG) and the University of Wuerzburg in the funding programme Open Access Publishing. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.