Ling 431/631: Corpus Linguistics

Ben Bergen

 

Meeting 6: Morphology

September 24, 2007

 

Productivity

 

One of the most interesting things about the grammar of human languages is the productivity of syntactic and morphological processes - namely the possibility of producing new things from component parts and processes. This is presumably the property that gives language its infinite potential.

 

But how do you measure it? (adapted loosely from Bauer 2001)

 

  1. Productivity is how thoroughly a morphological process is distributed in the language:

 

P = ntype/Htype

where ntype is the number of types using the rule and Htype is the hypothetical total number of types which the word formation rule could give rise to.

 

                  Any problems with this?

 

  1. Productivity is how likely a process is to be used in comparison with competitors

 

P = ntype/Ttype or ntoken/Ttoken

where ntype and ntoken are the type and token frequencies of the process and Ttype and Ttoken are the total type and token frequencies of the particular process and its competitors

 

                  Problems with these?

 

  1. Productivity is how often a process is used to create new words, compared to its overall use

 

P = n1/Ntoken

where n1 is the number of words formed by the appropriate process occurring in a corpus precisely once (i.e., hapax legomena) and Ntoken is the total token frequency of words created by that morphological process in the corpus.

 

hapax legomenon (pl. legomena): a word form occurring only once in a corpus

 

                  Problems with this?

 

  1. Productivity is the rate of addition of words using a morphological process over a period of time

 

How can you calculate this?

 

So though there are a number of measures, they're mostly indirect.

 

Beyond productivity

 

Because pretty much all morphological processes are only partly productive (if at all), sometimes there are competing members of paradigms. These can be studied and their contexts of use described.

 

Also related to productivity are a variety of questions

 

Even pretty much unproductive stuff can have psychological reality, like phonaesthemes (Bergen 2004):

the frequency of words that have the phonaesthemes in question over the total number of words with a similar meaning

the frequency of words with the phonaestheme over the total number of words with the same form.

 

Or you can look at other statistics internal to words, such as the noun and verb business we saw last time.

 

 

References

 

Bauer , Laurie. 2001. Morphological productivity.  New York and Cambridge: Cambridge University Press.

Bergen, Benjamin. 2004. The psychological reality of phonaesthemes. Language 80(2).