Type-based bigram frequencies for five-letter words (original) (raw)

Abstract

Researchers often require subjects to make judgments that call upon their knowledge of the orthographic structure of English words. Such knowledge is relevant in experiments on, for example, reading, lexical decision, and anagram solution. One common measure of orthographic structure is the sum of the frequencies of consecutive bigrams in the word. Traditionally, researchers have relied on_token-based_ norms of bigram frequencies. These norms confound bigram frequency with word frequency because each instance (i.e., token) of a particular word in a corpus of running text increments the frequencies of the bigrams that it contains. In this article, the authors report a set of_type-based_ bigram frequencies in which each word (i.e., type) contributes only once, thereby unconfounding bigram frequency from word frequency. The authors show that type-based bigram frequency is a better predictor of the difficulty of anagram solution than is token-based frequency. These norms can be downloaded fromwww.psychonomic.org/archive/.

Article PDF

Similar content being viewed by others

References

Download references

Author information

Authors and Affiliations

  1. Department of Psychology and Human Development, Vanderbilt University, Peabody College #512,230 Appleton Place, 37203-5721, Nashville, TN
    Laura R. Novick
  2. Indiana University, Bloomington, Indiana
    Steven J. Sherman

Authors

  1. Laura R. Novick
  2. Steven J. Sherman

Corresponding author

Correspondence toLaura R. Novick.

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Novick, L.R., Sherman, S.J. Type-based bigram frequencies for five-letter words.Behavior Research Methods, Instruments, & Computers 36, 397–401 (2004). https://doi.org/10.3758/BF03195587

Download citation

Keywords