Type-based bigram frequencies for five-letter words (original) (raw)

Abstract

Researchers often require subjects to make judgments that call upon their knowledge of the orthographic structure of English words. Such knowledge is relevant in experiments on, for example, reading, lexical decision, and anagram solution. One common measure of orthographic structure is the sum of the frequencies of consecutive bigrams in the word. Traditionally, researchers have relied on_token-based_ norms of bigram frequencies. These norms confound bigram frequency with word frequency because each instance (i.e., token) of a particular word in a corpus of running text increments the frequencies of the bigrams that it contains. In this article, the authors report a set of_type-based_ bigram frequencies in which each word (i.e., type) contributes only once, thereby unconfounding bigram frequency from word frequency. The authors show that type-based bigram frequency is a better predictor of the difficulty of anagram solution than is token-based frequency. These norms can be downloaded fromwww.psychonomic.org/archive/.

Article PDF

References

Dorfman, J. (1999). Utilization of sublexical components in implicit memory for novel words.Psychological Science,10,387–392.
Article Google Scholar
Gilhooly, K. J. (1978). Bigram statistics for 205 five-letter words having single-solution anagrams.Behavior Research Methods & Instrumentation,10, 389–392.
Article Google Scholar
Kučera, H., &Francis, W. (1967).Computational analysis of present-day American English. Providence, RI: Brown University Press.
Google Scholar
Mayzner, M. S., &Tresselt, M. E. (1963). Anagram solution times: A function of word length and letter position variables.Journal of Psychology,55,469–475.
Article Google Scholar
Mayzner, M. S., &Tresselt, M. E. (1965). Tables of single-letter and digram frequency counts for various word-length and letter-position combinations.Psychonomic Monograph Supplements,1 (Whole No. 2), 13–32.
Google Scholar
Mendelsohn, G. A., &O’Brien, A. T. (1974). The solution of anagrams: A reexamination of the effects of transition letter probabilities, letter moves, and word frequency on anagram difficulty.Memory & Cognition,2, 566–574.
Article Google Scholar
Olson, R., &Schwartz, R. (1967). Single and multiple solution five-letter words.Psychonomic Monograph Supplements,2 (8, Whole No. 24), 105–152.
Google Scholar
Pratt, F. (1942).Secret and urgent: The story of codes and ciphers. Garden City, NY: Blue Ribbon Books.
Google Scholar
Rice, G. A., &Robinson, D. O. (1975). The role of bigram frequency in the perception of words and nonwords.Memory & Cognition,3, 513–518.
Article Google Scholar
Salthouse, T. A. (1984). Effects of age and skill in typing.Journal of Experimental Psychology: General,113,345–371.
Article Google Scholar
Seidenberg, M. S., Waters, G. S., Barnes, M. A., &Tanenhaus, M. K. (1984). When does irregular spelling or pronunciation influence word recognition?Journal of Verbal Learning & Verbal Behavior,23,383–404.
Article Google Scholar
Solso, R. L., &Juel, C. L. (1980). Positional frequency and versatility of bigrams for two-through nine-letter English words.Behavior Research Methods & Instrumentation,12, 297–343.
Article Google Scholar
Srinivas, K., Roediger, H. L., III, &Rajaram, S. (1992). The role of syllabic and orthographic properties of letter cues in solving word fragments.Memory & Cognition,20,219–230.
Article Google Scholar
Underwood, B. J., &Schulz, R. W. (1960).Meaningfulness and verbal learning. Chicago: Lippincott.
Google Scholar
Westbury, C., &Buchanan, L. (2002). The probability of the least likely non-length-controlled bigram affects lexical decision reaction times.Brain & Language,8166–78.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Psychology and Human Development, Vanderbilt University, Peabody College #512,230 Appleton Place, 37203-5721, Nashville, TN
Laura R. Novick
Indiana University, Bloomington, Indiana
Steven J. Sherman

Authors

Laura R. Novick
Steven J. Sherman

Corresponding author

Correspondence toLaura R. Novick.

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Novick, L.R., Sherman, S.J. Type-based bigram frequencies for five-letter words.Behavior Research Methods, Instruments, & Computers 36, 397–401 (2004). https://doi.org/10.3758/BF03195587

Download citation

Received: 29 December 2003
Accepted: 12 July 2004
Issue date: August 2004
DOI: https://doi.org/10.3758/BF03195587