Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Dec 21:1:218.
doi: 10.3389/fpsyg.2010.00218. eCollection 2010.

Subtitle-based word frequencies as the best estimate of reading behavior: the case of greek

Affiliations

Subtitle-based word frequencies as the best estimate of reading behavior: the case of greek

Maria Dimitropoulou et al. Front Psychol. .

Abstract

Previous evidence has shown that word frequencies calculated from corpora based on film and television subtitles can readily account for reading performance, since the language used in subtitles greatly approximates everyday language. The present study examines this issue in a society with increased exposure to subtitle reading. We compiled SUBTLEX-GR, a subtitled-based corpus consisting of more than 27 million Modern Greek words, and tested to what extent subtitle-based frequency estimates and those taken from a written corpus of Modern Greek account for the lexical decision performance of young Greek adults who are exposed to subtitle reading on a daily basis. Results showed that SUBTLEX-GR frequency estimates effectively accounted for participants' reading performance in two different visual word recognition experiments. More importantly, different analyses showed that frequencies estimated from a subtitle corpus explained the obtained results significantly better than traditional frequencies derived from written corpora.

Keywords: cultural variations; frequency estimates; subtitles; word recognition.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of the percentages of summed word frequency as a function of word length (measured in number of letters) for the GreekLex and the SUBTLEX-GR entries.

References

    1. Adelman J. S., Brown G. D. A., Quesada J. F. (2006). Contextual diversity, not word frequency, determines word naming and lexical decision times. Psychol. Sci. 17, 814–823 10.1111/j.1467-9280.2006.01787.x - DOI - PubMed
    1. Alija M., Cuetos F. (2006). Effects of the lexical-semantic variables in visual word recognition. Psicothema 18, 485–491 - PubMed
    1. Baayen R. H., Feldman L. B., Schreuder R. (2006). Morphological influences on the recognition of monosyllabic monomorphemic words. J. Mem. Lang. 55, 290–313 10.1016/j.jml.2006.03.008 - DOI
    1. Baayen R. H., Piepenbrock R., Gulikers L. (1995). The CELEX Lexical Database (Release 2) [CDROM]. Philadelphia, PA: Linguistic Data Consortium
    1. Baayen R. H., Piepenbrock R., van Rijn H. (1993). The CELEX Lexical Database [CD-ROM]. Philadelphia, PA: Linguistic Data Consortium

LinkOut - more resources