Bidirectional Interaction of Psycholinguistics and Corpus Linguistics; Some Challenges Persian Frequency Dictionaries Meet

Document Type : Original Article

Authors

1 Associate Professor of Linguistics at Ayatollah Boroujerdi University, Broujerd, Iran (Corresponding Author)

2 Associate Professor of Linguistics at Fine Art College of Tehran University, Tehran. Iran.

10.22075/jlrs.2025.37997.2679

Abstract

Lexical decision tasks in psycholinguistics is the exact intersection of psycholinguistics and corpus linguistic findings and outputs. Here is the ground which the corpus analyses will be used directly. Throughout the course of using the frequency dictionaries of Persians the researcher(s) may encounter(s) some shortcomings and deficiencies that lead to the conclusions: Frequency estimates are not sufficient. Part of this relates to the fact that the word frequency as depicted in frequency dictionaries is affected by other factors (like the kind of sources used in data compilation) and the other part relates to the effects of other variables which correlates with word knowledge (like word prevalence, word length, age of acquisition of the word, similarity to other words, and concreteness of the word content). Here, the research problem is that it seems current Persian frequency dictionaries are insufficient for psycholinguistic researches. The present article tries to find why it is the case.  The main question of the present study is the psychological reality of the frequency effects which two main Persian frequency dictionaries (abbreviated as RPFD and PFD here) are able to depict. It asks what psycholinguistic factors will provide corpus compilation methodology with frequency dictionaries to be more realistic and valid. Using lexical decision tasks, the present article shows that some other psychological factors (like the recency effect, prominence, and prevalence) will correlates with word knowledge which these dictionaries miss to show or they ignore. Considering these factors will enhance Persian frequency dictionaries. The article systematically addresses each limitation (e.g., lack of prevalence data, recency effects). It also proposes solutions (e.g., integrating psycholinguistic metrics into corpus design).

Keywords


  • Ambridge, B., Kidd, E., Rowland C. F, & Theakston, A.L. (2015). The Ubiquity of Frequency     Effects in First Language Acquisition. Journal of Child Language 42(2): 239–273.
  • Bijankhan, M. & Mohseni, M. (2018). Frequency Dictionary According to a Written Corpus of Today Persian Language. Tehran: University of Tehran.
  • Blumenthal-Dramé, A. (2012). Entrenchment in Usage-Based Theories: What Corpus Data Do and Do Not Reveal About The Mind. Berlin/New York: de Gruyter.
  • Brysbaert, M., & Ellis, A. W. (2016). Aphasia and Age of Acquisition: Are Early-learned Words More Resilient? Aphasiology, 30, 1240–1263.
  • Brysbaert, M., & New, B. (2009). Moving beyond Kucera and Francis: A Critical Evaluation of Current Word Frequency Norms and the Introduction of a New and Improved Word Frequency Measure for American English. Behavior Research Methods, 41, 977–990.
  • Brysbaert,, Mandera, P., & Keuleers, E. (2018). The Word Frequency Effect in Word Processing: An Updated Review: Current Directions in Psychological Science. vol. 27 (1).
  • Brysbaert,, Mandera, P., & Stevense, M. (2015). The Impact of Word Prevalence on Lexical Decision Times: Evidence from the Dutch Lexicon Project 2. Journal of Experimental Psychology Human Learning & Memory.
  • Bybee, J. L. (2006). From Usage to Grammar: The Mind’s Response to Repetition. Language, 82(4): 711–733.
  • Bybee, J. L. (2010). Language, Usage, and Cognition. Cambridge: Cambridge University Press.
  • Cermak, F. & Kren, M. (2005). New Generation Corpus-based Frequency Dictionaries. International Journal of Corpus Linguistics, 4. Pp. 453-467.
  • Divjak, Dagmar & Catherine L. Caldwell-Harris. (2015) Frequency and entrenchment. In: Ewa Dabrowska and Dagmar Divjak (eds.), Handbook of Cognitive Linguistics, 53–75. Berlin/ New York: de Gruyter.
  • Ellis, N. C. (2012). What Can We Count in Language, and What Counts in Language Acquisition Cognition, and Use? In: Stefan Th. Gries and Dagmar S. Divjak (eds.), Frequency Effects.
  • Ferrand, L., New, B., Brysbaert, M., Keuleers, E., Bonin, P., Meot, A., Augustinova, M., & Pallier, C. (2010). The French Lexicon Project: Lexical Decision Data for 38,840 French Words and 38,840 Pseudowords. Behavior Research Methods, 42, 488-496.
  • Gimenes, M., and New, B. (2016). Worldlex: Twitter and Blog Word Frequencies for 66 Languages. Behavior Research Methods, 48, 963–972.
  • Herdag˘delen, A., & Marelli, M. (2017). Social Media and Language Processing: How Facebook and Twitter Provide the Best Frequency Estimates for Studying Word Recognition. Cognitive Science, 41, 976–995. doi:10.1111/cogs.12392.
  • Hasani, H. (2005). The Most Frequent Contemporary Persian Words. Tehran: Iran Language Center.
  • Jacoby, L. L. & Dallas, M. (1981). On the Relationship between Autobiographical and Perceptual Learning. Journal of Experimental Psychology: General 110: 306–340.
  • Johns, B. T., Jones, M. N., & Mewhort, D. J. K. (2016). Experience as a Free Parameter in the Cognitive Modeling of Language. In A. Papafragou, D. Grodner, D. Mirman, & J. C. Trueswell (Eds.), Proceedings of the 38th Annual Conference of the Cognitive Science Society (pp. 2291– 2296). Austin, TX: Cognitive Science Society.
  • Juhasz, B. J., Yap, M. J., Dicke, J., Taylor, S. C., & Gullick, M. M. (2011). Tangible Words Are Recognized Faster: The Grounding of Meaning in Sensory and Perceptual Systems. The Quarterly Journal of Experimental Psychology, 64(9), 1683-1691.
  • Kaeding, F.W. (1897). Haufigkeitsworterbuch der DeutschenSpreche. SteglitzEigenverlag.
  • Keuleers, E., Diependaele, K. & Brysbaert, M. (2010). Practice Effects in Large-scale Visual Word Recognition Studies: A Lexical Decision Study on 14,000 Dutch Mono- and Disyllabic Words and Nonwords. Frontiers in Psychology 1:174. doi: 10.3389/fpsyg.2010.00174.
  • Keuleers, E., Lacey, P., Rastle, K., & Brysbaert, M. (2012). The British Lexicon Project: Lexical Decision Data for 28,730 Monosyllabic and Disyllabic English Words. Behavior Research Methods, 44, 287-304.
  • Keuleers, M., Stevens, M., Mandera, P., & Brysbaert, M. (2015). Word Knowledge in the Crowd: Measuring Vocabulary Size and Word Prevalence in a Massive Online Experiment. Quarterly Journal of Experimental Psychology.
  • Langacker, R. W. (1987). Foundations of Cognitive Grammar, Vol. 1: Theoretical Prerequisites.Stanford: Stanford University Press.
  • Miller, C. & Aghajanian-Stewart, K. (2018). A Frequency Dictionary

    of Persian, Core Vocabulary for Learners. London: Routledge.

  • McEnery, T., & Hardie, A. (2012). Corpus Linguistics; Method, Theory, and Practice. Cambridge: Cambridge University Press.
  • Mehrabi, M. (2014). Lexical Information of Persian Transitive Verbs during Sentence Comprehension. Language Related Research. 5, No. 1, pp. 271-295.
  • Mehrabi, M. (2015). The Effect of Farsi Verb Representational Complexity on Processing Time During Listening Comprehension. Journal of Researches in Linguistics, Vol. 7(1), Pp.77-92.
  • Mehrabi, M. & Mahmoodi-Bakhtiari, B. (2020). A Comparative Study of Comprehension of Evidentiality in Persian, English, and Turkish; a Psycholinguistic Approach. Comparative Linguistic Researches. 20. Pp. 243-257.
-      Mehrabi, M. & Mahmoodi-Bakhtiari, B. (2021a). The Psychological Reality of Evidentiality Hierarchy in Persian during Sentence Listening Comprehension. Language Related Research. N.12. Pp. 539-566. https://doi.org/10.29252/LRR.12.2.17
  • Mehrabi, M. & Mahmoodi-Bakhtiari, B. (2021b). Mental Representations of Persian and English Absolute and Relative Tenses: A Contrastive Psycholinguistic Approach. Journal of Researches in Linguistics, Vol. 13(1), Pp.89-112.
  • Mehrabi, M. & Mahmoodi-Bakhtiari, B. (2022a). Representational Complexity of Persian Absolute Tenses during Listening Comprehension. Scientific Journal of Language Research, Vol. 13, No. 41. Pp.55-80.
  • Mehrabi, M. & Mahmoodi-Bakhtiari, B. (2022b). Representational Complexity of Persian Relative tenses during listening Comprehension. Language Related Research. 13, No. 2, Pp. 1-32. https://doi.org/10.52547/LRR.13.2.1
  • Mehrabi, M. & Mahmoodi-Bakhtiari, B. & Vaezi, H. (2021). Auditory Perception of Persian Interrogatives from a Psycholinguistic Approach and its Application in Persian teaching. Journal of Teaching Persian to Speakers of Other Languages. Vol. 10, No. 2. Pp. 137-156.
  • Mehrabi, M. & Zaker, A. (2013). Lexical Information of Persian Transitive Verbs during Sentence Comprehension. Iranian Studies. 46, No. 6, Pp. 959–971.
  • Miller, C., & Aghajanian-Stewart, K. (2018). A Frequency Dictionary of Persian; Core Vocabulary for Learners. London:
  • Monsell, S., Doyle, M. C., & Haggard, P. N. (1989). Effects of Frequency on Visual Word Recognition Tasks: Where Are They? Journal of Experimental Psychology: General, 118, 43–71.
  • Niemikorpi, A. (1997). Equilibrium of Words in the Finnish Frequency Dictionary. Journal of Quantitative Linguistics. Vol. 4. NO: 1-3. Pp. 190-196.
  • Schwenter, S. A. (2013). Strength of Priming and the Maintenance of Variation in the Spanish Past Subjunctive. Paper Presented at NWAV 2012 Pittsburgh.
  • Shapiro, L.P. & B. Levine. (1990). Verb Processing during Sentence Comprehension in Aphasia. Brain and Language, 38: 21-47.
  • Szmrecsanyi, B. (2006). Morphosyntactic Persistence in Spoken English: A Corpus Study at the Intersection of Variationist Sociolinguistics, Psycholinguistics, and Discourse

    Analysis. Berlin/New York: de Gruyter.

  • Tomlin, R. S. & Myachykov, A. (2015). Attention and Salience. In: Ewa Dabrowska and Dagmar Divjak (eds.), Handbook of Cognitive Linguistics, 31–52. Berlin/New York: de Gruyter.
  • Warschauer, M. (2006). Laptops and Literacy: Learning in the Wireless Classroom. New York: Teachers College Press.
  • Yap, M. J., & Balota, D. A. (2009). Visual Word Recognition of Multisyllabic Words. Journal of Memory & Language, 60, 502-529.
  • Yonelinas, A. P. (2002). The Nature of Recollection and Familiarity: A Review of 30 Years of Research. Journal of Memory and Language, 46, 441–517.
  • Zolfaghari, H. & Hasani Kochaki, E. (2023). Stylistic Analysis of Prose of Qajar Period Newspapers. The Journal of Linguistic and Rhetorical studies. Volume 14, Consecutive Number 31. Pp. 65-98.