Using Learner Corpora to Inform Material Development and Classroom Practices

Authors

DOI:

https://doi.org/10.55393/babylonia.v3i.638

Keywords:

Learner-corpora, Vocabulary, Lexical Diversity, Lexical Sophistication

Abstract

This article examines how a university in Japan uses a longitudinal learner corpus to bridge the gap between research and classroom practice in a two-year English for Academic Purposes (EAP) program. This large-scale longitudinal corpus is comprised of texts collected from first- and second-year Japanese university L2 English learners (n = 701) at multiple points of time and across different writing prompts. The corpus was correlated to participants' demographic and linguistic information to better understand the learners' vocabulary needs.

The article explores the insights that can be gained from an analysis of this corpus and investigates how these insights can but used to develop materials tailored to learners' backgrounds and proficiency levels. This builds on  earlier studies that have used corpora derived from L1 texts to help identify what expressions learners need to know. However, L2 speaker vocabulary usage differs from L1 usage. Using learner corpora can reveal actual patterns of L2 vocabulary development, allowing for classroom practices that target student needs. This article highlights how written corpora allowed us to develop holistic classroom practices that helped learners improve their proficiency.

The article contributes to the dialogue between research and practice by demonstrating how learner corpora can inform teaching. It illustrates the interplay between research and classroom application and highlights how learner corpora can enhance language education.

References

Botarleanu, R.-M., Dascalu, M., Watanabe, M., Crossley, S. A., & McNamara, D. S. (2022). Age of Exposure 2.0: Estimating word complexity using iterative models of word embeddings. Behavior Research Methods, 54(6), 3015–3042. https://doi.org/10.3758/s13428-022-01797-5 DOI: https://doi.org/10.3758/s13428-022-01797-5

Brezina, V., Gablasova, D., & McEnery, T. (2019). Corpus-based approaches to spoken L2 production: Evidence from the Trinity Lancaster Corpus. International Journal of Learner Corpus Research, 5(2), 119–125. https:// doi.org/10.1075/ijlcr.00008.int DOI: https://doi.org/10.1075/ijlcr.5.2

Brooks, G., Clenton, J., & Fraser, S. (2025). Exploring potential factors affecting reading comprehension in EAL learners: A preliminary corpus-based analysis. Languages. https://doi. org/10.3390/languages10020030 DOI: https://doi.org/10.3390/languages10020030

Callies, M. (2019). Integrating corpus literacy into language teacher education. In S. Götz & J. Mukherjee (Eds.), Learner Corpora and Language Teaching (pp. 245–263). John Benjamins Publishing. https://doi.org/10.1075/ scl.92.12cal DOI: https://doi.org/10.1075/slcs.201.12cal

Clenton, J., de Jong, N. H., Clingwall, D., & Fraser, S. (2021). Investigating the extent to which vocabulary knowledge and skills can predict aspects of fluency for a small group of pre-intermediate Japanese L1 users of English (L2). In J. Clenton & P. Booth (Eds.), Vocabulary and the four skills (pp. 126–145). Routledge. https://doi.org/10.4324/9780429285400-15 DOI: https://doi.org/10.4324/9780429285400-15

Coxhead, A. (2000). A new academic word list. TESOL Quarterly, 34(2), 213–238. https://doi. org/10.2307/3587951 DOI: https://doi.org/10.2307/3587951

Coxhead, A., & Demecheleer, M. (2018). Investigating the technical vocabulary of Plumbing. English for Specific Purposes, 51, 84– 97. https://doi.org/10.1016/j.esp.2018.03.006 DOI: https://doi.org/10.1016/j.esp.2018.03.006

de Jong, N. H., Steinel, M. P., Florijn, A., Schoonen, R., & Hulstijn, J. H. (2013). Linguistic skills and speaking fluency in a second language. Applied Psycholinguistics, 34(5), 893–916. https://doi.org/10.1017/ s0142716412000069 DOI: https://doi.org/10.1017/S0142716412000069

Gablasova, D., Brezina, V., & McEnery, A. (2019). The Trinity Lancaster Corpus: Applications in language teaching and materials development. In S. Gotz & J. Mukherjee (Eds.), Learner corpora and language teaching (pp. 7–27). John Benjamins. https://doi.org/10.1075/scl.92.02gab DOI: https://doi.org/10.1075/scl.92.02gab

Gilquin, G. (2024). Learner corpora to cater for learners’ individual needs. Babylonia Journal of Language Education, 2, 14–19. https://doi. org/10.55393/babylonia.v2i.386 DOI: https://doi.org/10.55393/babylonia.v2i.386

Granger, S. (2015). The contribution of learner corpora to reference and instructional materials design. In S. Granger, G. Gilquin, & F. Meunier (Eds.), The Cambridge Handbook of Learner Corpus Research (pp. 485–510). Cambridge. https://doi.org/10.1017/ CBO9781139649414.022 DOI: https://doi.org/10.1017/CBO9781139649414.022

Green, C., & Lambert, J. (2019). Position vectors, homologous chromosomes and gamma rays: Promoting disciplinary literacy through Secondary Phrase Lists. English for Specific Purposes, 53, 1–12. https://doi. org/10.1016/j.esp.2018.08.004 DOI: https://doi.org/10.1016/j.esp.2018.08.004

Henriksen, B., & Danelund, L. (2015). Studies of Danish L2 learners’ vocabulary knowledge and the lexical richness of their written production in English. In P. Pietilä, K. Doró, & P. Renata (Eds.), Lexical issues in L2 writing (pp. 29–55). Cambridge Scholars Publishing.

Hicks, N., & Studer, T. (2024). Learner corpora in foreign language education: examples from the multilingual SWIKO corpus. Babylonia Journal of Language Education. https://doi. org/10.55393/babylonia.v2i.388 DOI: https://doi.org/10.55393/babylonia.v2i.388

Jarvis, S. (2013). Capturing the diversity in lexical diversity. Language Learning, 63, 87-106. https://doi.org/10.1111/j.1467-9922.2012.00739.x DOI: https://doi.org/10.1111/j.1467-9922.2012.00739.x

Kikuchi, K. (2009). Listening to our learners’ voices: what demotivates Japanese high school students? Language Teaching Research, 13(4), 453–471. https://doi. org/10.1177/1362168809341520 DOI: https://doi.org/10.1177/1362168809341520

Lei, L., & Liu, D. (2016). A new medical academic word list: A corpus-based study with enhanced methodology. Journal of English for Academic Purposes, 22, 42–53. https://doi. org/10.1016/j.jeap.2016.01.00897 DOI: https://doi.org/10.1016/j.jeap.2016.01.008

Ma, Q., Chiu, M. M., Lin, S., & Mendoza, N. B. (2023). Teachers’ perceived corpus literacy and their intention to integrate corpora into classroom teaching: A survey study. ReCALL, 35(1), 19–39. doi:10.1017/S0958344022000180 DOI: https://doi.org/10.1017/S0958344022000180

Malvern, D., Richards, B., Chipere, N., & Durán, P. (2004). Lexical diversity and language development. Palgrave Macmillan UK. https:// doi.org/10.1057/9780230511804 DOI: https://doi.org/10.1057/9780230511804

McEnery, T., & Xiao, R. (2011). What corpora can offer in language teaching and learning. In E. Hinkel (Ed.), Handbook of research in second language teaching and learning (pp. 382–398). Routledge.

Mitchell, R., & Myles, F. (2024). Teaching and learning foreign languages. Babylonia Journal of Language Education. https://doi. org/10.55393/babylonia.v2i.376 DOI: https://doi.org/10.55393/babylonia.v2i.376

Mizumoto, A., & Takeuchi, O. (2009). Examining the effectiveness of explicit instruction of vocabulary learning strategies with Japanese EFL university students. Language Teaching Research, 13(4), 425–449. https://doi.org/10.1177/1362168809341511 DOI: https://doi.org/10.1177/1362168809341511

Nation, P. (2020). The BNC/COCA word family lists [PDF file]. https://www.wgtn.ac.nz/__ data/assets/pdf_file/0005/1857641/about-bnc-coca-vocabulary-list.pdf

Nation, P. (2022). Learning vocabulary in another language (3rd ed.). Cambridge University Press. DOI: https://doi.org/10.1017/9781009093873

Pérez-Paredes, P. (2022). A systematic review of the uses and spread of corpora and data-driven learning in CALL research during 2011–2015. Computer Assisted Language Learning, 35(1-2), 36–61. https://doi.org/10.108 0/09588221.2019.1667832 DOI: https://doi.org/10.1080/09588221.2019.1667832

Tannenbaum, R. J., & Baron, P. A. (2011). Mapping TOEFL ITP scores onto the Common European Framework of Reference (ETS Research Memorandum No. RM–11–33). Educational Testing Service. https://www.ets. org/Media/Research/pdf/RM-11-33.pdf

Tavakoli, P., & Foster, P. (2011). Task design and second language performance: The effect of narrative type on learner output. Language Learning, 61, 37–72. https://doi.org/10.1111/ j.1467-9922.2011.00642.x DOI: https://doi.org/10.1111/j.1467-9922.2011.00642.x

Treffers-Daller, J., Parslow, P., & Williams, S. (2018). Back to basics: How measures of lexical diversity can help discriminate between CEFR levels. Applied Linguistics, 39(3), 302–327. https://doi.org/10.1093/applin/amw009 DOI: https://doi.org/10.1093/applin/amw009

Uchihara, T., & Clenton, J. (2020). Investigating the role of vocabulary size in second language speaking ability. Language Teaching Research, 24(4), 540–556. https://doi. org/10.1177/1362168818799371 DOI: https://doi.org/10.1177/1362168818799371

Uchihara, T., & Saito, K. (2019). Exploring the relationship between productive vocabulary knowledge and second language oral ability. The Language Learning Journal, 47(1), 64–75. https://doi.org/10.1080/09571736.2016.1191527 DOI: https://doi.org/10.1080/09571736.2016.1191527

Wadden, P., Browne, C., & Nation, P. (2018). Teaching and learning vocabulary in the Japanese university. In P. Wadden & C. C. Hale (Eds.), Teaching English at Japanese Universities (pp. 84–96). Routledge. https:// doi.org/10.4324/9781315147239 DOI: https://doi.org/10.4324/9781315147239-9

Webb, S., & Nation, P. (2017). How vocabulary is learned. Oxford University Press. DOI: https://doi.org/10.25170/ijelt.v12i1.1458

Downloads

Published

2026-01-14

How to Cite

Brooks, G., Clenton, J., & Jordan, J. (2026). Using Learner Corpora to Inform Material Development and Classroom Practices. Babylonia Journal of Language Education, 3, 88–97. https://doi.org/10.55393/babylonia.v3i.638