CLEC: Colombian Learner English Corpus

Maria Victoria Pardo, Antonio Tamayo, Manuel Alejandro Gómez & Nicolás Alberto Henao

Universidad del Norte

The objective of this presentation is to introduce to the research community the CLEC (Colombian Learner English Corpus). This corpus was created following the guidelines of the Computational Corpus Linguistics (McEnery & Hardie, 2011) and according to the compilation parameters of corpus of learners defined as "electronic collections of natural or almost natural data produced by foreign or second language learners (L2) and gathered according to explicit design criteria "Granger (2002, p. 7), Gilquin (2015, p.1). The TNT (Translation and New Technologies) research group of the University of Antioquia created the CLEC. It is an application that compiles 515 written compositions of students of English as a foreign language at university level. The application allows the search for information in the tagged data, it filters error labels systematically by category or type and allows you to find the trend of learner errors. The resulting product is a web responsive application that completely performs searches and does analysis on the tagged corpus of errors.

Week 5 2020/2021

Thursday 5th November 2020
2:00-3:00pm

Online: join mailing list or contact organisers to receive link