The Kolipsi Corpus Family: Resources for Learner Corpus Research in Italian and German

This article describes the Kolipsi Corpus Family (KCF), a collection of eight related resources for learner corpus research in German and Italian. The KCF supports the study of second language (L2) acquisition of Italian and German in upper secondary schools. It subsumes four L2 corpora with compara...

Bibliographic Details
Main Authors: Aivars Glaznieks, Jennifer-Carmen Frey, Andrea Abel, Lionel Nicolas, Chiara Vettori
Format: Article
Language: English
Published: Accademia University Press 2024-03-01
Series: IJCoL
Online Access: https://journals.openedition.org/ijcol/1210
Collection: DOAJ
Description: This article describes the Kolipsi Corpus Family (KCF), a collection of eight related resources for learner corpus research in German and Italian. The KCF supports the study of second language (L2) acquisition of Italian and German in upper secondary schools. It subsumes four L2 corpora with comparable corpus design (with respect to data collection, writing tasks, additional metadata, annotation and processing), portraying two homogeneous learner groups and their learner varieties. The corpora are representative of language learners in the multilingual Italian province of South Tyrol, where both languages are taught daily. The L2 corpora were collected at two different points in time, in 2007 (Kolipsi-1) and 2014 (Kolipsi-2), and all texts were labeled with CEFR levels to allow comparisons of proficiency levels across time. L2 German texts were collected in schools with Italian as the main language of instruction, whereas L2 Italian texts were collected in schools with German as the main language of instruction. Additional resources within the KCF allow researchers to compare the students’ language competences in their L2 with the language competences in their first language (L1) in a different task (Kolipsi-Matura) and with similarly aged L1 writers performing the same task (Kolipsi-1-L1). All texts are freely available to the scientific community. Access to the data is granted via an ANNIS search interface and via the Eurac Research CLARIN Repository, from which corpus data can be downloaded in various formats.
ID: doaj-art-d2e57fb420424922af6ed7f0d4c7ad5e
Institution: OA Journals
ISSN: 2499-4553
DOI: 10.4000/ijcol.1210