Adding a Novel Italian Treebank of Marked Constructions to Universal Dependencies

In this paper we present MarkIT, a novel treebank developed to analyse marked constructions in Italian. The resource contains almost 1,300 sentences manually annotated with dependency relations following the Universal Dependencies paradigm. The sentences have been extracted from essays written...

Bibliographic Details
Main Authors: Teresa Paccosi, Alessio Palmero Aprosio, Sara Tonelli
Format: Article
Language: English
Published: Accademia University Press 2023-08-01
Series: IJCoL
Online Access: https://journals.openedition.org/ijcol/1110
collection DOAJ
description In this paper we present MarkIT, a novel treebank developed to analyse marked constructions in Italian. The resource contains almost 1,300 sentences manually annotated with dependency relations following the Universal Dependencies paradigm. The sentences have been extracted from essays written by high-school students over several years, which accounts for the variability in the structure and topic of the sentences. In this work, we detail the process used to select the sentences, parse them automatically, and then manually correct them. The resource covers seven types of marked constructions (839 sentences overall) plus sentences whose syntax could be wrongly classified as marked and which can serve as negative examples of markedness (453 sentences). We also present an evaluation of parsing performance, comparing a model trained on existing Italian treebanks with the model obtained by adding MarkIT to the training set.
id doaj-art-7b8c3b2d998941cba03b21f53fac04be
institution OA Journals
issn 2499-4553
spelling doaj-art-7b8c3b2d998941cba03b21f53fac04be 2025-08-20T02:21:41Z eng Accademia University Press IJCoL 2499-4553 2023-08-01 9 1 10.4000/ijcol.1110