Variation in Universal Dependencies annotation: A token-based typological case study on adpossessive constructions



Permalink

http://hdl.handle.net/10138/323330

Citation

Sinnemäki, K & Haakana, V L J 2020, Variation in Universal Dependencies annotation: A token-based typological case study on adpossessive constructions. in M-C de Marneffe, M de Lhoneux, J Nivre & S Schuster (eds), Proceedings of the Fourth Workshop on Universal Dependencies (UDW 2020). The Association for Computational Linguistics, Stroudsburg, pp. 158-167, Workshop on Universal Dependencies, 13/12/2020. <https://universaldependencies.org/udw20/papers/2020.udw2020-1.18.pdf>

Title: Variation in Universal Dependencies annotation: A token-based typological case study on adpossessive constructions
Author: Sinnemäki, Kaius; Haakana, Viljami Lauri Juhana
Editor: de Marneffe, Marie-Catherine; de Lhoneux, Miryam; Nivre, Joakim; Schuster, Sebastian
Contributor: University of Helsinki, General Linguistics
University of Helsinki, Department of Languages
Publisher: The Association for Computational Linguistics
Date: 2020
Language: eng
Number of pages: 10
Belongs to series: Proceedings of the Fourth Workshop on Universal Dependencies (UDW 2020)
ISBN: 978-1-952148-48-4
URI: http://hdl.handle.net/10138/323330
Abstract: In this paper we present a method for identifying and analyzing adnominal possessive constructions in 66 Universal Dependencies treebanks. We classify adpossessive constructions in terms of their morphological type (locus of marking) and present a workflow for detecting and analyzing them typologically. Based on a preliminary evaluation, the algorithm works fairly reliably in adpossessive constructions that are morphologically marked. However, it performs rather poorly in adpossessive constructions that are not marked morphologically, so-called zero-marked constructions, because of difficulties in identifying these constructions with the current annotation. We also discuss different types of variation in annotation in different treebanks for the same language and for treebanks of closely related languages. The research focuses on one well-circumscribed and universal construction in the hope of generating more interest in using UD for cross-linguistic comparison and for contributing towards developing yet more consistent annotation of constructions in the UD annotation scheme.
Subject: 6121 Languages
universal dependencies
possessive constructions
open-source
language typology
113 Computer and information sciences

Files in this item

Files Size Format
2020.udw2020_1.18.pdf 536.1Kb PDF
