Rueter , J & Hämäläinen , M 2017 , Synchronized Mediawiki based analyzer dictionary development . in F M Tyers , M Rießler , T A Pirinen & T Trosterud (eds) , 3rd International Workshop for Computational Linguistics of Uralic Languages (IWCLUL 2017) : St. Petersburg, Russia 23 – 24 January 2017 . , 2 , The Association for Computational Linguistics , Stroudsburg , pp. 1-7 , International Workshop for Computational Linguistics of Uralic Languages , St. Petersburg , Russian Federation , 23/01/2017 . https://doi.org/10.18653/v1/w17-0601
Title: | Synchronized Mediawiki based analyzer dictionary development |
Author: | Rueter, Jack; Hämäläinen, Mika |
Other contributor: |
Tyers, Francis M.
Rießler, Michael Pirinen , Tommi A. Trosterud , Trond |
Contributor organization: | Department of Modern Languages 2010-2017 Language Technology Department of Computer Science |
Publisher: | The Association for Computational Linguistics |
Date: | 2017 |
Language: | eng |
Number of pages: | 7 |
Belongs to series: | 3rd International Workshop for Computational Linguistics of Uralic Languages (IWCLUL 2017) |
ISBN: | 978-1-5108-3665-5 |
DOI: | https://doi.org/10.18653/v1/w17-0601 |
URI: | http://hdl.handle.net/10138/232470 |
Abstract: | Open-source analyzer dictionary development is being implemented for Skolt Sami, Ingrian, Moksha-Mordvin, etc. in the Helsinki CSC infrastructure; home of the Finnish Kielipankki ’Language Bank’ and Termipankki ’Term Bank’. The proximity of minority-language corpora in need of annotation and the multiple usage of controlled wikimedia-type dictionaries make CSC an attractive site for synchronized transducer dictionary development. The open-source FST develop- ment of Uralic and other minority languages at Giellatekno-Divvun in Tromsø demonstrates a vast potential for reusage of FST-s, only augmented by open- source work in OmorFi, Apertium and Universal Dependency <http://univer- saldependencies.org/#language-urj>. The initial idea is to allow synchronized editing of Giellatekno xml and CSC wiki structures via github. In addition to allowing for simple lexc LEMMA:STEM CONTINUATION_LEXICON ”TRANS- LATION” ; line exports, the parallel dictionaries will provide for documentation of derivation, morpho-syntactic information on valency and government, seman- tics and etymology. |
Subject: |
6121 Languages
Open-source Analyzer dictionary development Wiki-based dictionary Synchronized dictionary editing Uralic Languages Semantics Morphology Morpho-syntactic data Etymology |
Peer reviewed: | Yes |
Rights: | cc_by |
Usage restriction: | openAccess |
Self-archived version: | publishedVersion |
Total number of downloads: Loading...
Files | Size | Format | View |
---|---|---|---|
W17_0601.pdf | 83.19Kb |
View/ |