The Helsinki submission to the AmericasNLP shared task

Permalink

http://hdl.handle.net/10138/334239

Citation

Vázquez, R, Scherrer, Y, Virpioja, S & Tiedemann, J 2021, The Helsinki submission to the AmericasNLP shared task. In M Mager [et al.] (ed.), Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas. The Association for Computational Linguistics, Stroudsburg, pp. 255-264, Workshop on Natural Language Processing for Indigenous Languages of the Americas, 11/06/2021. https://doi.org/10.18653/v1/2021.americasnlp-1.29

Title: The Helsinki submission to the AmericasNLP shared task
Author: Vázquez, Raúl; Scherrer, Yves; Virpioja, Sami; Tiedemann, Jörg
Editor: Mager [et al.], Manuel
Contributor: University of Helsinki, Department of Digital Humanities
Publisher: The Association for Computational Linguistics
Date: 2021-06-01
Language: eng
Number of pages: 10
Belongs to series: Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas
ISBN: 978-1-954085-44-2
URI: http://hdl.handle.net/10138/334239
Abstract: The University of Helsinki participated in the AmericasNLP shared task for all ten language pairs. Our multilingual NMT models reached the first rank on all language pairs in track 1, and first rank on nine out of ten language pairs in track 2. We focused our efforts on three aspects: (1) the collection of additional data from various sources such as Bibles and political constitutions, (2) the cleaning and filtering of training data with the OpusFilter toolkit, and (3) different multilingual training techniques enabled by the latest version of the OpenNMT-py toolkit to make the most efficient use of the scarce data. This paper describes our efforts in detail.
Subject: 113 Computer and information sciences
6121 Languages

Files in this item

File: 2021.americasnlp_1.29.pdf
Size: 378.4 Kb
Format: PDF
