Finite-State Spell-Checking with Weighted Language and Error Models: Building and Evaluating Spell-Checkers with Wikipedia as Corpus


Permalink

http://hdl.handle.net/10138/29358

Citation

Pirinen, T & Linden, K 2010, Finite-State Spell-Checking with Weighted Language and Error Models: Building and Evaluating Spell-Checkers with Wikipedia as Corpus. in Proceedings of LREC 2010: Workshop on Creation and use of basic lexical resources for less-resourced languages. LREC 2010, Malta, Malta, 17/05/2010.

Title: Finite-State Spell-Checking with Weighted Language and Error Models: Building and Evaluating Spell-Checkers with Wikipedia as Corpus
Author: Pirinen, Tommi; Linden, Krister
Contributor: University of Helsinki, Department of Modern Languages 2010-2017
Date: 2010-05
Language: eng
Belongs to series: Proceedings of LREC 2010 Workshop on Creation and use of basic lexical resources for less-resourced languages
ISBN: 2-9517408-6-7
URI: http://hdl.handle.net/10138/29358
Abstract: In this paper we present simple methods for construction and evaluation of finite-state spell-checking tools using an existing finite-state lexical automaton, freely available finite-state tools and Internet corpora acquired from projects such as Wikipedia. As an example, we use a freely available open-source implementation of Finnish morphology, made with traditional finite-state morphology tools, and demonstrate rapid building of Northern Sámi and English spell checkers from tools and resources available from the Internet.
Subject: 612 Languages and Literature


Files in this item

File: PirinenLrec2010.pdf
Size: 192.6 KB
Format: PDF
