A Finnish news corpus for named entity recognition

Show simple item record

dc.contributor.author Ruokolainen, Teemu
dc.contributor.author Kauppinen, Pekka
dc.contributor.author Silfverberg, Miikka
dc.contributor.author Lindén, Krister
dc.date.accessioned 2021-10-31T23:04:46Z
dc.date.available 2021-10-31T23:04:46Z
dc.date.issued 2020-03
dc.identifier.citation Ruokolainen , T , Kauppinen , P , Silfverberg , M & Lindén , K 2020 , ' A Finnish news corpus for named entity recognition ' , Language Resources and Evaluation , vol. 54 , no. 1 , pp. 247-272 . https://doi.org/10.1007/s10579-019-09471-7
dc.identifier.other PURE: 138683175
dc.identifier.other PURE UUID: 60d824da-d7e7-44e6-9e1d-084d88ffdf27
dc.identifier.other RIS: urn:40991D0CA3F7EEABC7E5BD2F5EE7E50B
dc.identifier.other RIS: Ruokolainen2020
dc.identifier.other WOS: 000530845200010
dc.identifier.other Scopus: 85081760338
dc.identifier.other ORCID: /0000-0003-2337-303X/work/83433996
dc.identifier.other ORCID: /0000-0003-2071-5110/work/83435862
dc.identifier.other ORCID: /0000-0001-7454-5300/work/83436230
dc.identifier.uri http://hdl.handle.net/10138/335859
dc.description.abstract We present a corpus of Finnish news articles with a manually prepared named entity annotation. The corpus consists of 953 articles (193,742 word tokens) with six named entity classes (organization, location, person, product, event, and date). The articles are extracted from the archives of Digitoday, a Finnish online technology news source. The corpus is available for research purposes. We present baseline experiments on the corpus using a rule-based and two deep learning systems on two, in-domain and out-of-domain, test sets. en
dc.format.extent 26
dc.language.iso eng
dc.relation.ispartof Language Resources and Evaluation
dc.rights unspecified
dc.rights.uri info:eu-repo/semantics/closedAccess
dc.subject 6121 Languages
dc.subject Named entity recognition
dc.subject Finnish
dc.subject Newswire
dc.subject Wikipedia
dc.subject AGREEMENT
dc.title A Finnish news corpus for named entity recognition en
dc.type Article
dc.contributor.organization Centre for Preservation and Digisation
dc.contributor.organization The National Library of Finland
dc.contributor.organization Department of Digital Humanities
dc.contributor.organization Language Technology
dc.description.reviewstatus Peer reviewed
dc.relation.doi https://doi.org/10.1007/s10579-019-09471-7
dc.relation.issn 1574-020X
dc.rights.accesslevel closedAccess
dc.type.version submittedVersion
dc.identifier.url https://arxiv.org/abs/1908.04212

Files in this item

Total number of downloads: Loading...

Files Size Format View
1908.04212.pdf 246.3Kb PDF View/Open

This item appears in the following Collection(s)

Show simple item record