TY - T1 - Detecting Articles in a Digitized Finnish Historical Newspaper Collection 1771–1929: Early Results Using the PIVAJ Software SN - / UR - http://hdl.handle.net/10138/312739 T3 - A1 - Kettunen, Kimmo; Ruokolainen, Teemu; Liukkonen, Erno Samuli; Tranouez, Pierrick; Antelme, Daniel; Paquet, Thierry A2 - PB - The Association for Computing Machinery Y1 - 2019 LA - eng AB - This paper describes first large scale article detection and extraction efforts on the Finnish Digi newspaper material of the National Library of Finland (NLF) using data of one newspaper, Uusi Suometar 1869-1898 . The historical digital newspaper archive environment of the NLF is based on commercial docWorks software. The software is capable of article detection and extraction, but our material does not seem to behave well in the system in t his respect. Therefore, we have been in search of an ... VO - IS - SP - OP - KW - 113 Computer and information sciences; 518 Media and communications N1 - PP - ER -