Information Retrieval with Finnish Case Law Embeddings

Title: Information Retrieval with Finnish Case Law Embeddings
Author: Sarsa, Sami
Other contributor: Helsingin yliopisto, Matemaattis-luonnontieteellinen tiedekunta
University of Helsinki, Faculty of Science
Helsingfors universitet, Matematisk-naturvetenskapliga fakulteten
Publisher: Helsingin yliopisto
Date: 2019
Language: eng
Thesis level: master's thesis
Discipline: Computer Science (Tietojenkäsittelytiede)
Abstract: This work studies how well five text vectorisation models embed Finnish case law texts into a vector space for computing inter-textual similarity. The embeddings and the similarities computed from them are used to build a Finnish case law retrieval system that supports effective querying with full documents. A working web application is presented as part of the work. The case law data is provided by the Finnish Ministry of Justice, and the studied models are TF-IDF, LDA, Word2Vec, Doc2Vec and Doc2VecC.
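
Illustrative note: the abstract describes embedding documents into a vector space and ranking them by similarity to a full query document. The following is a minimal sketch of that general idea using TF-IDF weights and cosine similarity, written with the Python standard library only. It is not taken from the thesis: the function names (`tfidf_vectors`, `cosine`, `rank`), the smoothed IDF formula, and the three toy English texts are all assumptions made for illustration, and the thesis may weight, tokenise, and rank differently.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per tokenised document."""
    n = len(docs)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF (an assumption of this sketch, not the thesis's formula).
    idf = {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}
    return [{term: tf * idf[term] for term, tf in Counter(doc).items()} for doc in docs]


def cosine(u, v):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank(query_index, vectors):
    """Rank every other document by similarity to the document at query_index."""
    query = vectors[query_index]
    scores = [(i, cosine(query, vec)) for i, vec in enumerate(vectors) if i != query_index]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical placeholder texts, not real case law.
corpus = [
    "court dismissed the appeal on contract damages".split(),
    "appeal concerning damages under a sales contract".split(),
    "tax assessment of inherited property".split(),
]
vectors = tfidf_vectors(corpus)
ranking = rank(0, vectors)  # use the first document as the full-document query
```

Here the query document is itself part of the corpus, which keeps the sketch short. A retrieval system that accepts unseen documents would instead vectorise the query with the vocabulary and IDF values fitted on the indexed collection.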

Files in this item

File: grappa-gradu.pdf
Size: 1.863 MB
Format: PDF
