Information Retrieval with Finnish Case Law Embeddings
dc.contributor |
Helsingin yliopisto, Matemaattis-luonnontieteellinen tiedekunta |
fi |
dc.contributor |
University of Helsinki, Faculty of Science |
en |
dc.contributor |
Helsingfors universitet, Matematisk-naturvetenskapliga fakulteten |
sv |
dc.contributor.author |
Sarsa, Sami |
|
dc.date.issued |
2019 |
|
dc.identifier.uri |
URN:NBN:fi:hulib-202001211119 |
|
dc.identifier.uri |
http://hdl.handle.net/10138/310006 |
|
dc.description.abstract |
This work studies how well five text vectorisation models embed Finnish case law texts into a vector space for computing inter-textual similarity. The embeddings and the similarities computed from them are used to build a Finnish case law retrieval system that allows effective querying with full documents.
A working web application is presented as part of the work. The case law data is provided by the Finnish Ministry of Justice, and the studied models are TF-IDF, LDA, Word2Vec, Doc2Vec and Doc2VecC. |
en |
dc.language.iso |
eng |
|
dc.publisher |
Helsingin yliopisto |
fi |
dc.publisher |
University of Helsinki |
en |
dc.publisher |
Helsingfors universitet |
sv |
dc.title |
Information Retrieval with Finnish Case Law Embeddings |
en |
dc.type.ontasot |
pro gradu -tutkielmat |
fi |
dc.type.ontasot |
master's thesis |
en |
dc.type.ontasot |
pro gradu-avhandlingar |
sv |
dc.subject.discipline |
Tietojenkäsittelytiede (Computer Science) |
und |
dct.identifier.urn |
URN:NBN:fi:hulib-202001211119 |
|