Discovering Synonyms and Other Related Words

Show simple item record

dc.contributor University of Helsinki, Department of Modern Languages 2010-2017 en
dc.contributor University of Helsinki, Department of Modern Languages 2010-2017 en
dc.contributor.author Linden, Krister
dc.contributor.author Piitulainen, Jussi Olavi
dc.contributor.editor Ananadiou, Sophia
dc.contributor.editor Zweigenbaum, Pierre
dc.date.accessioned 2012-06-04T07:23:32Z
dc.date.available 2012-06-04T07:23:32Z
dc.date.issued 2004-08
dc.identifier.citation Linden , K & Piitulainen , J O 2004 , Discovering Synonyms and Other Related Words . in S Ananadiou & P Zweigenbaum (eds) , Proceedings of COLING 2004 : CompuTerm 2004: 3rd International Workshop on Computational Terminology . pp. 63-70 , CompuTerm 2004: 3rd International Workshop on Computational Terminology , Geneva , Switzerland , 29/08/2004 . en
dc.identifier.citation conference en
dc.identifier.other PURE: 10035474
dc.identifier.other PURE UUID: 254c1fad-7bef-425b-9a93-bb375db8f6fb
dc.identifier.other ORCID: /0000-0003-2337-303X/work/29934385
dc.identifier.uri http://hdl.handle.net/10138/33867
dc.description.abstract Discovering synonyms and other related words among the words in a document collection can be seen as a clustering problem, where we expect the words in a cluster to be closely related to one another. The intuition is that words occurring in similar contexts tend to convey similar meaning. We introduce a way to use translation dictionaries for several languages to evaluate the rate of synonymy found in the word clusters. We also apply the information radius to calculating similarities between words using a full dependency syntactic feature space, and introduce a method for similarity recalculation during clustering as a fast approximation of the high-dimensional feature space. Finally, we show that 69-79% of the words in the clusters we discover are useful for thesaurus construction. en
dc.language.iso eng
dc.relation.ispartof Proceedings of COLING 2004 CompuTerm 2004: 3rd International Workshop on Computational Terminology
dc.rights en
dc.subject 612 Languages and Literature en
dc.subject 113 Computer and information sciences en
dc.title Discovering Synonyms and Other Related Words en
dc.type Conference contribution
dc.type.uri info:eu-repo/semantics/other
dc.contributor.pbl
dc.contributor.pbl

Files in this item

Total number of downloads: Loading...

Files Size Format View
linden04b.pdf 110.9Kb PDF View/Open

This item appears in the following Collection(s)

Show simple item record