A Constrained Randomization Approach to Interactive Visual Data Exploration with Subjective Feedback

Show full item record



Permalink

http://hdl.handle.net/10138/321140

Citation

Kang , B , Puolamäki , K , Lijffijt , J & Bie , T D 2020 , ' A Constrained Randomization Approach to Interactive Visual Data Exploration with Subjective Feedback ' , IEEE Transactions on Knowledge and Data Engineering , vol. 32 , no. 9 , pp. 1666-1679 . https://doi.org/10.1109/TKDE.2019.2907082

Title: A Constrained Randomization Approach to Interactive Visual Data Exploration with Subjective Feedback
Author: Kang, Bo; Puolamäki, Kai; Lijffijt, Jefrey; Bie, Tijl de
Contributor: University of Helsinki, Department of Computer Science
Date: 2020-09
Language: eng
Number of pages: 14
Belongs to series: IEEE Transactions on Knowledge and Data Engineering
ISSN: 1041-4347
URI: http://hdl.handle.net/10138/321140
Abstract: Data visualization and iterative/interactive data mining are growing rapidly in attention, both in research as well as in industry. However, while there are a plethora of advanced data mining methods and lots of works in the field of visualization, integrated methods that combine advanced visualization and/or interaction with data mining techniques in a principled way are rare. We present a framework based on constrained randomization which lets users explore high-dimensional data via 'subjectively informative' two-dimensional data visualizations. The user is presented with 'interesting' projections, allowing users to express their observations using visual interactions that update a background model representing the user's belief state. This background model is then considered by a projection-finding algorithm employing data randomization to compute a new 'interesting' projection. By providing users with information that contrasts with the background model, we maximize the chance that the user encounters striking new information present in the data. This process can be iterated until the user runs out of time or until the difference between the randomized and the real data is insignificant. We present two case studies, one controlled study on synthetic data and another on census data, using the proof-of-concept tool SIDE that demonstrates the presented framework.
Subject: 113 Computer and information sciences
Data visualization
Data models
Computational modeling
Data mining
Reactive power
Visualization
Tools
Exploratory data mining
dimensionality reduction
data randomization
subjective interestingness
NONLINEAR DIMENSIONALITY REDUCTION
LINES
FIT
Rights:


Files in this item

Total number of downloads: Loading...

Files Size Format View
08693735.pdf 6.060Mb PDF View/Open

This item appears in the following Collection(s)

Show full item record