Title
The Exploratory Labeling Assistant: Mixed-Initiative Label Curation with Large Document Collections.
Abstract
In this paper, we define the concept of exploratory labeling: the use of computational and interactive methods to help analysts categorize groups of documents into a set of unknown and evolving labels. While many computational methods exist to analyze data and build models once the data is organized around a set of predefined categories or labels, few methods address the problem of reliably discovering and curating such labels in the first place. In order to move first steps towards bridging this gap, we propose an interactive visual data analysis method that integrates human-driven label ideation, specification and refinement with machine-driven recommendations. The proposed method enables the user to progressively discover and ideate labels in an exploratory fashion and specify rules that can be used to automatically match sets of documents to labels. To support this process of ideation, specification, as well as evaluation of the labels, we use unsupervised machine learning methods that provide suggestions and data summaries. We evaluate our method by applying it to a real-world labeling problem as well as through controlled user studies to identify and reflect on patterns of interaction emerging from exploratory labeling activities.
Year
DOI
Venue
2018
10.1145/3242587.3242596
UIST '18: The 31st Annual ACM Symposium on User Interface Software and Technology Berlin Germany October, 2018
Keywords
Field
DocType
Exploratory Labeling, Text Analysis, Visualization, Document Labeling
Ideation,Categorization,Information retrieval,Computer science,Visualization,Bridging (networking),Human–computer interaction,Unsupervised learning,User studies,Labeling Problem
Conference
ISBN
Citations 
PageRank 
978-1-4503-5948-1
1
0.35
References 
Authors
28
3
Name
Order
Citations
PageRank
Cristian Felix1462.67
Aritra Dasgupta217512.02
Enrico Bertini3115457.38