Title
How "small" reflects "large"? - Representative information measurement and extraction.
Abstract
While web services avail a rapid growth of data volume for use, identifying helpful information is of great value, especially when users face with an unwilling glut of information. Thus, it is deemed relevant and meaningful to provide users with a representative subset (i.e., small set) that could well reflect the original information corpus (i.e., large set). In such a large–small context, this paper addresses the issues of representativeness in light of measurement and extraction by reviewing our previous efforts. Specifically, we first discuss various metrics from different perspectives of representativeness, then present a series of related representativeness extraction methods. Finally as a supplement and extension, a recent effort is introduced, which aims to take information quality into account in deriving a ranked subset. The proposed extraction method is justified by extensive real-world data experiments, showing its superiority to others in both effectiveness and efficiency.
Year
DOI
Venue
2018
10.1016/j.ins.2017.08.096
Information Sciences
Keywords
Field
DocType
Representative,Coverage,Redundancy,Consistency,Compactness,Extraction
Ranking,Information retrieval,Computer science,Representativeness heuristic,Redundancy (engineering),Web service,Small set,Information quality
Journal
Volume
ISSN
Citations 
460
0020-0255
0
PageRank 
References 
Authors
0.34
35
5
Name
Order
Citations
PageRank
Guoqing Chen191271.58
Cong Wang24920.13
Mingyue Zhang344.14
Qiang Wei428426.14
Baojun Ma5477.38