Title
Assessing web article quality by harnessing collective intelligence
Abstract
Existing approaches assess web article's quality mainly based on syntax, but seldom work is given on how to quantify its quality based on semantics. In this paper we propose a novel Semantic Quality Assessment(SQA) approach to automatically determine data quality in terms of two most important quality dimensions, namely accuracy and completeness. First, alternative context with respect to source article is built by collecting alternative web articles. Second, each alternative article is transformed and represented by semantic corpus and dimension baselines are synthetically generated from these semantic corpora. Finally, quality dimension of source article is determined by comparing its semantic corpus with dimension baseline. Our approach is promising way to assess web article quality by exploiting available collective knowledge. Experiments show that our approach performs well.
Year
DOI
Venue
2012
10.1007/978-3-642-29038-1_31
DASFAA
Keywords
Field
DocType
alternative web article,semantic corpus,quality dimension,collective intelligence,important quality dimension,web article,web article quality,alternative context,source article,alternative article,data quality,assessing web article quality
Data mining,Latent Dirichlet allocation,Data quality,Information retrieval,Computer science,Collective intelligence,Baseline (configuration management),Social Semantic Web,Latent semantic analysis,Syntax,Database,Semantics
Conference
Citations 
PageRank 
References 
0
0.34
14
Authors
4
Name
Order
Citations
PageRank
Jingyu Han1164.67
Xueping Chen200.34
Kejia Chen317915.82
Dawei Jiang438021.67