Title
Multi-source deep learning for information trustworthiness estimation
Abstract
In recent years, information trustworthiness has become a serious issue when user-generated contents prevail in our information world. In this paper, we investigate the important problem of estimating information trustworthiness from the perspective of correlating and comparing multiple data sources. To a certain extent, the consistency degree is an indicator of information reliability--Information unanimously agreed by all the sources is more likely to be reliable. Based on this principle, we develop an effective computational approach to identify consistent information from multiple data sources. Particularly, we analyze vast amounts of information collected from multiple review platforms (multiple sources) in which people can rate and review the items they have purchased. The major challenge is that different platforms attract diverse sets of users, and thus information cannot be compared directly at the surface. However, latent reasons hidden in user ratings are mostly shared by multiple sources, and thus inconsistency about an item only appears when some source provides ratings deviating from the common latent reasons. Therefore, we propose a novel two-step procedure to calculate information consistency degrees for a set of items which are rated by multiple sets of users on different platforms. We first build a Multi-Source Deep Belief Network (MSDBN) to identify the common reasons hidden in multi-source rating data, and then calculate a consistency score for each item by comparing individual sources with the reconstructed data derived from the latent reasons. We conduct experiments on real user ratings collected from Orbitz, Priceline and TripAdvisor on all the hotels in Las Vegas and New York City. Experimental results demonstrate that the proposed approach successfully finds the hotels that receive inconsistent, and possibly unreliable, ratings.
Year
DOI
Venue
2013
10.1145/2487575.2487612
KDD
Keywords
Field
DocType
consistent information,multiple review platform,information world,information consistency degree,latent reason,information reliability,multiple source,information trustworthiness,information trustworthiness estimation,different platform,multiple data source,deep learning
Data science,Data mining,Multiple data,Trustworthiness,Computer science,Deep belief network,Artificial intelligence,Deep learning,Multi-source
Conference
Citations 
PageRank 
References 
16
0.86
18
Authors
4
Name
Order
Citations
PageRank
Liang Ge1816.73
Jing Gao22723131.05
Xiaoyi Li31009.25
Aidong Zhang42970405.63