Title
Identifying Authoritative and Reliable Contents in Community Question Answering with Domain Knowledge
Abstract
Community Question Answering (CQA) has emerged as a popular forum for users to ask and answer questions. Over the last few years, CQA portals such as Yahoo answersand Baidu Zhidao have exploded in popularity, and now provide a viable alternative to general purpose Web search. A number of answers submitted to address questions on CQA sites compose a valuable knowledge repository, which could be a gold mine for information retrieval as well as text mining. Two important questions in CQA research are focused on the quality of contents and the reputation of the answerers. Previous approaches for retrieving relevant and high quality content have been proposed, but not much work has been done on providing an integrated framework to solve these two problems. Besides, no research work has used both text and link information in their methods via leveraging existing ratings of answers and questions. In this paper, we present a novel approach to analyze questions and answers based on the topic modeling framework with Dirichlet forest priors (LDA-DF)[8]. We utilize information obtained from LDA-DF to construct a joint topical and link model to identify authorities and reliable answers on a CQA site.We evaluate our methods in a dataset obtained from Yahoo! Answers. With the new representation of topical structures on CQA datasets, using a limited amount of web resource, we show significant improvements over the state-of-art methods LDA-DF, LDA, and HLDA on performance of authority identification and answer ranking.
Year
DOI
Venue
2013
10.1007/978-3-642-40319-4_12
PAKDD Workshops
Field
DocType
Citations 
Web resource,Data mining,Question answering,Information retrieval,General purpose,Domain knowledge,Ranking,Computer science,Popularity,Topic model,Reputation
Conference
1
PageRank 
References 
Authors
0.36
26
2
Name
Order
Citations
PageRank
Lifan Guo1364.94
Xiaohua Hu22819314.15