Title
Probabilistic Latent Semantic Analysis for Search and Mining of Corporate Blogs
Abstract
Blogs, or weblogs, have rapidly gained in popularity over the past decade. Because of the huge volume of existing blog posts, information in the blogosphere is difficult to access and retrieve. Existing studies have focused on analyzing personal blogs, but few have looked at corporate blogs, the numbers of which are dramatically rising. In this paper, we use probabilistic latent semantic analysis to detect keywords from corporate blogs with respect to certain topics. We then demonstrate how this method can represent the blogosphere in terms of topics with measurable keywords, hence tracking popular conversations and topics in the blogosphere. By applying a probabilistic approach, we can improve information retrieval in blog search and keywords detection, and provide an analytical foundation for the future of corporate blog search and mining.
Year
Venue
Keywords
2007
DMBiz@PAKDD
blog search,corporate blogs,probabilistic latent semantic analysis,information retrieval,corporate blog search,analytical foundation,blog post,keywords detection,personal blogs,probabilistic approach,web mining
Field
DocType
Citations 
World Wide Web,Web mining,Computer science,Popularity,Probabilistic latent semantic analysis,Blogosphere,Probabilistic logic,Spam blog
Conference
2
PageRank 
References 
Authors
0.45
8
3
Name
Order
Citations
PageRank
Flora S. Tsai135223.96
Yun Chen2492.53
Kap Luk Chan3103977.99