Title
Information Retrieval on the Blogosphere
Abstract
Blogs have recently emerged as a new open, rapidly evolving and reactive publishing medium on the Web. Rather than managed by a central entity, the content on the blogosphere — the collection of all blogs on the Web — is produced by millions of independent bloggers, who can write about virtually anything. This open publishing paradigm has led to a growing mass of user-generated content on the Web, which can vary tremendously both in format and quality when looked at in isolation, but which can also reveal interesting patterns when observed in aggregation. One field particularly interested in studying how information is produced, consumed, and searched in the blogosphere is information retrieval. In this survey, we review the published literature on searching the blogosphere. In particular, we describe the phenomenon of blogging and the motivations for searching for information on blogs. We cover both the search tasks underlying blog searchers' information needs and the most successful approaches to these tasks. These include blog post and full blog search tasks, as well as blog-aided search tasks, such as trend and market analysis. Finally, we also describe the publicly available resources that support research on searching the blogosphere.
Year
2012
DOI
10.1561/1500000026
Venue
Foundations and Trends in Information Retrieval
Keywords
information retrieval, full blog search task, information need, blog post, reactive publishing medium, user-generated content, open publishing paradigm, search task, blog-aided search task, blog searcher
Field
World Wide Web, Market analysis, Information needs, Information retrieval, Computer science, Open publishing, Publishing, Blogosphere
DocType
Journal
Volume
6
Issue
1
ISSN
1554-0669
Citations
14
PageRank
0.53
References
169
Authors
5
Name                  Order  Citations  PageRank
Rodrygo L.T. Santos   1      883        46.30
Craig Macdonald       2      2588       178.50
Richard Mccreadie     3      403        32.43
Iadh Ounis            4      3438       234.59
Ian Soboroff          5      1907       218.39