Abstract | ||
---|---|---|
In this paper, we propose a new framework for searchable web site recommendation. Given a query, our system recommends a list of searchable web sites ranked by relevance, which can be used to complement the web page results and ads from a search engine. We model the conditional probability of a searchable web site being relevant to a given query in terms of three main components: the language model of the query, the language model of the content within the web site, and the reputation of the web site's search capability (static rank). The language models for queries and searchable sites are built using information mined from client-side browsing logs. The static rank for each searchable site leverages features extracted from these client-side logs, such as the number of queries submitted to the site, and features extracted from general search engines, such as the number of web pages indexed for the site, the number of clicks per query, and the dwell time that a user spends on the search result page and on the clicked result web pages. We also learn a weight for each kind of feature to optimize the ranking performance. In our experiment, we discover 10.5 thousand searchable sites and use 5 million unique queries, extracted from one week of log data, to build our searchable web site recommendation system and demonstrate its effectiveness. |
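
The abstract describes scoring a site for a query by combining language models with a static rank built from weighted features. The sketch below is only an illustration of that general idea, not the paper's actual formulation: the add-alpha smoothed unigram model, the log-scaled feature sum, the hand-set weights (the paper learns them), the feature names, and the `*.example` site names are all assumptions made here.

```python
import math
from collections import Counter


def site_language_model(queries_sent_to_site):
    """Unigram term counts over queries observed for one site (assumed model)."""
    counts = Counter()
    for q in queries_sent_to_site:
        counts.update(q.lower().split())
    return counts


def query_log_likelihood(query, counts, vocab_size, alpha=1.0):
    """log P(query | site) under an add-alpha smoothed unigram model."""
    total = sum(counts.values())
    return sum(
        math.log((counts[t] + alpha) / (total + alpha * vocab_size))
        for t in query.lower().split()
    )


def static_rank(features, weights):
    """Weighted sum of log-scaled, query-independent site features."""
    return sum(weights[name] * math.log1p(value) for name, value in features.items())


def rank_sites(query, sites, weights, lm_weight=1.0):
    """Return (site, score) pairs sorted from most to least relevant."""
    vocab = set()
    for s in sites.values():
        vocab.update(s["lm"])
    vocab_size = max(len(vocab), 1)
    scored = []
    for name, s in sites.items():
        lm_score = query_log_likelihood(query, s["lm"], vocab_size)
        scored.append((name, lm_weight * lm_score + static_rank(s["features"], weights)))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy data; real inputs would come from browsing and search logs.
sites = {
    "recipes.example": {
        "lm": site_language_model(["chicken soup recipe", "easy pasta recipe"]),
        "features": {"queries_submitted": 120, "pages_indexed": 5000},
    },
    "movies.example": {
        "lm": site_language_model(["new movie trailer", "movie showtimes"]),
        "features": {"queries_submitted": 150, "pages_indexed": 4000},
    },
}
weights = {"queries_submitted": 0.5, "pages_indexed": 0.1}  # assumed, not learned

ranking = rank_sites("pasta recipe", sites, weights)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In this toy setup the query terms match the recipe site's language model, so it ranks first even though the other site has a slightly higher static rank; a learned weighting could shift that balance.
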
Year | DOI | Venue |
---|---|---|
2011 | 10.1145/1935826.1935890 | Web Search and Data Mining |
Keywords | Field | DocType |
web page result,searchable web sites recommendation,vertical search engines,web page,language model,web site,searchable site,searchable web site,result web page,searchable web site recommendation,static rank,dwell time,web pages,feature extraction,conditional probability,recommender system,indexation,search engine | Static web page,Web search engine,Web search query,Data mining,Site map,World Wide Web,Web page,Information retrieval,Semantic search,Computer science,Web query classification,Web service | Conference |
Citations | PageRank | References |
2 | 0.42 | 19 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yang Song | 1 | 2338 | 128.89 |
Nam Nguyen | 2 | 331 | 16.64 |
Li-wei He | 3 | 1943 | 165.91 |
Scott Imig | 4 | 2 | 0.42 |
Robert Rounthwaite | 5 | 314 | 50.09 |