Title
Web Page Summarization for Just-in-Time Contextual Advertising
Abstract
Contextual advertising is a type of Web advertising, which, given the URL of a Web page, aims to embed into the page the most relevant textual ads available. For static pages that are displayed repeatedly, the matching of ads can be based on prior analysis of their entire content; however, often ads need to be matched to new or dynamically created pages that cannot be processed ahead of time. Analyzing the entire content of such pages on-the-fly entails prohibitive communication and latency costs. To solve the three-horned dilemma of either low relevance or high latency or high load, we propose to use text summarization techniques paired with external knowledge (exogenous to the page) to craft short page summaries in real time. Empirical evaluation proves that matching ads on the basis of such summaries does not sacrifice relevance, and is competitive with matching based on the entire page content. Specifically, we found that analyzing a carefully selected 6% fraction of the page text can sacrifice only 1%--3% in ad relevance. Furthermore, our summaries are fully compatible with the standard JavaScript mechanisms used for ad placement: they can be produced at ad-display time by simple additions to the usual script, and they only add 500--600 bytes to the usual request. We also compared our summarization approach, which is based on structural properties of the HTML content of the page, with a more principled one based on one of the standard text summarization tools (MEAD), and found their performance to be comparable.
Year
DOI
Venue
2011
10.1145/2036264.2036278
ACM TIST
Keywords
Field
DocType
low relevance,ad-display time,static page,page text,just-in-time contextual advertising,entire page content,ad relevance,html content,short page summary,entire content,web page,web page summarization,real time,text summarization,web pages
Static web page,Automatic summarization,Printer-friendly,Byte,Contextual advertising,Web page,Information retrieval,Computer science,Artificial intelligence,Page view,Machine learning,JavaScript
Journal
Volume
Issue
ISSN
3
1
2157-6904
Citations 
PageRank 
References 
7
0.55
46
Authors
5
Name
Order
Citations
PageRank
Aris Anagnostopoulos1105467.08
Andrei Broder27357920.20
Evgeniy Gabrilovich34573224.48
Vanja Josifovski42265148.84
Lance Riedel545419.42