Treffer: A Web-Page Summarization for Just-in-Time Contextual Advertising

Title:
A Web-Page Summarization for Just-in-Time Contextual Advertising
Contributors:
The Pennsylvania State University CiteSeerX Archives
Collection:
CiteSeerX
Subject Terms:
Document Type:
Fachzeitschrift text
File Description:
application/pdf
Language:
English
Rights:
Metadata may be used without restrictions as long as the oai identifier remains attached to it.
Accession Number:
edsbas.6859F536
Database:
BASE

Weitere Informationen

Contextual advertising is a type of Web advertising, which, given the URL of a Web page, aims to embed into the page the most relevant textual ads available. For static pages that are displayed repeatedly, the matching of ads can be based on prior analysis of their entire content; however, often ads need to be matched to new or dynamically created pages that cannot be processed ahead of time. Analyzing the entire content of such pages on-the-fly entails prohibitive communication and latency costs. To solve the threehorned dilemma of either low-relevance or high-latency or high-load, we propose to use text-summarization techniques paired with external knowledge (exogenous to the page) to craft short page summaries in real time. Empiricalevaluation proves that matching ads on the basis of such summaries does not sacrifice relevance, and is competitive with matching based on the entire page content. Specifically, we found that analyzing a carefully selected 6 % fraction of the page text can sacrifice only 1%–3 % in ad relevance. Furthermore, our summaries are fully compatible with the standard JavaScript mechanisms used for ad placement: they can be produced at ad-display time by simple additions to the usual script, and they only add 500–600 bytes to the usual request. We also compared our summarization approach, which is based on structural properties of the HTML content of the page, with a more principled one based on one of the standard text