Title
Integrating Structure in the Probabilistic Model for Information Retrieval
Abstract
In databases or in the World Wide Web, many documents are in a structured format (e.g. XML). We propose in this article to extend the classical IR probabilistic model in order to take into account the structure through the weighting of tags. Our approach includes a learning step in which the weight of each tag is computed. This weight estimates the probability that the tag distinguishes the terms which are the most relevant. Our model has been evaluated on a large collection during INEX IR evaluation campaigns.
Year
DOI
Venue
2008
10.1109/WIIAT.2008.346
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference
Keywords
Field
DocType
information retrieval,learning (artificial intelligence),probability,World Wide Web,classical information retrieval probabilistic model,document retrieval,integration structure,learning step,weight estimation,XML,probabilistic model,structure,tags
Data mining,Divergence-from-randomness model,IR evaluation,Weighting,XML,Information retrieval,Computer science,Search engine indexing,Statistical model,Probabilistic logic,The Internet
Conference
Volume
ISBN
Citations 
1
978-0-7695-3496-1
5
PageRank 
References 
Authors
0.49
18
3
Name
Order
Citations
PageRank
Mathias Gery150.49
Christine Largeron253.19
Franck Thollard322717.10