Title
Finding Maximal Common Sub-parse Thickets for Multi-sentence Search.
Abstract
We develop a graph representation and learning technique for the parse structures of paragraphs of text. We introduce the Parse Thicket (PT), a set of syntactic parse trees augmented by arcs for inter-sentence word-word relations such as co-reference and taxonomic relations. These arcs are also derived from other sources, including Speech Act and Rhetorical Structure theories. We provide a detailed illustration of how PTs are built from parse trees and generalized as phrases by computing maximal common subgraphs. The proposed approach is evaluated in the product search and recommendation domain, where search queries include multiple sentences. We compare the search relevance improvement achieved by pair-wise sentence generalization, phrase-level generalization, and generalization of PTs as graphs.
Year: 2013
DOI: 10.1007/978-3-319-04534-4_4
Venue: GKR
Field: Graph, Parse tree, Generalization, Theoretical computer science, Suffix tree, Parsing, Syntax, Sentence, Mathematics, Graph (abstract data type)
DocType: Conference
Volume: 8323
ISSN: 0302-9743
Citations: 1
PageRank: 0.39
References: 27
Authors: 4
Name | Order | Citations | PageRank
Boris Galitsky | 1 | 248 | 37.81
Dmitry I. Ilvovsky | 2 | 14 | 7.38
Sergei O. Kuznetsov | 3 | 1630 | 121.46
Fedor Strok | 4 | 14 | 2.08