Title
New Features in Spoken Language Search Hawk (SpLaSH): Query Language and Query Sequence
Abstract
In this work we present further development of the SpLaSH (Spoken Language Search Hawk) project. SpLaSH implements a data model for annotated speech corpora integrated with textual markup (e.g. POS tagging, syntax, pragmatics), together with a toolkit for performing complex queries across speech and text labels. Time-aligned annotations (TMA), represented as Annotation Graphs, are integrated with text-aligned annotations (TXA), stored in generic XML files, through a data structure called the Connector Frame, which acts as a lookup table linking temporal data to words in the text. SpLaSH imposes very few constraints on the data model design, so annotations developed separately and without any mutual dependency can be integrated within the same dataset. It also provides a GUI supporting three types of queries: simple queries on TXA or TMA structures, sequence queries on TMA structures, and cross queries on the integrated TXA and TMA structures. This work presents two new SpLaSH features: the SpLaSH Query Language (SpLaSHQL) and Query Sequence.
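The Connector Frame idea described in the abstract can be illustrated with a minimal sketch. This is not SpLaSH code or SpLaSHQL syntax: the class names (`Token`, `Arc`), the `connector` dictionary, the `cross_query` function, and all sample data are hypothetical, chosen only to show how a lookup table from text tokens to time spans could support a cross query over both annotation types.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    """Text-aligned annotation (TXA), e.g. one word element of an XML file."""
    token_id: str
    word: str
    pos: str


@dataclass(frozen=True)
class Arc:
    """Time-aligned annotation (TMA): one labelled arc of an annotation graph."""
    start: float
    end: float
    tier: str
    label: str


# Hypothetical sample data (not taken from any SpLaSH corpus).
tokens = [
    Token("w1", "the", "DET"),
    Token("w2", "hawk", "NOUN"),
    Token("w3", "flies", "VERB"),
]
arcs = [
    Arc(0.00, 0.12, "word", "the"),
    Arc(0.12, 0.48, "word", "hawk"),
    Arc(0.48, 0.95, "word", "flies"),
    Arc(0.20, 0.40, "accent", "H*"),
]

# Assumed shape of a connector frame: token id -> (start, end) in seconds.
connector = {"w1": (0.00, 0.12), "w2": (0.12, 0.48), "w3": (0.48, 0.95)}


def overlaps(a_start, a_end, b_start, b_end):
    """True when the two half-open time intervals share some duration."""
    return a_start < b_end and b_start < a_end


def cross_query(pos, tier, label):
    """Return (word, start, end) for tokens with the given POS tag whose
    time span overlaps a TMA arc with the given tier and label."""
    hits = []
    for tok in tokens:
        if tok.pos != pos or tok.token_id not in connector:
            continue
        start, end = connector[tok.token_id]
        if any(
            a.tier == tier and a.label == label
            and overlaps(start, end, a.start, a.end)
            for a in arcs
        ):
            hits.append((tok.word, start, end))
    return hits


print(cross_query("NOUN", "accent", "H*"))  # [('hawk', 0.12, 0.48)]
```

In this sketch the text-side and time-side annotations never reference each other directly; only the lookup table connects them, which mirrors the abstract's claim that separately developed annotations can share one dataset.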
Year
2010
Venue
LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
Keywords
query language, temporal data, data model, data structure
Field
Query optimization, Web search query, RDF query language, Query language, Query expansion, Computer science, Web query classification, Query by Example, Natural language processing, Artificial intelligence, Object Query Language
DocType
Conference
Citations
0
PageRank
0.34
References
8
Authors
2
Name, Order, Citations, PageRank
Sara Romano, 1, 22, 2.75
Francesco Cutugno, 2, 76, 18.01