Title
Structural metadata annotation: moving beyond English
Abstract
The goal of metadata extraction (MDE) is to enable technology that can take raw speech-to-text output and refine it into forms that are more useful to humans and to downstream automatic processes. Starting in 2003, a structural metadata annotation task was defined for English as part of the DARPA EARS Program. A significant new challenge for MDE is the addition of new languages. This paper reports on work undertaken to apply MDE annotation to data from three very different languages: Mandarin Chinese, Levantine Arabic, and conversational Czech. Details of annotation task modifications are provided for each language; along with a general overview of data and annotation tools for non-English MDE.
Year
Venue
Keywords
2005
INTERSPEECH
speech to text,mandarin chinese
Field
DocType
Citations 
Metadata,Czech,Annotation,Information retrieval,Arabic,Computer science,Image retrieval,Speech recognition,Artificial intelligence,Natural language processing,Mandarin Chinese
Conference
3
PageRank 
References 
Authors
0.48
3
5
Name
Order
Citations
PageRank
Stephanie Strassel151258.41
Jáchym Kolár2163.30
Zhiyi Song3236.94
Leila Barclay430.48
Meghan Lammie Glenn5174.77