Structural metadata annotation: moving beyond English - Citegraph

Paper Info

Title
Structural metadata annotation: moving beyond English

Abstract
The goal of metadata extraction (MDE) is to enable technology that can take raw speech-to-text output and refine it into forms that are more useful to humans and to downstream automatic processes. Starting in 2003, a structural metadata annotation task was defined for English as part of the DARPA EARS Program. A significant new challenge for MDE is the addition of new languages. This paper reports on work undertaken to apply MDE annotation to data from three very different languages: Mandarin Chinese, Levantine Arabic, and conversational Czech. Details of annotation task modifications are provided for each language; along with a general overview of data and annotation tools for non-English MDE.

Year	Venue	Keywords
2005	INTERSPEECH	speech to text,mandarin chinese
Field	DocType	Citations
Metadata,Czech,Annotation,Information retrieval,Arabic,Computer science,Image retrieval,Speech recognition,Artificial intelligence,Natural language processing,Mandarin Chinese	Conference	3
PageRank	References	Authors
0.48	3	5

Authors (5 rows)

Cited by (3 rows)

References (3 rows)

Name	Order	Citations	PageRank
Stephanie Strassel	1	512	58.41
Jáchym Kolár	2	16	3.30
Zhiyi Song	3	23	6.94
Leila Barclay	4	3	0.48
Meghan Lammie Glenn	5	17	4.77

1