Abstract | ||
---|---|---|
The goal of metadata extraction (MDE) is to enable technology that can take raw speech-to-text output and refine it into forms that are more useful to humans and to downstream automatic processes. Starting in 2003, a structural metadata annotation task was defined for English as part of the DARPA EARS Program. A significant new challenge for MDE is the addition of new languages. This paper reports on work undertaken to apply MDE annotation to data from three very different languages: Mandarin Chinese, Levantine Arabic, and conversational Czech. Details of annotation task modifications are provided for each language; along with a general overview of data and annotation tools for non-English MDE. |
Year | Venue | Keywords |
---|---|---|
2005 | INTERSPEECH | speech to text,mandarin chinese |
Field | DocType | Citations |
Metadata,Czech,Annotation,Information retrieval,Arabic,Computer science,Image retrieval,Speech recognition,Artificial intelligence,Natural language processing,Mandarin Chinese | Conference | 3 |
PageRank | References | Authors |
0.48 | 3 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Stephanie Strassel | 1 | 512 | 58.41 |
Jáchym Kolár | 2 | 16 | 3.30 |
Zhiyi Song | 3 | 23 | 6.94 |
Leila Barclay | 4 | 3 | 0.48 |
Meghan Lammie Glenn | 5 | 17 | 4.77 |