Abstract | ||
---|---|---|
In speech technology more and more databases of spoken language are becoming available. For research the availability of these data offers the possibility to study huge corpora. Apart from the fact that these corpora may be represented in different formats, it is sometimes difficult to relate annotations of one corpus to those of another corpus. This contribution argues for a representation of information in speech corpora that allows for the integrated representation of information on various levels of description in XML. Secondly, the study of huge amounts of speech data requires adequate retrieval mechanisms. A query architecture is described that allows for the retrieval of encoded entities by specifying their properties or various relations to other entities. The output of the query processor is represented in XML and thus can be used for further queries or a new level of description. The work presented here is part of the results of the MATE project (http://mate.mip.ou.dk). |
Year | Venue | Keywords |
---|---|---|
1999 | EUROSPEECH | information retrieval, xml |

Field | DocType | Citations |
---|---|---|
Architecture, XML, Information retrieval, Computer science, Natural language processing, Artificial intelligence, Reusability, Speech technology, Spoken language | Conference | 2 |

PageRank | References | Authors |
---|---|---|
0.82 | 2 | 2 |

Name | Order | Citations | PageRank |
---|---|---|---|
Andreas Mengel | 1 | 41 | 12.12 |
Ulrich Heid | 2 | 190 | 40.48 |