Title
Decisions in thesaurus construction and use
Abstract
A thesaurus and an ontology provide a set of structured terms, phrases, and metadata, often in a hierarchical arrangement, that may be used to index, search, and mine documents. We describe the decisions that should be made when including a term, deciding whether a term should be subdivided into its subclasses, or determining which of more than one set of possible subclasses should be used. Based on retrospective measurements or estimates of future performance when using thesaurus terms in document ordering, decisions are made so as to maximize performance. These decisions may be used in the automatic construction of a thesaurus. The evaluation of an existing thesaurus is described, consistent with the decision criteria developed here. These kinds of user-focused decision-theoretic techniques may be applied to other hierarchical applications, such as faceted classification systems used in information architecture or the use of hierarchical terms in ''breadcrumb navigation''.
Year
DOI
Venue
2007
10.1016/j.ipm.2006.08.011
Inf. Process. Manage.
Keywords
Field
DocType
existing thesaurus,structured term,controlled vocabulary,hierarchical term,performance measurement,thesaurus construction,possible subclasses,breadcrumb navigation,evaluation,thesaurus,ontology,automatic construction,future performance,hierarchical arrangement,hierarchical application,thesaurus term,indexation,information architecture
Ontology,Data mining,Metadata,Multiple-criteria decision analysis,Information retrieval,Computer science,Information architecture,Systems design,Controlled vocabulary,Performance measurement,Faceted classification
Journal
Volume
Issue
ISSN
43
4
Information Processing and Management
Citations 
PageRank 
References 
4
0.44
15
Authors
1
Name
Order
Citations
PageRank
Robert M. Losee127636.01