Abstract | ||
---|---|---|
This paper describes a conceptual and theoretical framework to allow better user control over data summarization for knowledge discovery. Basic to the approach is a measure of quality of summarization of data using categories provided by the hierarchical structure of concept ontology. This involves the modeling, using a fuzzy sets approach, of the four criteria implicit in a summarization imperative: minimum coverage, minimum relevance, succinctness, and usefulness. With these criteria modeled, a multicriteria approach is presented, using a decision function aggregating these criteria that provides an overall quality measure to guide the summarization of the data. The development of the theory is first presented for the simple case of a single attribute to clearly delineate the basic issues and approach and then extended to multiple attributes. Finally, approaches to provide a more user-oriented presentation of the summarized data are considered |
Year | DOI | Venue |
---|---|---|
2006 | 10.1109/TFUZZ.2006.879954 | IEEE T. Fuzzy Systems |
Keywords | Field | DocType |
data summarization,fuzzy sets approach,overall quality measure,summarization imperative,better user control,multicriteria approach,basic issue,minimum relevance,summarized data,concept ontologies,minimum coverage,fuzzy set theory,fuzzy set,data analysis,knowledge discovery,data mining,fuzzy sets | Ontology (information science),Automatic summarization,Data mining,Ontology,User control,Computer science,Succinctness,Fuzzy set,Artificial intelligence,Knowledge extraction,Vocabulary,Machine learning | Journal |
Volume | Issue | ISSN |
14 | 6 | 1063-6706 |
Citations | PageRank | References |
22 | 0.97 | 21 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ronald R. Yager | 1 | 986 | 206.03 |
F. E. Petry | 2 | 184 | 19.59 |