Title
Cosine Similarity-Based Pruning for Concept Discovery.
Abstract
In this work we focus on improving the time efficiency of Inductive Logic Programming (ILP)-based concept discovery systems. Such systems have scalability issues mainly due to the evaluation of large search spaces. Evaluation of the search space cosists translating candidate concept descriptor into SQL queries, which involve a number of equijoins on several tables, and running them against the dataset. We aim to improve time efficiency of such systems by reducing the number of queries executed on a DBMS. To this aim, we utilize cosine similarity to measure the similarity of arguments that go through equijoins and prune those with 0 similarity. The proposed method is implemented as an extension to an existing ILP-based concept discovery system called Tabular Cris w-EF and experimental results show that the poposed method reduces the number of queries executed around 15 %.
Year
DOI
Venue
2016
10.1007/978-3-319-47217-1_10
Communications in Computer and Information Science
Field
DocType
Volume
Query optimization,SQL,Inductive logic programming,Cosine similarity,Computer science,Theoretical computer science,Concept Descriptor,Artificial intelligence,Machine learning,Scalability,Pruning,Distributed computing
Conference
659
ISSN
Citations 
PageRank 
1865-0929
0
0.34
References 
Authors
10
3
Name
Order
Citations
PageRank
Abdullah Dogan110.69
Alev Mutlu2227.19
Pinar Karagoz315428.34