Title
An Interactive Web-Based Toolset for Knowledge Discovery from Short Text Log Data.
Abstract
Many companies maintain human-written logs to capture data on events such as workplace incidents and equipment failures. However, the sheer volume and unstructured nature of this data prevent it from being utilised for knowledge acquisition. Our web-based prototype software system provides a cohesive computational methodology for analysing and visualising log data that requires minimal human involvement. It features an interface to support customisable, modularised log data processing and knowledge discovery. This enables owners of eventbased datasets containing short textual descriptions, such as occupational health & safety officers and machine operators, to identify latent knowledge not previously acquirable without significant time and effort. The software system comprises five distinct stages, corresponding to standard data mining milestones: exploratory analysis, data warehousing, association rule mining, entity clustering, and predictive analysis. To the best of our knowledge, it is the first dedicated system to computationally analyse short text log data and provides a powerful interface that visualises the analytical results and supports human interaction.
Year
DOI
Venue
2017
10.1007/978-3-319-69179-4_61
ADVANCED DATA MINING AND APPLICATIONS, ADMA 2017
Keywords
Field
DocType
Knowledge discovery,Visualisation,Unstructured data mining
Data science,Data warehouse,Data mining,Visualization,Computer science,Software system,Association rule learning,Knowledge extraction,Web application,Cluster analysis,Knowledge acquisition
Conference
Volume
ISSN
Citations 
10604
0302-9743
0
PageRank 
References 
Authors
0.34
2
4
Name
Order
Citations
PageRank
Michael Stewart18414.83
Wei Liu225822.36
Rachel Cardell-Oliver327133.25
Mark Griffin400.34