Title
Topic discovery and topic-driven clustering for audit method datasets
Abstract
As the promotion of China's Golden Auditing Project and the fast growth of on-line auditing, there are thousands of new computer audit methods emerged every year to fulfill various needs of audit practices. How to organize these existing computer audit methods and use them intelligently have become a fundamental and challenging problem. In this paper, we propose to use topic-driven clustering methods to organize computer audit methods according to the system of computer audit methods that is issued by the National Audit Office of China. We also apply Latent Dirichlet allocation (LDA) analysis to audit method datasets at different levels of granularity. Our experimental results on social insurance computer audit methods show that the topic-driven clustering scheme with topics created by domain experts is the overall best scheme. It achieved an average purity of 0.862 across the datasets. Topics discovered by LDA were consistent with classes defined in the taxonomy for four out of five datasets, and they were effective when used in the topic-driven clustering scheme.
Year
DOI
Venue
2011
10.1007/978-3-642-25856-5_26
ADMA
Keywords
Field
DocType
golden auditing project,method datasets,computer audit method,topic-driven clustering scheme,new computer audit method,overall best scheme,existing computer audit method,audit practice,topic discovery,topic-driven clustering method,audit method datasets,social insurance computer audit
Data science,Data mining,Latent Dirichlet allocation,Audit,Social insurance,Computer science,Artificial intelligence,Granularity,Cluster analysis,Machine learning
Conference
Volume
ISSN
Citations 
7121
0302-9743
0
PageRank 
References 
Authors
0.34
18
3
Name
Order
Citations
PageRank
Ying Zhao190249.19
Wanyu Fu200.34
Shaobin Huang3117.93