Title
A Simple Algorithm for Topic Identification in 0-1 Data
Abstract
Topics in 0-1 datasets are sets of variables whose occurrences are positively connected together. Earlier, we described a simple generative topic model. In this paper we show that, given data produced by this model, the lift statistics of attributes can be described in matrix form. We use this result to obtain a simple algorithm for finding topics in 0-1 data. We also show that a problem related to the identification of topics is NP-hard. We give experimental results on the topic identification problem, both on generated and real data.
Year
DOI
Venue
2003
10.1007/978-3-540-39804-2_38
Lecture Notes in Artificial Intelligence
Field
DocType
Volume
Data mining,Computer science,Latent variable model,Non-negative matrix factorization,Artificial intelligence,Binary data,SIMPLE algorithm,Topic model,System identification,Latent semantic analysis,Machine learning,Parameter identification problem
Conference
2838
ISSN
Citations 
PageRank 
0302-9743
12
1.30
References 
Authors
21
3
Name
Order
Citations
PageRank
Jouni K. Seppänen11249.09
Ella Bingham291758.70
Heikki Mannila365951495.69