Title
On domain independence of author identification
Abstract
Latent Dirichlet Allocation (LDA) is a probabilistic framework by which we may assume each word carries probability distribution to each topic and a topic carries a distribution to each document. By putting all the documents together into one collection by each author, it is possible to identify authors. Here we show that author identification is fully reliable within a framework of LDA independent of documents domains by learning incomplete and massive documents.
Year
DOI
Venue
2011
10.1007/978-3-642-23878-9_2
IDEAL
Keywords
Field
DocType
probabilistic framework,documents domain,latent dirichlet allocation,massive document,domain independence,author identification,probability distribution,text mining
Latent Dirichlet allocation,Domain independence,Information retrieval,Computer science,Probability distribution,Probabilistic framework
Conference
Volume
ISSN
Citations 
6936
0302-9743
4
PageRank 
References 
Authors
0.51
4
2
Name
Order
Citations
PageRank
Masato Shirai141.52
Takao Miura26122.10