Title
Dimensionality reduction for blog tag mining
Abstract
Blog tags are labels of blog documents that classify them into different categories. Most tags are user-generated, which create problems such as inconsistencies in tags across different users, blogs without tags, lack of descriptive tags, lack of semantic distinction, etc. In this paper, we utilise dimensionality reduction techniques to reduce the inherent noise in blog tags. A tag-topic model is combined with dimensionality reduction, and then evaluated on real-world blog data. By employing dimensionality reduction techniques to reduce the document-tag space, better classification results were achieved. This indicates that the noise in tags can be effectively reduced by representing the original set of tags with a smaller number of latent tags, which can lead to more accurate real-time categorisation of blog documents.
Year
DOI
Venue
2011
10.1504/IJWET.2011.040726
Int. J. Web Eng. Technol.
Keywords
DocType
Volume
blog tag mining,dimensionality reduction technique,accurate real-time categorisation,different user,blog document,Dimensionality reduction,Blog tag,real-world blog data,better classification result,inherent noise,different category,dimensionality reduction
Journal
6
Issue
Citations 
PageRank 
3
2
0.37
References 
Authors
19
1
Name
Order
Citations
PageRank
Flora S. Tsai135223.96