Similarity word-sequence kernels for sentence clustering - Citegraph

Paper Info

Title
Similarity word-sequence kernels for sentence clustering

Abstract
In this paper, we present a novel clustering approach based on the use of kernels as similarity functions and the C-means algorithm. Several word-sequence kernels are defined and extended to verify the properties of similarity functions. Afterwards, these monolingual wordsequence kernels are extended to bilingual word-sequence kernels, and applied to the task of monolingual and bilingual sentence clustering. The motivation of this proposal is to group similar sentences into clusters so that specialised models can be trained for each cluster, with the purpose of reducing in this way both the size and complexity of the initial task.We provide empirical evidence for proving that the use of bilingual kernels can lead to better clusters, in terms of intra-cluster perplexities.

Year	DOI	Venue
2010	10.1007/978-3-642-14980-1_60	SSPR/SPR
Keywords	Field	DocType
initial task,similarity word-sequence kernel,bilingual sentence clustering,similarity function,bilingual kernel,better cluster,c-means algorithm,monolingual wordsequence kernel,bilingual word-sequence kernel,word-sequence kernel,empirical evidence	Cluster (physics),Pattern recognition,Sentence clustering,Machine translation,Support vector machine,Natural language processing,Artificial intelligence,Cluster analysis,Mathematics	Conference
Volume	ISSN	ISBN
6218	0302-9743	3-642-14979-0
Citations	PageRank	References
1	0.36	14
Authors
3

Authors (3 rows)

Cited by (1 rows)

References (14 rows)

Name	Order	Citations	PageRank
Jesús Andrés-Ferrer	1	73	7.52
Germán Sanchis-Trilles	2	101	16.95
francisco casacuberta	3	1439	161.33

1