Rule Discovery and Probabilistic Modeling for Onomastic Data - Citegraph

Paper Info

Title
Rule Discovery and Probabilistic Modeling for Onomastic Data

Abstract
The naming of natural features, such as hills, lakes, springs, meadows etc., provides a wealth of linguistic information; the study of the names and naming systems is called onomastics. We consider a data set containing all names and locations of about 58,000 lakes in Finland. Using computational techniques, we address two major onomastic themes. First, we address the existence of local dependencies or repulsion between occurrences of names. For this, we derive a simple form of spatial association rules. The results partially validate and partially contradict results obtained by traditional onomastic techniques. Second, we consider the existence of relatively homogeneous spatial regions with respect to the distributions of place names. Using mixture modeling, we conduct a global analysis of the data set. The clusterings of regions are spatially connected, and correspond quite well with the results obtained by other techniques; there are, however, interesting differences with previous hypotheses.

Year	DOI	Venue
2003	10.1007/978-3-540-39804-2_27	Lecture Notes in Artificial Intelligence
Keywords	Field	DocType
global analysis,mixture model,probabilistic model	Toponymy,Rule-based machine translation,Data mining,Homogeneous,Computer science,Onomastics,Association rule learning,Knowledge extraction,Probabilistic logic,Proper noun	Conference
Volume	ISSN	Citations
2838	0302-9743	3
PageRank	References	Authors
0.52	2	3

Authors (3 rows)

Cited by (3 rows)

References (2 rows)

Name	Order	Citations	PageRank
Antti Leino	1	3	0.85
Heikki Mannila	2	6595	1495.69
Ritva Liisa Pitkänen	3	3	0.52

1