Title
Rule Discovery and Probabilistic Modeling for Onomastic Data
Abstract
The naming of natural features, such as hills, lakes, springs, meadows etc., provides a wealth of linguistic information; the study of the names and naming systems is called onomastics. We consider a data set containing all names and locations of about 58,000 lakes in Finland. Using computational techniques, we address two major onomastic themes. First, we address the existence of local dependencies or repulsion between occurrences of names. For this, we derive a simple form of spatial association rules. The results partially validate and partially contradict results obtained by traditional onomastic techniques. Second, we consider the existence of relatively homogeneous spatial regions with respect to the distributions of place names. Using mixture modeling, we conduct a global analysis of the data set. The clusterings of regions are spatially connected, and correspond quite well with the results obtained by other techniques; there are, however, interesting differences with previous hypotheses.
Year
DOI
Venue
2003
10.1007/978-3-540-39804-2_27
Lecture Notes in Artificial Intelligence
Keywords
Field
DocType
global analysis,mixture model,probabilistic model
Toponymy,Rule-based machine translation,Data mining,Homogeneous,Computer science,Onomastics,Association rule learning,Knowledge extraction,Probabilistic logic,Proper noun
Conference
Volume
ISSN
Citations 
2838
0302-9743
3
PageRank 
References 
Authors
0.52
2
3
Name
Order
Citations
PageRank
Antti Leino130.85
Heikki Mannila265951495.69
Ritva Liisa Pitkänen330.52