Abstract | ||
---|---|---|
Distances in biological databases are known not to be ultrametric. Deviations from ultrametricity can however reveal useful features of biodata. In the present study we examine deviations from ultrametricity of the distances between known phage proteins quantified in two senses: (1) the failure of triangles to be isosceles and (2) failure of every point to be the center of any sphere in which it resides. The deviations from these two ultrametric properties undergo qualitative changes as a function of the distance. Below we describe these changes and how they can be observed. We further argue that the distances at which the qualitative changes take place reveal intrinsic scales in the dataset. Such scales are important for choosing threshold values of the distance in various algorithms and reveal natural chuncking of the data that can be used to decide clade levels in phage phylogeny. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1142/S1230161209000062 | OPEN SYSTEMS & INFORMATION DYNAMICS |
Field | DocType | Volume |
Combinatorics,Biological database,Biodata,Chunking (psychology),Ultrametric space,Isosceles triangle,Mathematics | Journal | 16 |
Issue | ISSN | Citations |
1 | 1230-1612 | 0 |
PageRank | References | Authors |
0.34 | 1 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Chad Wagner | 1 | 0 | 0.34 |
Anna Salamon | 2 | 0 | 0.34 |
Robert Edwards | 3 | 186 | 18.91 |
Forest Rohwer | 4 | 109 | 12.51 |
Peter Salamon | 5 | 70 | 26.39 |