Vol 57 (2023) N 2 p. 176-181; DOI 10.1134/S002689332302005X

V.M. Efimov1,2,3,4*, K.V. Efimov5, V.Yu. Kovaleva2

Geometric Approach to Phylogeographic Analysis of Molecular Genetic Sequences: Principal Components and Dendrograms

1Institute of Cytology and Genetics, Siberian Branch, Russian Academy of Sciences, Novosibirsk, 630090 Russia
2Institute of Systematics and Ecology of Animals, Siberian Branch, Russian Academy of Sciences, Novosibirsk, 630091 Russia
3Novosibirsk State University, Novosibirsk, 630090 Russia
4Tomsk State University, Tomsk, 634050 Russia
5Higher School of Economics, Moscow, 101000 Russia
*efimov@bionet.nsc.ru

Received - 2022-07-27; Revised - 2022-07-27; Accepted - 2022-08-21

Currently, the search for signs of environmentally driven selection in molecular sequences is usually carried out among closely related species or at the intraspecific level. At higher taxonomic levels such a search is believed to be unpromising because of phylogenetic relatedness. Cytochrome b amino acid sequences of 67 rodent and lagomorph species with known geographic coordinates were digitized using the AAindex database. Principal components were obtained from more than 200 thousand characters. A well-known statistical method that had not previously been applied to such problems was used; it orthogonally decomposes multidimensional variability into intra- and intertaxon components, which can then be analyzed separately. The subfamily level was selected. The second principal component (17.05% of intertaxon variability) correlated with latitude (r = 0.561; n = 67; p < 10^-5). The first principal component (39.48% of intertaxon variability) revealed a clear division into two groups that does not coincide with the taxonomic division, which points to a possible physicochemical cause underlying the differences between them. This requires further research.

Keywords: Rodentia, cytochrome b, Euclidean distances, cluster analysis, orthogonal decomposition, geographic coordinates
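
The orthogonal decomposition into intra- and intertaxon variability can be illustrated with a minimal sketch. The abstract does not name the statistical method, so the code below assumes a standard one-way between/within split around taxon centroids; this is an assumption, not necessarily the authors' procedure. The data are synthetic (12 hypothetical species in 3 taxa, 5 numeric characters), the AAindex digitization step is not reproduced, and the function names `split_by_taxon`, `pca_scores` and `t_from_r` are hypothetical. The last line only recomputes the t statistic implied by the reported r = 0.561 and n = 67.

```python
import numpy as np

def split_by_taxon(X, labels):
    """Split grand-mean-centered data into between- and within-taxon parts."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    centered = X - X.mean(axis=0)            # remove the grand mean
    between = np.empty_like(centered)
    for taxon in np.unique(labels):
        rows = labels == taxon
        # every species is replaced by the centroid of its taxon
        between[rows] = centered[rows].mean(axis=0)
    within = centered - between              # deviation from own taxon centroid
    return centered, between, within

def pca_scores(M):
    """PCA via SVD: component scores and the share of variability per component."""
    U, s, _ = np.linalg.svd(M, full_matrices=False)
    return U * s, s**2 / np.sum(s**2)

def t_from_r(r, n):
    """t statistic for a Pearson correlation r based on n observations."""
    return r * np.sqrt(n - 2) / np.sqrt(1.0 - r**2)

# Synthetic stand-in data (assumption: not the cytochrome b data set)
rng = np.random.default_rng(0)
labels = np.repeat(["A", "B", "C"], 4)
taxon_shift = np.repeat(rng.normal(size=(3, 5)), 4, axis=0)
X = rng.normal(size=(12, 5)) + taxon_shift
latitude = rng.uniform(30, 70, size=12)

centered, between, within = split_by_taxon(X, labels)
ss_total = np.sum(centered**2)
ss_between = np.sum(between**2)
ss_within = np.sum(within**2)                # ss_total = ss_between + ss_within

# Principal components of the intertaxon part only
scores, share = pca_scores(between)
r_pc2_lat = np.corrcoef(scores[:, 1], latitude)[0, 1]

# t statistic implied by the values reported in the abstract (about 5.46, df = 65)
t_reported = t_from_r(0.561, 67)
```

Because the two parts are orthogonal, their sums of squares add up to the total, so the share of variability carried by each intertaxon component can be stated separately from the intrataxon variability, as in the percentages quoted in the abstract.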