JMB-HEADER RAS-JOURNALS EIMB Pleiades Publishing

Vol 59(2025) N 2 p. 263-271; DOI 10.1134/S0026893324700894

V.M. Efimov1,2,3,4*, K.V. Efimov5, V.Yu. Kovaleva2

Division of the Standard Set of Amino Acids into Groups According to Their Evolutionary Age

1Institute of Cytology and Genetics, Siberian Branch, Russian Academy of Sciences, Novosibirsk, 630090 Russia
2Institute of Animal Systematics and Ecology, Siberian Branch, Russian Academy of Sciences, Novosibirsk, 630091 Russia
3Novosibirsk State University, Novosibirsk, 630090 Russia
4Tomsk State University, Tomsk, 634050 Russia
5Higher School of Economics, Moscow, 101000 Russia

*vmefimov@gmail.com
Received - 2024-08-05; Revised - 2024-10-07; Accepted - 2024-10-16

It is generally accepted that the set of proteinogenic amino acids encoded by the standard genetic code was formed step by step in the course of evolution. Most studies name Ala, Asp, Glu, Gly, Ile, Leu, Pro, Ser, Thr, and Val as early amino acids, presumably of extraterrestrial origin. Other studies, however, adopt a consensus list of early amino acids in which Ile is replaced by Arg. We compared the differences between early and late amino acids for the lists with Ile and with Arg on the basis of their physicochemical properties (AAindex database). The point-biserial correlation coefficient r_pb, Student's t statistic, and its significance level (the p-value) were calculated between each of the two binary lists (with Ile and with Arg) and each AA index. Because 2 × 553 p-values were obtained in total, the problem of multiple comparisons was addressed using the Bonferroni correction and the Benjamini-Hochberg method. Next, we used the 2B-PLS (two-block partial least squares) method, which is applied to two different sets of variables measured on the same objects in order to find the information common to both sets. The first set consisted of the binary lists of Trifonov (Arg) and Wong (Ile), and the second set consisted of the 553 AA indexes. The highest correlation with both lists (1.0 with the Ile list and 0.8 with the Arg list) was shown by the binary AA index CHAM830108, which characterizes the ability of an amino acid to act as a charge donor: late amino acids can be donors, whereas early ones cannot. Apparently, this reflects the difference in the conditions, prebiotic and biotic, under which the standard set of amino acids evolved. The results of the 2B-PLS analysis also show that, in the list of ten evolutionarily early amino acids, Ile appears preferable to Arg. The assignment of the last six amino acids (Cys, His, Met, Phe, Trp, and Tyr) to a separate, third stage of the evolution of the standard amino acid set, derived on the basis of the reduction of the HOMO-LUMO gap, is confirmed. The 2B-PLS plane also reveals a compact arrangement, in terms of physicochemical properties, of the three groups of amino acids whose codons have adenine, thymine, or cytosine, respectively, in the second position, as well as the maximum dispersion of the amino acids with guanine in the second position.
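
The correlation and multiple-comparison steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. The 0/1 coding (late = 1), the function names, and the toy p-values are assumptions made for illustration, and no AAindex values are reproduced: a hypothetical binary index that coincides with the Ile-based split stands in for a real AA index. Because the two lists differ by a single Ile/Arg swap, any binary index that separates the Ile list perfectly must correlate at 0.8 with the Arg list, which is consistent with the pair of values quoted for CHAM830108.

```python
from math import sqrt

AMINO_ACIDS = ["Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
               "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val"]

# Early amino acids as named in the abstract; the second list swaps Ile for Arg.
EARLY_ILE = {"Ala", "Asp", "Glu", "Gly", "Ile", "Leu", "Pro", "Ser", "Thr", "Val"}
EARLY_ARG = (EARLY_ILE - {"Ile"}) | {"Arg"}


def late_indicator(early):
    """Assumed coding: 1 for a late amino acid, 0 for an early one."""
    return [0 if aa in early else 1 for aa in AMINO_ACIDS]


def point_biserial(binary, values):
    """Point-biserial r, i.e. Pearson's r where one variable is 0/1."""
    n = len(binary)
    mean_b, mean_v = sum(binary) / n, sum(values) / n
    cov = sum((b - mean_b) * (v - mean_v) for b, v in zip(binary, values))
    ss_b = sum((b - mean_b) ** 2 for b in binary)
    ss_v = sum((v - mean_v) ** 2 for v in values)
    return cov / sqrt(ss_b * ss_v)


def t_statistic(r, n):
    """Student's t for H0: r = 0, with n - 2 degrees of freedom."""
    return float("inf") if abs(r) >= 1.0 else r * sqrt((n - 2) / (1.0 - r * r))


def bonferroni(pvalues, alpha=0.05):
    """Reject H0 wherever p <= alpha / m."""
    m = len(pvalues)
    return [p <= alpha / m for p in pvalues]


def benjamini_hochberg(pvalues, alpha=0.05):
    """Reject the k smallest p-values, k = largest rank with p_(k) <= k * alpha / m."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    k = 0
    for rank, i in enumerate(order, start=1):
        if pvalues[i] <= rank * alpha / m:
            k = rank
    rejected = [False] * m
    for i in order[:k]:
        rejected[i] = True
    return rejected


# Hypothetical stand-in index (NOT AAindex data): identical to the Ile-based split.
index = late_indicator(EARLY_ILE)
r_ile = point_biserial(late_indicator(EARLY_ILE), index)
r_arg = point_biserial(late_indicator(EARLY_ARG), index)
print(r_ile, r_arg, t_statistic(r_arg, len(AMINO_ACIDS)))

# Toy p-values (illustrative only) to contrast the two corrections.
toy_p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
print(sum(bonferroni(toy_p)), sum(benjamini_hochberg(toy_p)))
```

In the study itself the same two procedures would be applied to all 2 × 553 p-values at once. Bonferroni controls the family-wise error rate and is the stricter of the two. Benjamini-Hochberg controls the false discovery rate and rejects at least as many hypotheses at the same alpha.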

early and late amino acids, AAindex, CHAM830108, point-biserial correlation coefficient, p-value, Bonferroni correction, Benjamini-Hochberg method, 2B-PLS analysis


