JMB-HEADER RAS-JOURNALS EIMB Pleiades Publishing

RUS

             

ENG

YearIMPACT-FACTOR
2023  1,500
2022  1,200
2021  1,540
2020  1,374
2019  1,023
2018  0,932
2017  0,977
2016  0,799
2015  0,662
2014  0,740
2013  0,739
2012  0,637
2011  0,658
2010  0,654
2009  0,570
2008  0,849
2007  0,805
2006  0,330
2005  0,435
2004  0,623
2003  0,567
2002  0,641
2001  0,490
2000  0,477
1999  0,762
1998  0,785
1997  0,507
1996  0,518
1995  0,502
Vol 58(2024) N 4 p. 611-635; DOI 10.1134/S0026893324700213 Full Text

E.E. Zelenova1,2*, A.A. Karlsen3, D.V. Avdoshina4, K.K. Kyuregyan3, M.G. Belikova4,5, I.D. Trotsenko6,7

Amino Acid Substitution Patterns in the E6 and E7 Proteins of HPV Type 16: Phylogeography and Evolution

1Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 119991 Russia
2Blokhin National Medical Research Center of Oncology, Moscow, 115478 Russia
3Mechnikov Research Institute of Vaccines and Sera, Moscow, 105064 Russia
4Chumakov Federal Scientific Center for Research and Development of Immunobiological Drugs, Russian Academy of Sciences, Moscow, 117218 Russia
5Gamaleya National Research Center for Epidemiology and Microbiology, Ministry of Health of the Russian Federation, Moscow, 123098 Russia
6Peoples' Friendship University of Russia, Moscow, 117198 Russia


*zelenovayeye@gmail.com
Received - 2023-10-05; Revised - 2023-12-28; Accepted - 2023-12-29

The E6 and E7 proteins of the high risk human papillomaviruses (HR HPVs) play a key role in the oncogenesis associated with papillomavirus infection. Data on the variability of these proteins are limited, and the factors affecting their variability are still poorly understood. We analyzed the variability of the currently known sequences of the HPV type 16 (HPV16) E6 and E7 proteins, taking into account their geographic origin and year of sample collection, as well as the direction of their evolution in the major geographic regions of the world. All sequences belonging to HPV16 genome fragments encoding the E6 and E7 oncoproteins were downloaded from the NCBI GenBank database on October 6, 2022. Samples were filtered according to the following parameters: the sequence has to include at least one of the two whole open reading frames, and given date of collection, and the country of origin. A total of 3651 full-genome nucleotide sequences encoding the E6 protein and 4578 full-genome nucleotide sequences encoding the E7 protein were sampled. The nucleotide sequences obtained after sampling and alignment were converted to amino acid sequences and analyzed using the MEGA11, R, RStudio, Jmodeltest 2.1.20, BEAST v1.10.4, Fastcov, and Biostrings software. The highest variability in the E6 protein was recorded for amino acid (AA) residues in the positions 17, 21, 32, 85, and 90. The most variable in E7 were aa positions 28, 29, 51, and 77. The samples were divided geographically into five heterogeneous groups as derived from Africa, Europe, America, SouthWest and South Asia, and South-East Asia. Unique amino acid substitutions (AA-substitutions) in the E6/E7 proteins of HPV16, presumably characteristic to certain ethnic groups, were identified for a number of countries. They weare mainly localized in the sites of known B- and T-cell epitopes and relatively rarely the domains critical for in structure and protein function. The revealed differences in AA-substitutions in different ethnic groups and their colocalization with the clusters of B- and T-cell epitopes suggested their possible relation to the geographical distribution of alleles and haplotypes of the major histocompatibility complex (HLA). This may lead to the recognition of a different set of B- and T-cell epitopes of the virus in different geographic areas, resulting in the regional differences in the direction of epitopic drift. Phylogenetic analysis of the nucleotide sequences encoding the E6 protein of HPV16 revealed a common ancestor, confirmed regional clustering of the E6 protein sequences sharing common AA-substitutions, and identified cases of reversion of individual AA-substitutions when the change of geographical localization. For the E7 protein, such analysis was not possible due to the high sequence homology. Covariance analysis of the pooled of E6 and E7 sequences revealed that there was no associations between amino acid residues in any aa position within E6 or E7 as well as aa positions of E6 and E7 proteins. The data presented here are important for the development of universal therapeutic vaccines against HPV of high carcinogenic risk.

human papillomavirus type 16, oncoprotein E6, oncoprotein E7, amino acid substitutions, phylogenetic analysis, covariance analysis, B-cell epitopes, T-cell epitopes



JMB-FOOTER RAS-JOURNALS