Vol 53(2019) N 1 p. 97-106; DOI 10.1134/S0026893319010023
M. Bartas1, P. Bažantová1, V. Brázda2, J.C. Liao2,3, J. Červeň1, P. Pečinka1*
Identification of Distinct Amino Acid Composition of Human Cruciform Binding Proteins
1Department of Biology and Ecology / Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Ostrava, 71000 Czech Republic
2Institute of Biophysics, Academy of Sciences of the Czech Republic v.v.i., Brno, 61265 Czech Republic
3School of Medicine, The University of Queensland, Greenslopes Private Hospital, Greenslopes, 4120 Australia
Received - 2017-10-11; Revised - 2018-02-20; Accepted - 2018-02-20
Cruciform structures are preferential targets for many architectural and regulatory proteins, as well as for a number of DNA binding proteins with weak sequence specificity. Some of these proteins can also induce the formation of cruciform structures upon DNA binding. In this paper we analyzed the amino acid composition of eighteen cruciform binding proteins of Homo sapiens. Comparison with the general amino acid frequencies in all human proteins revealed distinct differences, with notable enrichment in lysine and serine and/or depletion of alanine, glycine, glutamine, arginine, tyrosine and tryptophan residues. Bootstrap resampling and fuzzy cluster analysis suggest multiple molecular mechanisms of interaction with cruciform DNA structures, including those involved in DNA repair, transcription and chromatin regulation. The proteins DEK, HMGB1 and TOP1 in particular formed a very distinctive group. Nonetheless, a strong interaction network connecting nearly all the cruciform binding proteins studied was demonstrated. The data reported here should be useful for the future prediction of new cruciform binding proteins, or even for the construction of a predictive tool or web-based application.
cruciform structures, DNA-protein binding, cluster analysis, lysine, tryptophan
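
The composition comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names `composition` and `log2_enrichment`, the pseudocount, and the toy sequences are all assumptions made for illustration, and the bootstrap resampling and fuzzy clustering steps are not shown. The sketch computes pooled amino acid frequencies for a target set and a background set, then reports the log2 ratio per residue, where positive values indicate enrichment and negative values indicate depletion.

```python
from collections import Counter
import math

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(sequences):
    """Return pooled relative frequencies of the 20 standard residues.

    Non-standard characters (e.g. X, *, gaps) are ignored.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(ch for ch in seq.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no standard amino acid residues found")
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


def log2_enrichment(target, background, pseudocount=1e-6):
    """Return log2(target / background) per residue.

    The pseudocount (an assumption of this sketch) avoids division by
    zero and log(0) when a residue is absent from one of the sets.
    """
    return {
        aa: math.log2((target[aa] + pseudocount) / (background[aa] + pseudocount))
        for aa in AMINO_ACIDS
    }


if __name__ == "__main__":
    # Toy sequences for illustration only; they are not real proteins.
    target = composition(["KKSA"])
    background = composition(["KSAG"])
    enrichment = log2_enrichment(target, background)
    for aa in sorted(enrichment, key=enrichment.get, reverse=True):
        print(f"{aa}\t{enrichment[aa]:+.2f}")
```

In practice, the target set would be the cruciform binding protein sequences and the background would be a reference proteome, with statistical support for each difference estimated separately.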