Abstract
The molecules of Zn-finger transcription factors consist of several similar small protein units. We analyzed the crystal structures 46 basic units of 22 complexes of Zn-Cys2His2 family with the fragments of operator DNA. We showed that the recognition of DNA occurs via five protein contacts. The canonical binding positions of the recognizing α-helix were −1, 3, 6, and 7, which make contacts with the tetra-nucleotide sequence ZXYZ of the coding DNA strand; here the canonical binding triplet is underlined. The non-coding DNA strand forms only one contact at α-helix position 2. We have discovered that there is a single highly conservative contact His7α with the phosphate group of nucleotide Z, which precedes each triplet XYZ of the coding DNA chain. This particular contact is invariant for the all Zn-Cys2His2 family with high frequency of occurrence 83%, which we considered as an invariant recognition rule. We have also selected a previously unreported Zn-Cys2His2-Arg subfamily of 21 Zn-finger units bound with DNA triplets, which make two invariant contacts with residues Arg6α and His7α with the coding DNA chain. These contacts show frequency of occurrence 100 and 90%, and are invariant recognition rule. Three other variable protein-DNA contacts are formed mainly with the bases and specify the recognition patterns of individual factor units. The revealed recognition rules are inherent for the Zn-Cys2His2 family and Zn-Cys2His2-Arg subfamily of different taxonomic groups and can distinguish members of these families from any other family of transcription factors.
Acknowledgments
This work was supported by the Russian Foundation for Basic Research; Project No. 11-07-00374a. We thank the reviewers for the valuable notes which improved the manuscript. One of us, professor Yuri N. Chirgadze, would like to express his sincere gratitude to his wife for her invaluable help and encouragement during the work on this study.