Analysis of structures from Protein Data Bank using atom displacement parameters

Research article: Analysis of structures from Protein Data Bank using atom displacement parameters

Author:  R.C. Masmaliyeva

Institute of Molecular Biology and Biotechnologies, Azerbaijan National Academy of Sciences,

Baku, Azerbaijan

Abstract: Currently there are more than 156,000 macromolecular structures in Protein Data Bank (PDB). If at first times of development of Structural biology the priority purpose was solving as many structures as possible, currently the main purpose is solving structures, which allow understanding different biological processes. This is why validation and refinement of macromolecular structures become more and more important. New validation tools need to be developed. Classic methods of validation are based on physical and chem-ical features of macromolecules, as well as the geometry of amino acids. In our researches, we describe a method of validation by using Atom Displacement Parameters (ADP). ADP illustrates the variation of atom from its main position. Development of the method of statistical analysis of ADP distribution may be used as an effective validation tool. This contribution is devoted to application of statistical analysis of ADPs to PDB and possible use of this method for macromolecular validation.

Keywords: Atom displacement parameters, macromolecular refinement, validation 

References 

Berman H.M., Battistuz T., Bhat T.N., Bluhm W.F., Bourne P.E., Burkhardt K., Feng Z., Gil-liland G.L., Iype L., Jain S., Fagan P., Marvin J., Padilla D., Ravichandran V., Schneider B., Thanki N., Weissig H., Westbrook J.D., Zar-decki C. (2002) The Protein Data Bank. Acta Cryst., D58: 899-907. 

Carugo O. (2018) How large B-factors can be in protein crystal structures. BMC bioinformatics, 19(1): 61. 

Dauter Z., Murshudov G.N., Wilson K.S. (2006) International Tables for Crystallography. Vol. F: 393-402. Chester: International Union of Crys-tallography. 

Grosse-Kunstleve, R.W., Adams P.D. (2002) On the handling of atomic anisotropic displacement parameters. J. Appl. Cryst., 35: 477-480. 

Hible G., Christova P., Renault L., Andrew E.S., Girard T.E., Munier-Lehmann H., Cherfils J. (2005) Unique GMP-binding site in Mycobacte-rium tuberculosis guanosine monophosphate ki-nase. Proteins, 62(2): 489-500 

Jones E., Oliphant E., Peterson P., et al. (2019) SciPy: Open Source Scientific Tools for Python, 2001; http://www.scipy.org/ 

Read R.J., Adams P.D., Arendall W.B. 3rd, Brunger A.T., Emsley P., Joosten R.P., Kley-wegt G.J., Krissinel E.B., Lütteke T., Ot-winowski Z., Perrakis A., Richardson J., Shef-fler W., Smith J., Tickle I., Vriend G., Zwart P. (2011) A new generation of crystallographic val-idation tools for the protein data bank. Structure, 19: 1395–1412. 

Masmaliyeva R., Murshudov G.N. (2017) Refi-nement and validation of macromolecular struc-tures. Transactions of the Institute of Molecular Biology & Biotechnologies of ANAS, 1: 80-93. 

Masmaliyeva R., Murshudov G.N. (2018) Outli-er detection in atomic temperature factor - B va-lue distribution.Proceedings of ANAS (Biologi-cal and Medical Sciences), 73(2): 25-32. 

Masmaliyeva R.C., Murshudov G.N. (2019) Analysis and validation of macromolecular B values. Acta Cryst., D75: 505-518. 

Merritt E.A. (2011) Some Beq are more equiva-lent than others. Acta Cryst., A67: 512–516. 

McGill R., Tukey John W., Larsen Wayne A. (1978) Variations of box plots. The American Statistician., 32(1): 12–16. 

Morris A.L., MacArthur M.W., Hutchinson E.G., Thornton J.M. (1992) Stereochemical quality of protein structure coordinates. Proteins, 12 (4): 345–64. 

Murshudov G.N., Skubák P., Lebedev A.A., Pannu N.S., Steiner R.A., Nicholls R.A., Winn M.D., Long F., Vagin A.A. (2011) REFMAC5 for the refinement of macromolecular crystal structures. Acta Cryst., D67: 355–367. 

Pizarro J.C., Chitarra V., Verger D., Holm I., Petres S., Dartville S., Nato F., Longacre S., Bentley G.A. (2003) Crystal structure of a Fab complex whith Plasmodium falciparum MSP1-19.J. Mol.Biol., 328: 1091. 

R Core Team (2014) R: A Language and Environ-ment for Statistical Computing. Vienna: R Foun-dation for Statistical Computing. http://www.R-project.org. 

Read R.J., Adams P.D., Arendall W.B., Brunger A.T., Emsley P., Joosten R.P., Kleywegt G.J., Krissinel E.B., Lütteke T., Otwinowski Z., Per-rakis A., Richardson J.S., Sheffler W.H., Smith J.L., Tickle I.J., Vriend G., Zwart P.H. (2011) A new generation of crystallographic validation tools for the protein data bank. Structure, 12; 19(10):1395-412. 

Schneider B., Gelly J.C., de Brevern A.G., Černý J. (2014) Local dynamics of proteins and DNA evaluated from crystallographic B factors. Acta Cryst., Section D, Biological crystallog-raphy, 70(Pt 9): 2413–2419. 

Stuart A., Ord K., Arnold S. (1999) Kendall's Ad-vanced Theory of Statistics, 2A. London: Edward Arnold. 

Trueblood K.N., Burgi H.-B., Burzlaff H., Du-nitz J.D., Gramaccioli C.M., Schulz H.H., Shmueli U., Abrahams S.C. (1996) Atomic Dis-pacement Parameter Nomenclature. Report of a Subcommittee on Atomic Displacement Parame-ter Nomenclature. Acta Cryst., A52: 770-781. 

Tukey J.W. (1977). Exploratory data analysis. Ad-dison-Wesley Series in Behavioral Science - Quantitative Methods, 711.


Добавить комментарий

Оставить комментарий

reload, if the code cannot be seen