A Bayes-optimal sequence-structure theory that unifies protein sequence-structure recognition and alignment by Lathrop R. H., Rogers Jr R. G., Smith T. F.

A rigorous Bayesian analysis is presented that unifies protein sequence-structure alignment and recognition. Given a sequence, explicit formulae are derived to select (1) its globally most probable core structure from a structure library; (2) its globally most probable alignment to a given core structure; (3) its most probable joint core structure and alignment chosen globally across the entire library; and (4) its most probable individual segments, secondary structure, and super-secondary structures across the entire library. The computations involved are NP-hard in the general case (3D-3D). Fast exact recursions for the restricted sequence singleton-only (1D-3D) case are given. Conclusions include: (a) the most probable joint core structure and alignment is not necessarily the most probable alignment of the most probable core structure, but rather maximizes the product of core and alignment probabilities; (b) use of a sequence-independent linear or affine gap penalty may result in the highest-probability threading not having the lowest score; (c) selecting the most probable core structure from the library (core structure selection or fold recognition only) involves comparing probabilities summed over all possible alignments of the sequence to the core, and not comparing individual optimal (or near-optimal) sequence-structure alignments; and (d) assuming uninformative priors, core structure selection is equivalent to comparing the ratio of two global means.
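Conclusions (a) and (c) can be illustrated with a small numerical sketch. It is not the authors' formulation: the two cores, their alignment scores, and the assumption that the likelihood of a (core, alignment) pair is proportional to exp(-score) under uniform priors are all hypothetical, chosen only to show how the core with the largest summed probability can differ from the core holding the single best alignment.

```python
import math

# Hypothetical threading scores (lower is better) for every alignment of one
# sequence to two library cores. All numbers are invented for illustration.
scores = {
    "core_A": [1.0, 8.0, 8.0, 8.0],   # one excellent alignment, the rest poor
    "core_B": [1.5, 1.5, 1.5, 1.5],   # many moderately good alignments
}

# Assumption: likelihood of (core, alignment) is proportional to exp(-score),
# with uniform priors over cores and over alignments within a core.
weights = {core: [math.exp(-s) for s in ss] for core, ss in scores.items()}
total = sum(sum(ws) for ws in weights.values())

# Core selection: posterior of a core sums over ALL of its alignments.
core_posterior = {core: sum(ws) / total for core, ws in weights.items()}

# Alignment given a core: posterior of each alignment within that core.
alignment_posterior = {
    core: [w / sum(ws) for w in ws] for core, ws in weights.items()
}

# Joint posterior of (core, alignment index): product of the two above.
joint_posterior = {
    (core, i): core_posterior[core] * p
    for core, ps in alignment_posterior.items()
    for i, p in enumerate(ps)
}

best_core = max(core_posterior, key=core_posterior.get)
best_joint = max(joint_posterior, key=joint_posterior.get)
core_with_best_single_score = min(scores, key=lambda c: min(scores[c]))

print("most probable core:", best_core)
print("most probable joint (core, alignment):", best_joint)
print("core holding the single best score:", core_with_best_single_score)
```

With these toy numbers the most probable core is `core_B`, because its summed probability over alignments is larger, while the most probable joint pair lies in `core_A`, which holds the single best-scoring alignment. Ranking cores by their best individual alignment would therefore select a different core than ranking by summed probability.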


