Biology:3D-Jury
3D-Jury is a metaserver that aggregates and compares models from various protein structure prediction servers.[1]
The 3D-Jury algorithm takes in groups of predictions made by a collection of servers and assigns each pair a 3D-Jury score, based on structural similarity. To improve accuracy of the final model, users can select the prediction servers from which to aggregate results.[1] The authors of 3D-Jury designed the system as a meta-predictor because earlier results concluded that the average low-energy protein conformation (by way of aggregation) fit the true conformation better than simply the lowest-energy protein conformation.[2]
The Robetta automatic protein structure prediction server incorporates 3D-Jury into its prediction pipeline.[3]
As of January 2024, the links to 3D-Jury originally hosted by the BioInfoBank Institute are no longer valid.[4]
Algorithm
First, pairwise comparisons are made between every combination of models generated from chosen protein prediction servers. Each comparison is then scored using the MaxSub tool.[5] The score, [math]\displaystyle{ sim( M_{a,b}, M_{i,j} ) }[/math], is generated by counting the number of Cα atoms in the two predictions within 3.5 Å of each other after being superpositioned.
To get a roughly 90% chance two models are of a similar fold class, the authors set a threshold of 40 as the lowest score possible for a pair of models to be annotated as "similar".[1] The authors admittedly chose this threshold based on unpublished work.
There are two scores 3D-Jury gives: the best-model-mode score using one model from each server ([math]\displaystyle{ 3D-Jury-single(M_{a,b}) }[/math]) and the all-model-mode score that considers all models from each server ([math]\displaystyle{ 3D-Jury-all(M_{a,b}) }[/math]).[1]
The best-model-mode score using one model per server, [math]\displaystyle{ 3D-Jury-single(M_{a,b}) }[/math], is calculated as,
[math]\displaystyle{ 3D-Jury-single (M_{a,b}) = \frac{ \sum^N_i \max^{N_i}_{j,a \ne i \mbox{ OR } b \ne j} (sim(M_{a,b}, M_{i,j})) }{ 1 + N } }[/math]
where [math]\displaystyle{ N }[/math] is the number of servers and [math]\displaystyle{ N_i }[/math] is the number of top ranking models (with a maximum of 10) from the server [math]\displaystyle{ i }[/math], while a pairwise similarity score is calculated between models [math]\displaystyle{ M_{a,b} }[/math] (model [math]\displaystyle{ b }[/math] from server [math]\displaystyle{ a }[/math]) and [math]\displaystyle{ M_{i,j} }[/math] (model [math]\displaystyle{ j }[/math] from server [math]\displaystyle{ i }[/math]).[1]
While the all-model-mode score considering all models from the servers, [math]\displaystyle{ 3D-Jury-all(M_{a,b}) }[/math], is calculated as,
[math]\displaystyle{ 3D-Jury-all(M_{a,b}) = \frac{ \sum^N_i \sum^{N_i}_{j,a \ne i \mbox{ OR } b \ne j} sim(M_{a,b}, M_{i,j}) }{ 1 + \sum^N_i N_i } }[/math]
using similar variables as noted with the best-model-mode score.
Note, these meta-predictor scores do not take into account the confidence scores from each of the models from other servers.[1]
References
- ↑ 1.0 1.1 1.2 1.3 1.4 1.5 Ginalski K (2003). "3D-Jury: a simple approach to improve protein structure predictions". Bioinformatics 19 (8): 1015–1018. doi:10.1093/bioinformatics/btg124. ISSN 1367-4803. OCLC 110817016. PMID 12761065.
- ↑ Bonneau, Richard; Ruczinski, Ingo; Tsai, Jerry; Baker, David (2002). "Contact order and ab initio protein structure prediction". Protein Science 11 (8): 1937–1944. doi:10.1110/ps.3790102. ISSN 0961-8368. OCLC 112117834. PMC 2373674. http://dx.doi.org/10.1110/ps.3790102.
- ↑ Chivian D (2005). "Prediction of CASP6 structures using automated Robetta protocols". Proteins 61 (S7): 157–166. doi:10.1002/prot.20733. PMID 16187358. http://www3.interscience.wiley.com/cgi-bin/fulltext/112097083/HTMLSTART.
- ↑ "BioInfoBank Meta Server". https://bioinfo.pl/Meta/.
- ↑ Siew, Naomi; Elofsson, Arne; Rychlewski, Leszek; Fischer, Daniel (2000-09-01). "MaxSub: an automated measure for the assessment of protein structure prediction quality". Bioinformatics 16 (9): 776–785. doi:10.1093/bioinformatics/16.9.776. ISSN 1367-4803. OCLC 121793099. PMID 11108700. http://dx.doi.org/10.1093/bioinformatics/16.9.776.
External links
- BioInfoBank Meta Server 3D-Jury web interface
Original source: https://en.wikipedia.org/wiki/3D-Jury.
Read more |