Predicted Aligned Error

From HandWiki
Short description: Predicted output by AlphaFold indicating expected position error of protein structures
Predicted Aligned Error
Predicted Aligned Error.png
Filename extension.json
Internet media typeapplication/json
Developed byDeepMind, EMBL-EBI
Type of formatBioinformatics
Websitehttps://alphafold.ebi.ac.uk/faq

The Predicted Aligned Error (PAE) is a quantitative output produced by AlphaFold, a protein structure prediction system developed by DeepMind.[1] PAE estimates the expected positional error for each residue in a predicted protein structure if it were aligned to a corresponding residue in the true protein structure. This measurement helps scientists assess the confidence in the relative positions and orientations of different parts of the predicted protein model.[2]

Structure

PAE is presented as a two-dimensional (2D) interactive plot where the color at coordinates (x, y) represents the predicted position error at residue x if the predicted and true structures were aligned on residue y.[3] Lower PAE values for residue pairs from different domains suggest well-defined relative positions and orientations in the prediction, while higher PAE values indicate uncertainty in the relative positions or orientations. Users can download the raw PAE data for all residue pairs in a custom JSON format for further analysis or visualization using a programming language such as Python. The format of the JSON file is as follows:

[
    {
        "predicted_aligned_error": [[0, 1, 4, 7, 9, ...], ...],
        "max_predicted_aligned_error": 31.75
    }
]

In the JSON file, the field predicted_aligned_error provides the PAE value for each residue pair (rounded to the nearest integer), and the field max_predicted_aligned_error gives the maximum possible PAE value, which is capped at 31.75 Å. The PAE is measured in Ångströms.

A separately developed 3D viewer of PAE allows for more intuitive visualization.[4]

Predicted Aligned Error 3D viewer

Interpretation

Interpretation of PAE values allows scientists to understand the level of confidence in the predicted structure of a protein: Lower PAE values between residue pairs from different domains indicate that the model predicts well-defined relative positions and orientations for those domains. Higher PAE values for such residue pairs suggest that the relative positions and/or orientations of these domains in the 3D structure are uncertain and should not be interpreted.[5]

Caveats

Although PAE provides valuable information, users should note that it is asymmetric; the PAE value for (x, y) may differ from the value for (y, x), particularly between loop regions with highly uncertain orientations.[6] Moreover, while AlphaFold can make useful inter-domain predictions, intra-domain prediction accuracy is expected to be more reliable based on CASP14 validation.

External links

References