3 domains in total. Different colors represent different domains. The domain definition
is used for the domain structure modeling, see the domain boundary prediction part for details.
Final Full-length Models Predicted by I-TASSER-MTD
Colored by domain: domain 1 in red; domain 2 in blue; domain 3 in green.
(b)
For each target, I-TASSER-MTD generates an ensemble of structural conformations
by starting from a set of initial models generated by different templates. The server reports up to five final
models sorted by the energy. The accuracy of each model is quantitatively evaluated by estimated TM-score (eTM-score)
and estimated RMSD (eRMSD) that are calculated based on the significance of the structural analogous templates for domain
models assembly, convergence parameters of the domain assembly simulations, satisfaction degrees of the inter-domain
distances/interfaces, and the estimated accuracy of the individual domain model. eTM-score is typically in the range
of [0,1], where an eTM-score of higher value signifies a model with a high confidence and vice-versa.
(d)
Since the top 5 models are ranked by the energy or cluster size,
it is possible that the lower-rank models have a higher eTM-score in rare cases. Although the first model has a better quality
in most cases, it is also possible that the lower-rank models have a better quality than the higher-rank models as seen in
our benchmark tests.
P-score is used to estimate the population formed in the modeling
simulations based on the structural similarity or SPICKER clustering. P-score ranges from 0 to 1, and a higher value means
the structure occurs more often in the simulation trajectory.
(f)
The inter-domain interaction is defined as ≥1 residue pairs
with distance <8Å apart from the linker region. The probability ranges from 0 to 1, and a large value
indicates the two domains have a large probability of interaction.
All the residues are colored in black; however, those residues in template which are identical to the residue in the query sequence are highlighted in color. Coloring scheme is based on the property of amino acids, where polar are brightly coloured while non-polar residues are colored in dark shade. (more about the colors used)
(b)
Rank of templates represents the top five structural aligenment templates identified by using TM-align.
(c)
SeqId is the ratio between the number of identical residues and query length.
(d)
TplScore is the score of the template calculated based on harmonic mean of TM-scores between all domain models and the template.
Predicted Distance/Interface Map for Domain Assembly
Query structure is shown in cartoon, while the structural analog is displayed using backbone trace.
(b)
Ranking of proteins is based on TM-score of the structural alignment between the query structure and known structures in the PDB library.
(c)
RMSDa is the RMSD between residues that are structurally aligned by TM-align.
(d)
IDENa is the percentage sequence identity in the structurally aligned region.
(e)
Cov. represents the coverage of the alignment by TM-align and is equal to the number of structurally aligned residues divided by length of the query protein.
Download full result of the above consensus prediction.
Click the graph to show a high resolution version.
(a)
CscoreGO is the confidence score of predicted GO terms. CscoreGO values range in between [0-1]; where a higher value indicates a better confidence in predicting the function using the template.
(b)
The graph shows the predicted terms within the Gene Ontology hierachy for Molecular Function. Confidently predicted terms are color coded by CscoreGO:
Download full result of the above consensus prediction.
Click the graph to show a high resolution version.
(a)
CscoreGO is the confidence score of predicted GO terms. CscoreGO values range in between [0-1]; where a higher value indicates a better confidence in predicting the function using the template.
(b)
The graph shows the predicted terms within the Gene Ontology hierachy for Biological Process. Confidently predicted terms are color coded by CscoreGO:
Download full result of the above consensus prediction.
Click the graph to show a high resolution version.
(a)
CscoreGO is the confidence score of predicted GO terms. CscoreGO values range in between [0-1]; where a higher value indicates a better confidence in predicting the function using the template.
(b)
The graph shows the predicted terms within the Gene Ontology hierachy for Cellular Component. Confidently predicted terms are color coded by CscoreGO:
Click on the radio buttons to visualize predicted active site residues.
(a)
CscoreEC is the confidence score for the Enzyme Commission (EC) number prediction. CscoreEC values range in between [0-1]; where a higher score indicates a more reliable EC number prediction.
(b)
TM-score is a measure of global structural similarity between query and template protein.
(c)
RMSDa is the RMSD between residues that are structurally aligned by TM-align.
(d)
IDENa is the percentage sequence identity in the structurally aligned region.
(e)
Cov. represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein.
Click on the radio buttons to visualize predicted binding site and residues.
(a)
CscoreLB is the confidence score of predicted binding site. CscoreLB values range in between [0-1]; where a higher score indicates a more reliable ligand-binding site prediction.
(b)
BS-score is a measure of local similarity (sequence & structure) between template binding site and predicted binding site in the query structure. Based on large scale benchmarking analysis, we have observed that a BS-score >1 reflects a significant local match between the predicted and template binding site.
(c)
TM-score is a measure of global structural similarity between query and template protein.
(d)
RMSDa the RMSD between residues that are structurally aligned by TM-align.
(e)
IDENa is the percentage sequence identity in the structurally aligned region.
(f)
Cov. represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein.
Xiaogen Zhou, Wei Zheng, Yang Li, Robin Pearce, Chengxin Zhang, Eric W. Bell, Guijun Zhang, and Yang Zhang.
I-TASSER-MTD: A deep-learning based platform for multi-domain protein structure and function prediction,
to be submitted.
yangzhanglabumich.edu
| (734) 647-1549 | 100 Washtenaw Avenue, Ann Arbor, MI 48109-2218