I-TASSER-MTD Results for job ITM118
[Click on ITM118_results.tar.bz2 to download the tarball file including all results listed on this page]
Query Sequence and Predicted Domain Definition
|
>example (230 residues)
|
MNELVDTTEMYLRTIYDLEEEGVTPLRARIAERLDQSGPTVSQTVSRMERDGLLRVAGDRHLELTEKGRALAIAVMRKHRLAERLLVDVIGLPWEEVHAEACRWEHVMSEDVERRLVKVLNNPTTSPFGNPIPGLDELGVGPEPGADDANLVRLTELPAGSPVAVVVRQLTEHVQGDIDLITRLKDAGVVPNARVTVETTPGGGVTIVIPGHENVTLPHEMAHAVKVEKV
|
domain 1: 1-66
domain 2: 67-141
domain 3: 142-230
|
3 domains in total. Different colors represent different domains. The domain definition
is used for the domain structure modeling; see the domain boundary prediction section for details.
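The domain definition above can be applied to the query sequence directly. The following is a minimal sketch, assuming the boundaries are 1-based and inclusive (as the ranges 1-66, 67-141, 142-230 suggest); the names `SEQUENCE`, `DOMAINS` and `split_domains` are illustrative and not part of the server output.

```python
# Query sequence and domain boundaries copied from this report.
SEQUENCE = (
    "MNELVDTTEMYLRTIYDLEEEGVTPLRARIAERLDQSGPTVSQTVSRMERDGLLRVAGDR"
    "HLELTEKGRALAIAVMRKHRLAERLLVDVIGLPWEEVHAEACRWEHVMSEDVERRLVKVL"
    "NNPTTSPFGNPIPGLDELGVGPEPGADDANLVRLTELPAGSPVAVVVRQLTEHVQGDIDL"
    "ITRLKDAGVVPNARVTVETTPGGGVTIVIPGHENVTLPHEMAHAVKVEKV"
)

# Assumption: residue ranges are 1-based and inclusive on both ends.
DOMAINS = {
    "domain 1": (1, 66),
    "domain 2": (67, 141),
    "domain 3": (142, 230),
}


def split_domains(sequence, domains):
    """Return {domain name: subsequence} for 1-based inclusive ranges."""
    return {
        name: sequence[start - 1:end]
        for name, (start, end) in domains.items()
    }


parts = split_domains(SEQUENCE, DOMAINS)
for name, subsequence in parts.items():
    print(name, len(subsequence), subsequence[:10] + "...")
```

Because the three ranges are contiguous, concatenating the pieces in order gives back the full query sequence.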
|
Final Full-length Models Predicted by I-TASSER-MTD
|
|
Top 5 models constructed by I-TASSER-MTD
Probabilities of inter-domain interactions
domain1-domain2: 1.00; domain1-domain3: 0.17; domain2-domain3: 0.85;
|
(a) | Colored by domain: domain 1 in red; domain 2 in blue; domain 3 in green. |
(b) | For each target, I-TASSER-MTD generates an ensemble of structural conformations
starting from a set of initial models built from different templates. The server reports up to five final
models sorted by energy. The accuracy of each model is evaluated by an estimated TM-score (eTM-score)
and an estimated RMSD (eRMSD), which are calculated from the significance of the structurally analogous templates used for domain
model assembly, the convergence parameters of the domain assembly simulations, the degree to which the inter-domain
distances/interfaces are satisfied, and the estimated accuracy of the individual domain models. eTM-score typically lies in the range
[0,1]; a higher eTM-score indicates a higher-confidence model, and a lower one a lower-confidence model.
| (d) | Because the top 5 models are ranked by energy or cluster size,
a lower-ranked model can, in rare cases, have a higher eTM-score. The first model is of better quality
in most cases, but lower-ranked models were sometimes better than higher-ranked ones in
our benchmark tests. |
(e) | P-score is used to estimate the population formed in the modeling
simulations based on the structural similarity or SPICKER clustering. P-score ranges from 0 to 1, and a higher value means
the structure occurs more often in the simulation trajectory. |
(f) | An inter-domain interaction is defined as at least one residue pair,
outside the linker region, separated by a distance below 8 Å. The probability ranges from 0 to 1, and a larger value
indicates that the two domains are more likely to interact. |
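The interaction criterion in (f) can be sketched as a simple distance test. This is only an illustration under stated assumptions: each residue is represented by a single 3D point (for example a Cα atom, which the page does not specify), linker residues have already been removed by the caller, and the function name `domains_interact` is hypothetical. It does not reproduce how the server derives the reported probabilities.

```python
import math

CUTOFF_ANGSTROM = 8.0  # threshold quoted in legend (f)


def domains_interact(coords_a, coords_b, cutoff=CUTOFF_ANGSTROM):
    """Return True if at least one residue pair, one residue from each
    domain, is closer than `cutoff` angstroms.

    coords_a, coords_b: iterables of (x, y, z) tuples, one per residue.
    Assumption: linker residues are excluded before calling this function.
    """
    return any(
        math.dist(a, b) < cutoff
        for a in coords_a
        for b in coords_b
    )


# Toy coordinates, not taken from any model on this page.
domain_x = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0)]
domain_y = [(10.0, 0.0, 0.0), (20.0, 0.0, 0.0)]
print(domains_interact(domain_x, domain_y))  # closest pair is 6.2 apart
```

The check is quadratic in the number of residues, which is adequate for domains of this size.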
|
Predicted Individual Domain Structures
|
Predicted Secondary Structure
|
|
1 20 40 60 80 100 120 140 160 180 200 220 | | | | | | | | | | | |
|
Sequence |
MNELVDTTEMYLRTIYDLEEEGVTPLRARIAERLDQSGPTVSQTVSRMERDGLLRVAGDRHLELTEKGRALAIAVMRKHRLAERLLVDVIGLPWEEVHAEACRWEHVMSEDVERRLVKVLNNPTTSPFGNPIPGLDELGVGPEPGADDANLVRLTELPAGSPVAVVVRQLTEHVQGDIDLITRLKDAGVVPNARVTVETTPGGGVTIVIPGHENVTLPHEMAHAVKVEKV
|
|
Prediction |
CCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCSSSCCCCCSSSCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCSSSSSSSSSCCCCCCCHHHHHHHHHHCCCCCCSSSSSSSCCCCSSSSSSCCSSSSCCHHHHCCCSSSSC
|
|
Conf.Score |
97567689999999999996189866999999979993799999999999197898179726458889999999999999779999998499989999999998636989999999998379887999997897333666676543221211044348994289999994112357999999999909999829999995699749999699389848888412047759
|
|
|
H: Helix; S: Strand; C: Coil
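A per-residue string in the H/S/C alphabet is often easier to read as a list of segments. The sketch below collapses such a string into runs with 1-based, inclusive residue numbers; the function name `ss_segments` and the short demo string are illustrative and not taken from the server.

```python
from itertools import groupby


def ss_segments(ss):
    """Collapse a per-residue secondary structure string (H, S, C) into
    (state, start, end) runs using 1-based inclusive positions."""
    segments = []
    position = 1
    for state, run in groupby(ss):
        length = len(list(run))
        segments.append((state, position, position + length - 1))
        position += length
    return segments


# Short toy string; the full prediction row can be passed the same way.
print(ss_segments("CCHHHHSSC"))
# [('C', 1, 2), ('H', 3, 6), ('S', 7, 8), ('C', 9, 9)]
```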
|
Predicted Solvent Accessibility
|
|
1 20 40 60 80 100 120 140 160 180 200 220 | | | | | | | | | | | |
|
Sequence |
MNELVDTTEMYLRTIYDLEEEGVTPLRARIAERLDQSGPTVSQTVSRMERDGLLRVAGDRHLELTEKGRALAIAVMRKHRLAERLLVDVIGLPWEEVHAEACRWEHVMSEDVERRLVKVLNNPTTSPFGNPIPGLDELGVGPEPGADDANLVRLTELPAGSPVAVVVRQLTEHVQGDIDLITRLKDAGVVPNARVTVETTPGGGVTIVIPGHENVTLPHEMAHAVKVEKV
|
|
Conf.Score |
87524643130020003026675413143006317144210140043036431022254330202640351044004202002410341060436302620240111326500510261173054044342025364324245443334222303404644402000210253265244004203726031424030343574311021466541502460054020558
|
|
| Values range from 0 (buried residue) to 8 (highly exposed residue) |
Information for Domain Boundary Detection
|
Top 10 Full-length Templates for Domain Assembly
|
Rank | Template | SeqId | TplScore |
1 20 40 60 80 100 120 140 160 180 200 220 | | | | | | | | | | | |
|
|
| Query | | | MNELVDTTEMYLRTIYDLEEEGVTPLRARIAERLDQSGPTVSQTVSRMERDGLLRVAGDRHLELTEKGRALAIAVMRKHRLAERLLVDVIGLPWEEVHAEACRWEHVMSEDVERRLVKVLNNPTTSPFGNPIPGLDELGVGPEPGADDANLVRLTELPAGSPVAVVVRQLTEHVQGDIDLITRLKDAGVVPNARVTVETTPGGGVTIVIPGHENVTLPHEMAHAVKVEKV | |
1 | 4o6jA | 0.29 | 0.76 | -----VTEEDYLKIIQELVLYKGYATLADISRSLNVKRQSVRDEINHLISLSMAEK----------SGDREANRFLRKHRTAEILLSRCIGIPWERVDEEAMGIE-HGM--T-EEIIQRTEGVDRCPHGNP--------------PV--ADVRITSL-L-PDSTARISRIVYE-T--DDILHFLALNGLIPGKDIKIESVK-DTVRVLV-DGRSIEIPTD---------- |
2 | 6o5cA | 0.27 | 0.76 | ----TPNKEDYLKCIYEIGEQEPKITNKMVAEKMHVSAPAVSEMIKKMISQGWIVK----------KGYALVANLYRKHRLIEVFLIHQLGYNTQEVHQEAEVLEHTVSDTFIDRLDKILDFPDFCPHGGT--------------EM--NTTTLNTITE-L-GRFRLSRIHD--H-FD-LIQYLETHHLNINTELTLTIDTFKTYTICY-GDKELVIPEN---------- |
3 | 5cviB | 0.27 | 0.72 | -----PNKEDYLKIIYELSERDEKISNKQIAEK-SVSAPAVSE-VKKLLLEDLVLK----------KKQILASSLYRKHRLIEVFL-NHLNYTADEIHEEAEVLEHTVSDVFVERLDKFLNYPKVCPHGGT--------------ER--YRTTLKGVTE---GVYLLKRVQDN-F--Q-LLKY-EQHHLKIGDELRLLYDAFGAYTIEK-DGEQLQVTSA---------- |
4 | 6ndlA | 0.19 | 0.54 | ----SKYSQDVLQLLYKNK--PNYISGQSIAESLNISRTAVKKVIDQLKLEGCKID-----------AFSMISKFNLFIALGIRDAIQ-HFSQFLERLLQEIEKRYNQLFSEIREEYIAASNI------------------GRF-RH-----KWPND---WNRTLLFTEN-------------------DKQFKGQAIDLDYGYLIVRDEAGESHRL-I----------- |
5 | 3rirA | 0.19 | 0.53 | ----SKYSQDVLQLLYKNK--PNYISGQSIAESLNISRTAVKKVIDQLKLEGCKID-----------AFSMISKFNLFIALGIRDAIQ-HFSQFLERLLQEIEKRYNQLFSEIREEYNAASNI------------------GRF-RH-----KWPND---WNRTLLFTEN-------------------DKQFKGQAIDLDYDYLIVRDEAGESHRL-I----------- |
6 | 2ewnA | 0.21 | 0.51 | ---DNTVPLKLIALLANGE----FHSGEQLGETLGMSRAAINKHIQTLRDWGVDVF----------QGPAAAIGLSLVIGIVMAEVLRKLGILLAAMLIRELRAALELELAPYLSRWEKL-DNF-----------------NDLY-LQDRK-----------INRPV-KL-I--I-GD--KE-I----F---G-ISRGIDKQGALLLEQ-DGIIKPWM-G---------- |
7 | 3qphA | 0.24 | 0.50 | KEILTYWTLLVYGPSTAKE-----IST-KSGIPY-NR--VY-DTISSLKLRGFVTE--------------------------EVIGMTVVKSIIFSQYSLIIEIFKESTLEK-EI-IG--DIRFFA---MF----------NTVNF-NPK--HAVDFVKNLKRNIYAE-I-T----GKNLGRLET---LT--G-RVVGYTLSVNNIHLETENGVVKVGGM---------- |
8 | 4gygA | 0.26 | 0.48 | TMRLTAEDWRVLTAVEMGSKNHEIVPTPLIEKI------GVHKSIATLAKAGLIAR----------WMYLSRLAAIKEFAFMKALYEEGF---------EPAVMS---FNIDFPQ-MV-SMLDAAL--ASG---------------------KL------------DRAH-LTGGLDLALHTH-ADVYS---VGSRIGV-GKESDIMIVAKQKVLKIHRL---------- |
9 | 6k8nA | 0.29 | 0.48 | VSSAGRVQSPTLVQVVNSEIERSRYTKVSLLKWMSNTEATRGRIIEILVKRKYLTN----------ELCKYHYEAKVRLLDAVEIWKERTKYDHKKILKRISSSTGK--------------------M-------------TKSDILS----GLISFPTNSQKIPSIYNKGEN-SSYRKLVDLVRKITGG-KYVVKQAI--KILIRFSVSADNWIYH------------- |
10 | 4xgcC | 0.29 | 0.47 | HTALPPDLSVVYKLHLECGR---MINLFDWLQAFRVIQARFTRAVAELQFLGYIKM----------RDLLHFLLFRCSLEFLTELVGDLPRC--GKLRRELYVNCLAII---------KVNALERT---QF----------FHFFGG--NAFAL-CTDYSKLGKLTMETRLP--SFRP-YVEI-CRIAVLTDD----------------DYLKKKQLLCP---------- |
(a) | All residues are shown in black, except that template residues identical to the aligned query residue are highlighted in color. The coloring scheme is based on amino acid properties: polar residues are brightly colored, while non-polar residues are shown in a dark shade. |
(b) | Rank orders the structural alignment templates identified by TM-align; the ten top-ranked templates are listed. |
(c) | SeqId is the ratio between the number of identical residues and query length. |
(d) | TplScore is the score of the template calculated based on harmonic mean of TM-scores between all domain models and the template. |
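The two definitions in (c) and (d) can be written out as a short sketch. Two assumptions apply: the aligned template row has the same length as the query, with `-` marking gaps, as in the table above; and TplScore is taken to be exactly the harmonic mean of the per-domain TM-scores, although the legend only says it is "based on" that mean. The per-domain TM-scores are not reported on this page, so the values below are made up, and the function names are hypothetical.

```python
from statistics import harmonic_mean

GAP = "-"


def seq_id(query, aligned_template):
    """Number of identical residues divided by the query length.
    Gap positions in the template never count as matches."""
    if len(query) != len(aligned_template):
        raise ValueError("alignment row must be as long as the query")
    identical = sum(
        q == t
        for q, t in zip(query, aligned_template)
        if t != GAP
    )
    return identical / len(query)


def tpl_score(domain_tm_scores):
    """Harmonic mean of the TM-scores between each domain model and the
    template (assumed to be the whole of the TplScore calculation)."""
    return harmonic_mean(domain_tm_scores)


# Toy inputs, not taken from the table.
print(seq_id("ACDEFG", "AC-EYG"))   # 4 identical positions out of 6
print(tpl_score([0.5, 1.0]))        # harmonic mean of two made-up scores
```

A harmonic mean is pulled toward the smallest value, so a template that matches one domain poorly scores low even if the other domains match well.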
|
Predicted Distance/Interface Map for Domain Assembly
|
Proteins Structurally Close to the Query Protein
|
|
Top 10 structural analogs in PDB (as identified by TM-align)
(a) | Query structure is shown in cartoon, while the structural analog is displayed using backbone trace. |
(b) | Ranking of proteins is based on TM-score of the structural alignment between the query structure and known structures in the PDB library. |
(c) | RMSDa is the RMSD between residues that are structurally aligned by TM-align. |
(d) | IDENa is the percentage sequence identity in the structurally aligned region. |
(e) | Cov. represents the coverage of the alignment by TM-align and is equal to the number of structurally aligned residues divided by length of the query protein. |
|
Predicted Gene Ontology (GO) Terms
|
|
GO term | CscoreGO | Name |
GO:1901363 | 0.89 | heterocyclic compound binding |
GO:0097159 | 0.89 | organic cyclic compound binding |
GO:0003700 | 0.86 | transcription factor activity, sequence-specific DNA binding |
GO:0003677 | 0.86 | DNA binding |
GO:0046983 | 0.84 | protein dimerization activity |
(a) | CscoreGO is the confidence score of predicted GO terms. CscoreGO values range from 0 to 1, where a higher value indicates greater confidence in predicting the function using the template. |
(b) | The graph shows the predicted terms within the Gene Ontology hierarchy for Molecular Function. Confidently predicted terms are color coded by CscoreGO: |
| [0.4,0.5) | [0.5,0.6) | [0.6,0.7) | [0.7,0.8) | [0.8,0.9) | [0.9,1.0] |
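As a rough illustration of the binning, the sketch below maps a CscoreGO value onto the intervals listed in the legend. The interval edges come from the legend and the example scores from the GO term table above; the function name `cscore_bin` is hypothetical, and returning `None` for scores outside [0.4, 1.0] is an assumption about how unlisted scores are handled.

```python
from bisect import bisect_right

# Lower edges and labels of the legend intervals.
EDGES = [0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
LABELS = ["[0.4,0.5)", "[0.5,0.6)", "[0.6,0.7)",
          "[0.7,0.8)", "[0.8,0.9)", "[0.9,1.0]"]


def cscore_bin(score):
    """Return the legend interval containing `score`, or None when the
    score is below 0.4 or above 1.0 (assumed to be left uncolored)."""
    if score > 1.0:
        return None
    index = bisect_right(EDGES, score) - 1
    return LABELS[index] if index >= 0 else None


# CscoreGO values copied from the GO term table in this report.
for go_term, score in [("GO:1901363", 0.89), ("GO:0003677", 0.86),
                       ("GO:0046983", 0.84)]:
    print(go_term, score, cscore_bin(score))
```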
|
|
|
(b) | The graph shows the predicted terms within the Gene Ontology hierarchy for Biological Process. Confidently predicted terms are color coded by CscoreGO: |
| [0.4,0.5) | [0.5,0.6) | [0.6,0.7) | [0.7,0.8) | [0.8,0.9) | [0.9,1.0] |
|
|
|
(b) | The graph shows the predicted terms within the Gene Ontology hierarchy for Cellular Component. Confidently predicted terms are color coded by CscoreGO: |
| [0.4,0.5) | [0.5,0.6) | [0.6,0.7) | [0.7,0.8) | [0.8,0.9) | [0.9,1.0] |
|
|
|
Predicted Enzyme Commission (EC) Numbers
|
|
Top 5 enzyme homologs in PDB
(a) | CscoreEC is the confidence score for the Enzyme Commission (EC) number prediction. CscoreEC values range from 0 to 1, where a higher score indicates a more reliable EC number prediction. |
(b) | TM-score is a measure of global structural similarity between query and template protein. |
(c) | RMSDa is the RMSD between residues that are structurally aligned by TM-align. |
(d) | IDENa is the percentage sequence identity in the structurally aligned region. |
(e) | Cov. represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein. |
|
Predicted Ligand Binding Sites
|
|
Template proteins with similar binding site
Rank | CscoreLB | PDB Hit | TM-score | RMSDa | IDENa | Cov. | BS-score | Lig. Name | Download Complex | Predicted binding site residues |
1 | 0.37 | 1c0wD | 0.741 | 1.03 | 0.697 | 0.761 | 1.91 | QNA | complex1.pdb.gz | 7,36,37,39,40,43,47,50 |
2 | 0.10 | 1c0wA | 0.734 | 0.97 | 0.705 | 0.752 | 0.80 | CO | complex2.pdb.gz | 79,98,172,175 |
3 | 0.04 | 2h090 | 0.462 | 2.36 | 0.258 | 0.517 | 0.94 | III | complex3.pdb.gz | 32,34,85,90,92,93,96,100,103,104,107,108,109 |
4 | 0.03 | 3jsoB | 0.297 | 3.25 | 0.119 | 0.348 | 1.23 | QNA | complex4.pdb.gz | 5,7,36,37,39,40,43,44,50,59 |
5 | 0.01 | 2ff4B | 0.414 | 5.27 | 0.069 | 0.648 | 0.66 | III | complex5.pdb.gz | 51,52,53,54,55,66,67 |
(a) | CscoreLB is the confidence score of the predicted binding site. CscoreLB values range from 0 to 1, where a higher score indicates a more reliable ligand-binding site prediction. |
(b) | BS-score is a measure of local similarity (sequence & structure) between template binding site and predicted binding site in the query structure. Based on large scale benchmarking analysis, we have observed that a BS-score >1 reflects a significant local match between the predicted and template binding site. |
(c) | TM-score is a measure of global structural similarity between query and template protein. |
(d) | RMSDa is the RMSD between residues that are structurally aligned by TM-align. |
(e) | IDENa is the percentage sequence identity in the structurally aligned region. |
(f) | Cov. represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein. |
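The BS-score threshold in (b) can be applied to the table rows directly. The sketch below copies a subset of the columns from the ligand binding site table in this report and keeps the hits with BS-score > 1; the `Hit` record and variable names are illustrative, and treating the threshold as a hard filter is an assumption rather than something the server does.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    rank: int
    cscore_lb: float
    pdb_hit: str
    bs_score: float
    ligand: str


# Values copied from the binding site table in this report.
HITS = [
    Hit(1, 0.37, "1c0wD", 1.91, "QNA"),
    Hit(2, 0.10, "1c0wA", 0.80, "CO"),
    Hit(3, 0.04, "2h090", 0.94, "III"),
    Hit(4, 0.03, "3jsoB", 1.23, "QNA"),
    Hit(5, 0.01, "2ff4B", 0.66, "III"),
]

BS_SCORE_THRESHOLD = 1.0  # legend (b): BS-score > 1 is a significant local match

significant = [hit for hit in HITS if hit.bs_score > BS_SCORE_THRESHOLD]
for hit in significant:
    print(hit.rank, hit.pdb_hit, hit.ligand, hit.bs_score, hit.cscore_lb)
```

A hit can pass the BS-score threshold while having a low CscoreLB (rank 4 here), so the two scores are best read together.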
|
Reference:
Xiaogen Zhou, Wei Zheng, Yang Li, Robin Pearce, Chengxin Zhang, Eric W. Bell, Guijun Zhang, and Yang Zhang.
I-TASSER-MTD: A deep-learning based platform for multi-domain protein structure and function prediction,
to be submitted.
|
|