>protein (123 residues)
FVAELNNLLGREVQVVLSNGEVYKGVLHAVDNQLNIVLANASNKAGEKFNRVFIYRYIVH
IDSTERRIDREFAKQAEKIFPGVKYIEETNVVLIGDKVRVSEIGVEGVGPVAERAKRLFE
EFL
Sequence | FVAELNNLLGREVQVVLSNGEVYKGVLHAVDNQLNIVLANASNKAGEKFNRVFIYRYIVHIDSTERRIDREFAKQAEKIFPGVKYIEETNVVLIGDKVRVSEIGVEGVGPVAERAKRLFEEFL |
Prediction | CHHHHHHHCCCSSSSSSCCCCSSSSSSSSCCCCSSSSSSSSSSCCCCSSSSSSSSCSSSSSSCCCCCCHHHHHHHHHHHCCCCSSSCCCCSSSSSSSSSSSCCCCCCCCHHHHHHHHHHHHHC |
Conf.Score | 868989875989999997997999999976987489998767578987889999677999987887674999999999779978985688899975799865775577789999999999769 |
H:Helix; S:Strand; C:Coil | |
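The per-residue prediction and confidence strings above are easier to read as contiguous segments. The following is a minimal sketch, not part of LOMETS: the helper name `ss_segments` is hypothetical, and averaging the confidence digits per segment is an illustrative choice rather than anything the server reports.

```python
from itertools import groupby


def ss_segments(ss, conf):
    """Collapse a per-residue H/S/C string into runs.

    Returns (state, start, end, mean_confidence) tuples with 1-based,
    inclusive positions. The mean confidence is an illustrative summary.
    """
    if len(ss) != len(conf):
        raise ValueError("prediction and confidence strings must have equal length")
    segments = []
    pos = 0
    for state, run in groupby(ss):
        length = len(list(run))
        scores = [int(c) for c in conf[pos:pos + length]]
        segments.append((state, pos + 1, pos + length, sum(scores) / length))
        pos += length
    return segments


# First 17 residues of the Prediction and Conf.Score rows above.
ss = "CHHHHHHHCCCSSSSSS"
conf = "86898987598999999"
segments = ss_segments(ss, conf)
for state, start, end, mean_conf in segments:
    print(f"{state} {start:>3}-{end:<3} mean conf {mean_conf:.1f}")
```

Passing the full 123-character strings works the same way and yields one tuple per helix, strand, or coil run.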
Sequence | FVAELNNLLGREVQVVLSNGEVYKGVLHAVDNQLNIVLANASNKAGEKFNRVFIYRYIVHIDSTERRIDREFAKQAEKIFPGVKYIEETNVVLIGDKVRVSEIGVEGVGPVAERAKRLFEEFL |
Prediction | 645404521444030303645404020343444010203304566654111010332134053365621540053046216504316524003024414034531524231053035106626 |
Values range from 0 (buried residue) to 8 (highly exposed residue) | |
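To relate the solvent accessibility digits to individual residues, the two strings can be paired position by position. This is a minimal sketch under stated assumptions: the helper `buried_residues` is hypothetical, and treating values of 0 or 1 as "buried" is an illustrative cutoff, since the page only defines the endpoints of the 0 to 8 scale.

```python
def buried_residues(seq, acc, buried_max=1):
    """List (position, residue, value) for residues at or below buried_max.

    Positions are 1-based. buried_max=1 is an assumed cutoff, not a
    LOMETS setting.
    """
    if len(seq) != len(acc):
        raise ValueError("sequence and accessibility strings must have equal length")
    return [
        (i, aa, int(a))
        for i, (aa, a) in enumerate(zip(seq, acc), start=1)
        if int(a) <= buried_max
    ]


# First 20 residues of the Sequence and Prediction rows above.
seq = "FVAELNNLLGREVQVVLSNG"
acc = "64540452144403030364"
buried = buried_residues(seq, acc)
print(buried)
```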
|
LOMETS threading is guided by the consensus contact map (left figure) and distance map (right figure) derived from the confidence scores of DeepPotential. In both maps, the axes mark the residue index along the sequence. In the contact map, each dot represents a residue pair predicted to be in contact; in the distance map, a color scale represents distances from 1 to 20+ angstroms. No dots lie close to the diagonal of the contact map because LOMETS does not consider contacts between residue pairs separated by fewer than 4 residues. |
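The sequence-separation rule described above can be illustrated by deriving a contact set from a distance map. This is a sketch only: the function name is hypothetical, and the 8 angstrom cutoff is an assumed, commonly used convention, not a value stated on this page. Only the minimum separation of 4 residues comes from the text.

```python
def predicted_contacts(dist, cutoff=8.0, min_separation=4):
    """Return contact pairs (i, j), 1-based with i < j.

    dist is a square matrix of distances in angstroms. Pairs closer than
    min_separation positions in sequence are skipped, which is why no dot
    appears near the diagonal. cutoff=8.0 is an assumption.
    """
    n = len(dist)
    contacts = set()
    for i in range(n):
        for j in range(i + min_separation, n):
            if dist[i][j] < cutoff:
                contacts.add((i + 1, j + 1))
    return contacts


# Toy 6-residue map in which every pair of distinct residues is 5 angstroms apart.
toy = [[0.0 if i == j else 5.0 for j in range(6)] for i in range(6)]
contacts = predicted_contacts(toy)
print(sorted(contacts))
```

Even though every pair in the toy map is within the cutoff, only pairs at least 4 positions apart are reported.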
|
LOMETS uses FUpred and ThreaDom to predict the domain boundaries of the target protein: ThreaDom is used for homologous targets and FUpred for non-homologous targets. The domain prediction result is shown on the predicted contact map (left), and the FUscores for continuous and discontinuous domains are shown on the right. |
|
Rank | CMO | MAE | ID1 | ID2 | Cov | Norm. Zscore | Download Alignment | Aligned sequence (query positions 1-123) |
---|---|---|---|---|---|---|---|---|
Sec.Str. | | | | | | | | CHHHHHHHCCCSSSSSSCCCCSSSSSSSSCCCCSSSSSSSSSSCCCCSSSSSSSSCSSSSSSCCCCCCHHHHHHHHHHHCCCCSSSCCCCSSSSSSSSSSSCCCCCCCCHHHHHHHHHHHHHC |
Sol.Acc. | | | | | | | | 645404521444030303645404020343444010203304566654111010332134053365621540053046216504316524003024414034531524231053035106626 |
Seq | | | | | | | | FVAELNNLLGREVQVVLSNGEVYKGVLHAVDNQLNIVLANASNKAGEKFNRVFIYRYIVHIDSTERRIDREFAKQAEKIFPGVKYIEETNVVLIGDKVRVSEIGVEGVGPVAERAKRLFEEFL |
1 | 0.608 | 1.664 | 0.20 | 0.20 | 0.98 | 1.60 | LOMETS-Assemble | LGATLQDSIGKQVLVKLRDSHEIRGILRSFDQHVNLLLEDAEEINVYKRGTMVVRENVLFISPVP-SIVEQFVQTLEKGFSDIKVEDTGHIVLLQEETLIQITHIICDNMLRVRLRDLVLK-- | |||||||||||||||||||||||
2 | 0.586 | 1.885 | 0.21 | 0.20 | 0.95 | 1.60 | LOMETS-Assemble | LGATLQDSIGKQVLVKLRDSHEIRGILRSFDQHVNLLLEDAEEINVYKRGTMVVRENVLFISPVP-AIVNKVAEMARERGYTVDT---TDGAKIIFWVLVRAIRIFSEAEYLNLGIELLEK-- | |||||||||||||||||||||||
3 | 0.599 | 1.715 | 0.19 | 0.19 | 0.98 | 1.60 | LOMETS-Assemble | LGATLQDSIGKQVLVKLRDSHEIRGILRSFDQHVNLLLEDAEEINVYKRGTMVVRENVLFISPVP-SIPVEQFVQTLEKHSDIKVETAGHIVLLQEATLIQDSTIICDNDLRVRLRDLVLKF- | |||||||||||||||||||||||
4 | 0.571 | 1.847 | 0.16 | 0.15 | 0.94 | 1.60 | LOMETS-Assemble | LGATLQDSIGKQVLVKLRDSHEIRGILRSFDQHVNLLLEDAEEIIDYKRGTMVVRENVLFISPVP-TA-LRVYSHLKSVH-CVQHLP-DGSVTVE-SVLLQAALVSWTEELGSLTSLLKKG-- | |||||||||||||||||||||||
5 | 0.552 | 1.876 | 0.18 | 0.18 | 0.98 | 1.60 | LOMETS-Assemble | LGATLQDSIGKQVLVKLRDSHEIRGILRSFDQHVNLLLEDAEEIIDYKRGTMVVRENVLFISPVP-IPVEQFVQTLEKHGFSDIVEDTGHIVLLQEAETLIQSTCDNDEMLRVRLRDLVLKF- | |||||||||||||||||||||||
6 | 0.574 | 1.750 | 0.20 | 0.20 | 0.98 | 1.60 | LOMETS-Assemble | GTTKMVSLLNHSLNVTTKDGRTFVGQLLAFDGFMNLVLSDCQEYEKRMLGLVILRGFIVSLSVQG-IPVEQFVQTLEKHFSDIKVEDKGHIVLLQEATLIQIEICDNDEMLRVRLRDLVLKF- | |||||||||||||||||||||||
7 | 0.562 | 1.868 | 0.14 | 0.14 | 0.98 | 1.60 | LOMETS-Assemble | GTTKMVSLLNHSLNVTTKDGRTFVGQLLAFDGFMNLVLSDCQEYEKRMLGLVILRGFIVSLSVQG-ETALRVYSHLKSVLDHCVQHLPDGSVTVESVLLQAAALVSWDEELGSFLTSLLKGL- | |||||||||||||||||||||||
8 | 0.577 | 1.917 | 0.15 | 0.15 | 0.98 | 1.20 | LOMETS-Assemble | PLALIDKCIGNRIYVVMKGDKEFSGVLRGFDEYVNMVLDDVQEYMVNRLETILLSNNVAMLVPGG-NIETESRQFIENKNYSIQSIGPMLRVVFTRLATVDQYLTGANRSLGQELADHLFET- | |||||||||||||||||||||||
9 | 0.577 | 1.917 | 0.15 | 0.15 | 0.98 | 1.20 | LOMETS-Assemble | PLALIDKCIGNRIYVVMKGDKEFSGVLRGFDEYVNMVLDDVQEYMVNRLETILLSNNVAMLVPGG-NIETESRQFIENKNYSIQSIGPMLRVVFTRLATVDQYLTGANRSLGQELADHLFET- | |||||||||||||||||||||||
10 | 0.534 | 1.829 | 0.18 | 0.18 | 0.98 | 1.20 | LOMETS-Assemble | PLALIDKCIGNRIYVVMKGDKEFSGVLRGFDEYVNMVLDDVQEYMVNRLETILLSNNVAMLVPGG-IPVEQFVQTLEKHFSDIKVEDTGHIVLLQELIQIEEDSCDNDEMLRVRLRDLVLKF- | |||||||||||||||||||||||
|
Rank | PDB Hit | RCSB Link | CMO | MAE | ID1 | ID2 | Cov | Norm. Zscore | Download Alignment | Aligned sequence (query positions 1-123) |
---|---|---|---|---|---|---|---|---|---|---|
Sec.Str. | | | | | | | | | | CHHHHHHHCCCSSSSSSCCCCSSSSSSSSCCCCSSSSSSSSSSCCCCSSSSSSSSCSSSSSSCCCCCCHHHHHHHHHHHCCCCSSSCCCCSSSSSSSSSSSCCCCCCCCHHHHHHHHHHHHHC |
Sol.Acc. | | | | | | | | | | 645404521444030303645404020343444010203304566654111010332134053365621540053046216504316524003024414034531524231053035106626 |
Seq | | | | | | | | | | FVAELNNLLGREVQVVLSNGEVYKGVLHAVDNQLNIVLANASNKAGEKFNRVFIYRYIVHIDSTERRIDREFAKQAEKIFPGVKYIEETNVVLIGDKVRVSEIGVEGVGPVAERAKRLFEEFL |
1 | 5mkl11 | 5mkl | 0.349 | 3.250 | 0.24 | 0.21 | 0.87 | 1.24 | HHpred | PLKSLKTALNKIVLVKLKNGEEYVGRLEQSDGTMNLVLKDCTEDPVAKYGRVLIRSNILFISIDYESIMENPLKSLKTALNKVLVKSDGTMNLVLKDCTEYREGTDP---------------- | |||||||||||||||||||||||||||
2 | 6ppnB | 6ppn | 0.435 | 1.947 | 0.25 | 0.17 | 0.68 | 1.60 | SparksX | FYSFFKTLIDTEVTVELKNDMSIRGILKSVDQFLNVKLENISVVDAAAVKDLFIRSVVRYVHMSSAYVDILLADACRRDLANNK--------------------------------------- | |||||||||||||||||||||||||||
3 | 3jcr2 | 3jcr | 0.386 | 2.244 | 0.28 | 0.18 | 0.65 | 1.32 | HHsearch | FYSFFKSLVGKDVVVELKNDLSICGTLHSVDQYLNIKLTDISVTHMLSVKNCFIRSVVRYVQLPADEVDQLLQDAARKEA------------------------------------------- | |||||||||||||||||||||||||||
4 | 5nrll | 5nrl | 0.454 | 2.064 | 0.18 | 0.14 | 0.76 | 1.31 | MUSTER | LVNFLKKLRNEQVTIELKNGTTVWGTLQSVSPQMNAILTDVKLTLIASLQYINIRNTIRQIILPDSLNLDSLLVDQKQLNSLRRSANKRP-----------RRGL------------------ | |||||||||||||||||||||||||||
5 | 6ppnB | 6ppn | 0.444 | 2.199 | 0.26 | 0.16 | 0.63 | 1.12 | FFAS3D | -YSFFKTLIDTEVTVELKNDMSIRGILKSVDQFLNVKLENISVVDASKVKDLFIRGVVRYVHMSSAYVDTILLADACRR-------------------------------------------- | |||||||||||||||||||||||||||
6 | 6v4xC | 6v4x | 0.426 | 2.196 | 0.23 | 0.15 | 0.66 | 1.22 | HHpred | LIILLQGLQGRVTTVDLRDESVAHGRIDNVDAFMNIRLAKVTYGHQVKLDDLFVTRNVRYVHIPDD-VNSTIEQQLQIIHRV----------------------------------------- | |||||||||||||||||||||||||||
7 | 5mkl11 | 5mkl | 0.522 | 2.415 | 0.21 | 0.21 | 1.00 | 1.56 | SparksX | PLKSLKTALNKIVLVKLKNGEEYVGRLEQSDGTMNLVLKDCTEYREGKYGRVLIRSNILFISIDYESIIENPLKSLKTALNKVLVKLKNGEEYVGRLEQSDGTMNLVLKDCTEYREGTSDPVA | |||||||||||||||||||||||||||
8 | 6ppnD | 6ppn | 0.448 | 2.107 | 0.22 | 0.14 | 0.64 | 1.29 | HHsearch | PLTLLNATQGRPILVELKNGETFNGHLENCDNYMNLTLREVIRTKFFRLPECYIRNNIKYLRIQDEVLSVAKQQAQQRE-------------------------------------------- | |||||||||||||||||||||||||||
9 | 4m77A | 4m77 | 0.398 | 2.325 | 0.20 | 0.13 | 0.65 | 1.18 | MUSTER | ---TLKDYLNKRVVIILVDGESLIASLNGFDKNTNLFLTNVFNRISFISKAQLLRSEIALVGLIDAENDDSLIENEHVIWEKV---------------------------------------- | |||||||||||||||||||||||||||
10 | 4c8qB | 4c8q | 0.383 | 2.427 | 0.28 | 0.18 | 0.63 | 1.09 | FFAS3D | -FSFFKTLVDQEVVVELKNDIEIKGTLQSVDQFLNLKLDNISCTDEKKVRNIFIRGSTVRYVYLNKNVDTNLLQDATRR-------------------------------------------- | |||||||||||||||||||||||||||
|
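The rows of the template table above can be read programmatically and screened with the page's own criterion that a normalized Z-score of at least 1 indicates a good alignment. This is a minimal sketch: the field names and the `parse_row` helper are hypothetical, the alignment column is omitted for brevity, and sorting the surviving hits by CMO is an illustrative choice rather than how LOMETS ranks templates.

```python
FIELDS = ["rank", "pdb_hit", "rcsb", "cmo", "mae", "id1", "id2", "cov", "zscore", "program"]
NUMERIC = ("cmo", "mae", "id1", "id2", "cov", "zscore")


def parse_row(line):
    """Split one pipe-delimited table row into a dict keyed by FIELDS."""
    cells = [c.strip() for c in line.split("|")]
    row = dict(zip(FIELDS, cells))
    row["rank"] = int(row["rank"])
    for key in NUMERIC:
        row[key] = float(row[key])
    return row


# Four rows copied from the table above, without the alignment column.
rows = [
    "1 | 5mkl11 | 5mkl | 0.349 | 3.250 | 0.24 | 0.21 | 0.87 | 1.24 | HHpred",
    "2 | 6ppnB | 6ppn | 0.435 | 1.947 | 0.25 | 0.17 | 0.68 | 1.60 | SparksX",
    "7 | 5mkl11 | 5mkl | 0.522 | 2.415 | 0.21 | 0.21 | 1.00 | 1.56 | SparksX",
    "10 | 4c8qB | 4c8q | 0.383 | 2.427 | 0.28 | 0.18 | 0.63 | 1.09 | FFAS3D",
]
hits = [parse_row(r) for r in rows]

# Keep alignments with normalized Z-score >= 1, then order by CMO (illustrative).
good = [h for h in hits if h["zscore"] >= 1.0]
by_cmo = sorted(good, key=lambda h: h["cmo"], reverse=True)
for h in by_cmo:
    print(h["rank"], h["pdb_hit"], h["program"], h["cmo"], h["zscore"])
```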
Download Model 1 |
Download Model 2 |
Download Model 3 |
Download Model 4 |
Download Model 5 |
Top 10 structural analogues in PDB (as identified by TM-align) |
Function Annotation for Proteins with Similar Structure |
(a) | PDB Hit is the PDB ID of the specific domain/chain aligned to Model 1. |
(b) | RCSB Link is the RCSB PDB link to the full-length protein. |
(c) | Gene Ontology (GO) term and Enzyme Commission (EC) number list the function annotations of the structurally aligned proteins, as collected by the BioLiP database. |
(d) | Ligand lists the known binding ligands for each aligned protein, as collected by the BioLiP database. |
(e) | Binding Site Residues lists the known binding residues for each ligand, as collected by the BioLiP database. Residue numbering follows the specific domain/chain numbering, and highlighted residues are those present in the alignment with Model 1.
Note: Listed binding residues may not represent the full ligand binding pocket and may belong to a complex with another protein chain/domain. Also, a single ligand may have multiple binding sites on one template. |
Predicted Gene Ontology (GO) Terms |
|
Predicted Enzyme Commission (EC) Numbers |
Top 5 enzyme homologs in PDB
|
Predicted Ligand Binding Sites |
Template proteins with similar binding site:
|
(a) | Templates are ranked in descending order of normalized Z-score, as explained by (i) below. |
(b) | PDB Hit is the PDB ID of the specific domain/chain aligned to the query. |
(c) | RCSB Link is the RCSB PDB link of the full-length protein. |
(d) | CMO is the contact map overlap score, calculated from the number of overlapping contacts between the predicted contact map and the contact map derived from the aligned template, then normalized by the number of predicted contacts. |
(e) | MAE is the mean absolute error between the predicted distance map and the distance map derived from the aligned template. |
(f) | ID1 is the number of template residues identical to the query divided by the number of aligned residues. |
(g) | ID2 is the number of template residues identical to the query divided by the query sequence length. |
(h) | Cov is the number of aligned template residues divided by the query sequence length. |
(i) | Norm. Zscore is the normalized Z-score of the threading alignments. A normalized Z-score ≥1 means a good alignment. |
(j) | Download PDB provides the structure of the aligned region for each template. |
(k) | Download FASTA provides the full-length alignment of model and template in FASTA format. It also shows the alignment for potential ligand binding pockets and GO/EC terms. |
(l) | Gene Ontology (GO) term and Enzyme Commission (EC) number list the function annotations of the template proteins, as collected by the BioLiP database. |
(m) | Ligand lists the known binding ligands for each template protein, as collected by the BioLiP database. |
(n) | Binding Site Residues lists the known binding residues for each ligand, as collected by the BioLiP database. Residue numbering follows the template PDB numbering, and highlighted residues are those aligned with the query sequence.
Note: Listed binding residues may not be representative of the full ligand binding pocket and may be part of a complex with another protein chain/domain. Also, for a single ligand, one template may have multiple binding sites. |
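The definitions of CMO, MAE, ID1, ID2, and Cov in notes (d) to (h) can be sketched in a few lines. This is an illustration under stated assumptions, not the LOMETS implementation: all function names are hypothetical, contacts are represented as sets of residue-index pairs, distance maps as dictionaries keyed by residue pair, and MAE is averaged only over pairs present in both maps, which is an assumption the notes do not spell out.

```python
def contact_map_overlap(predicted, template):
    """CMO: shared contacts divided by the number of predicted contacts."""
    pred = {tuple(sorted(p)) for p in predicted}
    temp = {tuple(sorted(p)) for p in template}
    return len(pred & temp) / len(pred) if pred else 0.0


def distance_mae(predicted, template):
    """MAE over residue pairs present in both distance maps (assumed scope)."""
    shared = predicted.keys() & template.keys()
    if not shared:
        raise ValueError("no residue pairs in common")
    return sum(abs(predicted[k] - template[k]) for k in shared) / len(shared)


def alignment_stats(query, template_row, gap="-"):
    """ID1, ID2 and Cov from a template row aligned column by column to the query."""
    if len(query) != len(template_row):
        raise ValueError("aligned row must be as long as the query")
    aligned = sum(1 for t in template_row if t != gap)
    identical = sum(1 for q, t in zip(query, template_row) if t != gap and q == t)
    return {
        "ID1": identical / aligned if aligned else 0.0,
        "ID2": identical / len(query),
        "Cov": aligned / len(query),
    }


# Toy data, invented for illustration only.
cmo = contact_map_overlap({(1, 5), (2, 8), (3, 9), (4, 10)}, {(1, 5), (3, 9), (6, 10)})
mae = distance_mae({(1, 5): 6.0, (2, 8): 10.0}, {(1, 5): 7.0, (2, 8): 8.5, (3, 9): 12.0})
stats = alignment_stats("ACDEFGHIKL", "ACDQFG----")
print(cmo, mae, stats)
```

In the toy alignment, 6 of 10 query positions are aligned and 5 of those are identical, so ID1 is 5/6, ID2 is 5/10, and Cov is 6/10.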
Templates from Hybrid-CEthreader |
Templates from SparksX |
Templates from CEthreader |
Templates from HHsearch |
Templates from MapAlign |
Rank | PDB Hit | RCSB Link | CMO | MAE | ID1 | ID2 | Cov | Norm. Zscore | Download PDB | Download FASTA | Gene Ontology (GO) term (Molecular Function) | Gene Ontology (GO) term (Biological Process) | Gene Ontology (GO) term (Cellular Component) | Enzyme Commission (EC) number | Ligand | Binding site residues |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 5mkl14 | 5mkl | 0.599 | 2.130 | 0.21 | 0.20 | 1.29 | 0.95 | template_1 | template_1 | | | | | | |
2 | 5mkl11 | 5mkl | 0.583 | 2.282 | 0.23 | 0.22 | 1.21 | 0.89 | template_2 | template_2 | | | | | | |
3 | 4xi6A | 4xi6 | 0.506 | 2.853 | 0.11 | 0.10 | 2.95 | 0.66 | template_3 | template_3 | GO:0008270,GO:0004842,GO:0046872 | GO:0016567 | | 2.3.2.27 | ZN ZN | C79 C82 C103 C106 C94 C97 H112 H116 |
4 | 5ohqA | 5ohq | 0.503 | 2.809 | 0.12 | 0.10 | 0.89 | 0.66 | template_4 | template_4 | | GO:0006357,GO:0032784 | | | | |
5 | 3dlmA | 3dlm | 0.491 | 2.334 | 0.11 | 0.09 | 1.69 | 0.63 | template_5 | template_5 | | | | 2.1.1.43 | | |
6 | 5uz5K | 5uz5 | 0.481 | 2.511 | 0.20 | 0.15 | 1.01 | 0.63 | template_6 | template_6 | GO:0003723,GO:0005515,GO:0003674 | GO:0000398,GO:0006397,GO:0008380 | GO:0005737,GO:0005686,GO:0005685,GO:0005682,GO:0046540,GO:0071004,GO:0005687,GO:0071013,GO:0005634 | | | |
7 | 7abi7 | 7abi | 0.491 | 3.156 | 0.05 | 0.04 | 1.92 | 0.63 | template_7 | template_7 | | | | | | |
8 | 2ckkA | 2ckk | 0.491 | 3.003 | 0.06 | 0.05 | 0.98 | 0.61 | template_8 | template_8 | | | | | | |
9 | 6ppnD | 6ppn | 0.463 | 2.572 | 0.23 | 0.15 | 0.72 | 0.58 | template_9 | template_9 | | | | | NUC | Y35 N37 R62 N64 |
10 | 7oqcb | 7oqc | 0.466 | 2.167 | 0.19 | 0.15 | 0.98 | 0.58 | template_10 | template_10 | | | | | | |
Templates from MUSTER |
Templates from MRFsearch |
Templates from DisCovER |
Templates from FFAS3D |
Templates from EigenThreader |
Templates from HHpred |
References: | |
1. | W Zheng, Q Wuyun, X Zhou, Y Li, P Freddolino, Y Zhang. LOMETS3: Integrating deep-learning and profile-alignment for advanced protein template recognition and function annotation. in preparation, (2021) |
2. | W Zheng, C Zhang, Q Wuyun, R Pearce, Y Li, Y Zhang. LOMETS2: improved meta-threading server for fold-recognition and structure-based function annotation for distant-homology proteins. Nucleic Acids Research, 47: W429-W436 (2019). |
3. | S Wu, Y Zhang. LOMETS: A local meta-threading-server for protein structure prediction. Nucleic Acids Research, 35: 3375-3382 (2007). |