Protein

Protein accession
8i5pF [EnVhog]
Representative
7QFV
Source
EnVhog (cluster: phalp2_30768)
Protein name
8i5pF
Lysin probability
99%
PhaLP type
endolysin
Probability: 96% (predicted by ML model)
Protein sequence
MNRGTSLQSAYVKRQIQKAFPDNAEQMKKIFRCESGLDSKAVGDRTMTYKDGQTVYGASYGLAQIRSLPGRPTPSWLMNVSNNLSYAKQLYDAHGVQPWTCAKIVGANDPGHVTHNR
Physico‐chemical
properties
protein length:117 AA
molecular weight:12991,5 Da
isoelectric point:9,65
hydropathy:-0,67
Representative Protein Details
Accession
7QFV
Protein name
7QFV
Sequence length
88 AA
Molecular weight
9973,97770 Da
Isoelectric point
6,29869
Sequence
VAYATKQAQEAKIRPTEVLGTIQCESNWNPQALGDGGHSRGLSQIHQPSHPDIAPEQAYDPRFAIDYMVSEMKDGRARQWTCWRNLYS
Other Proteins in cluster: phalp2_30768
Total (incl. this protein): 17 Avg length: 108,6 Avg pI: 6,76

Protein ID Length (AA) pI
7QFV 88 6,29869
1KRiy 108 6,39742
1Lcvw 94 6,06906
1LxOl 134 6,81780
1McCJ 119 6,33399
1hTSP 88 8,65928
1muie 101 8,36872
3f18y 118 5,83705
40Vah 100 6,08901
4fmjU 115 6,25157
6SZmH 109 6,98156
6Wrn8 132 9,14428
7uxc 124 6,50110
8dNv7 114 4,71641
8rt2s 71 5,77737
brLR 114 5,00060
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_22198
6TEqI
62 48,8% 86 2.337E-32
2 phalp2_23605
94OC
1 35,2% 71 8.497E-21
3 phalp2_36104
6GwYI
3 30,5% 95 6.633E-18
4 phalp2_2217
4fJmE
4 31,7% 85 9.109E-18
5 phalp2_22454
ULVA
3 36,4% 85 4.100E-16
6 phalp2_1911
4fRTM
1 34,1% 79 5.191E-15
7 phalp2_4149
1Kwjd
4 26,8% 97 1.847E-14
8 phalp2_11590
1cI1V
2 35,7% 84 6.573E-14
9 phalp2_30771
87u1
4 34,3% 67 1.240E-13
10 phalp2_17305
4f0W9
1 26,5% 94 1.695E-09

Domains

Domains
Unannotated
Representative sequence (used for alignment): 7QFV (88 AA)
Member sequence: 8i5pF (117 AA)
1 88 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
8i5pF
Method AlphaFoldv2
Resolution 90.17
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7QFV) rather than this protein.
PDB ID
7QFV
Method AlphaFoldv2
Resolution 97.95
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50