Protein

Protein accession
4F7Ol [EnVhog]
Representative
53zI9
Source
EnVhog (cluster: phalp2_31981)
Protein name
4F7Ol
Lysin probability
88%
PhaLP type
endolysin
Probability: 59% (predicted by ML model)
Protein sequence
MSHWAAEHIGRPWSPEYDCWAFVREVFHDQFGVDLPEHAAGVLILNGARNDTGLRPHDDDGGQEGDLVLMRTSSGKRHVGVMTFANGRLGVVHNDGHIVERFEDGVLVARYPVGCVGFTTLKELRETGCGVFQFWRKANLQ
Physico‐chemical
properties
protein length:141 AA
molecular weight:15743,5 Da
isoelectric point:5,86
hydropathy:-0,31
Representative Protein Details
Accession
53zI9
Protein name
53zI9
Sequence length
133 AA
Molecular weight
15042,22860 Da
Isoelectric point
8,50649
Sequence
VSDLLAFINERVGTPWRRDPDGNCWALVCEVQERFFHRTLPLSTYPKTPIGRHRAIHGHPAMAAWRQTDAPEHGAVVLMSRGDMSRRVDEHAGVCLMLPAPMVLHVDAPQGACLEGLMQVRFRGWNTTFYIPK
Other Proteins in cluster: phalp2_31981
Total (incl. this protein): 11 Avg length: 136,6 Avg pI: 7,45

Protein ID Length (AA) pI
53zI9 133 8,50649
1Q6uY 143 5,79737
4KsVN 143 8,88337
4NUr5 137 7,91867
5DnPi 129 7,09256
5EpJx 129 7,12803
5EqdH 129 7,15776
5IVnq 139 9,14473
6x2qm 151 6,50496
7cmjK 129 7,95941
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_18081
3SQ2q
120 30,8% 133 4.549E-24
2 phalp2_14093
8iNl2
704 37,8% 119 1.459E-22
3 phalp2_9861
2bdZh
36 40,3% 119 2.741E-22
4 phalp2_27187
3gFSw
103 33,5% 146 1.203E-20
5 phalp2_14318
3Uztr
3 27,0% 133 3.182E-14
6 phalp2_8216
7ruj8
1 26,3% 133 4.357E-14
7 phalp2_19944
6EYn3
118 28,2% 124 2.865E-13
8 phalp2_17424
4Ue0V
32 24,8% 129 5.906E-11
9 phalp2_30783
cA3P
33 29,5% 88 6.418E-09
10 phalp2_38609
25FGT
2035 27,0% 122 1.636E-08

Domains

Domains
Unannotated
Representative sequence (used for alignment): 53zI9 (133 AA)
Member sequence: 4F7Ol (141 AA)
1 133 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (53zI9) rather than this protein.
PDB ID
53zI9
Method AlphaFoldv2
Resolution 91.75
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50