Protein

Protein accession
40W37 [EnVhog]
Representative
40W37 (this protein)
Source
EnVhog (cluster: phalp2_1556)
Protein name
40W37
Lysin probability
99%
PhaLP type
endolysin
Probability: 96% (predicted by ML model)
Protein sequence
MLLLRARIVAESLNLKGTPYLWGAKGGQRLTADYLAWAKGPDNIGWRPEWAGSGKPSQWEPSPGGFALDCSGAFGEAIKRAGGPNLDAWHTGRYWQDLPPIGSPLPGDAALYIGHIEMVIGTVDGLVVVGGSSGGDSKTTTLEIASNRRAMYKVKQTHLYRPDFRGFRSIVPFLG
Physico‐chemical
properties
protein length:175 AA
molecular weight:18852,3 Da
isoelectric point:9,30
hydropathy:-0,23
Other Proteins in cluster: phalp2_1556
Total (incl. this protein): 2 Avg length: 162,0 Avg pI: 8,13

Protein ID Length (AA) pI
4jTFG 149 6,97178
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26078
7s07Z
53 40,6% 177 8.135E-29
2 phalp2_33743
ZDk4
1 33,8% 124 2.515E-16
3 phalp2_3467
4b9No
1 25,9% 166 1.280E-09
4 phalp2_4609
4iAfK
3 28,1% 135 3.686E-08
5 phalp2_29063
6QfOM
447 31,2% 131 3.097E-07
6 phalp2_27807
7Tiwg
23 31,4% 159 7.691E-07
7 phalp2_25998
6KgfU
2 27,5% 138 2.579E-06
8 phalp2_336
6VNQk
11 27,2% 158 1.574E-05
9 phalp2_13018
42lWN
9 29,2% 113 7.065E-05

Domains

Domains
Unannotated
Protein sequence: 40W37
1 175
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
40W37
Method AlphaFoldv2
Resolution 84.13
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50