Protein

Protein accession
4jTFG [EnVhog]
Representative
40W37
Source
EnVhog (cluster: phalp2_1556)
Protein name
4jTFG
Lysin probability
99%
PhaLP type
endolysin
Probability: 96% (predicted by ML model)
Protein sequence
MNTDILLNEVGVPYGWGHKGYNASGVPVKEHNGKLVEALYYGYDCSGFAAWVGEHLRSEPLRATWDAKRMEKELPEVRREDVRPFDLAVYPGHVMVVLAVSHAGRVVVIGASGGNFEVTTKEYAERVGAMVKVINRPDYRPDFIGFRRW
Physico‐chemical
properties
protein length:149 AA
molecular weight:16764,9 Da
isoelectric point:6,97
hydropathy:-0,29
Representative Protein Details
Accession
40W37
Protein name
40W37
Sequence length
175 AA
Molecular weight
18852,26540 Da
Isoelectric point
9,29797
Sequence
MLLLRARIVAESLNLKGTPYLWGAKGGQRLTADYLAWAKGPDNIGWRPEWAGSGKPSQWEPSPGGFALDCSGAFGEAIKRAGGPNLDAWHTGRYWQDLPPIGSPLPGDAALYIGHIEMVIGTVDGLVVVGGSSGGDSKTTTLEIASNRRAMYKVKQTHLYRPDFRGFRSIVPFLG
Other Proteins in cluster: phalp2_1556
Total (incl. this protein): 2 Avg length: 162,0 Avg pI: 8,13

Protein ID Length (AA) pI
40W37 175 9,29797
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26078
7s07Z
53 40,6% 177 8.135E-29
2 phalp2_33743
ZDk4
1 33,8% 124 2.515E-16
3 phalp2_3467
4b9No
1 25,9% 166 1.280E-09
4 phalp2_4609
4iAfK
3 28,1% 135 3.686E-08
5 phalp2_29063
6QfOM
447 31,2% 131 3.097E-07
6 phalp2_27807
7Tiwg
23 31,4% 159 7.691E-07
7 phalp2_25998
6KgfU
2 27,5% 138 2.579E-06
8 phalp2_336
6VNQk
11 27,2% 158 1.574E-05
9 phalp2_13018
42lWN
9 29,2% 113 7.065E-05

Domains

Domains
Unannotated
Representative sequence (used for alignment): 40W37 (175 AA)
Member sequence: 4jTFG (149 AA)
1 175 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (40W37) rather than this protein.
PDB ID
40W37
Method AlphaFoldv2
Resolution 84.13
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50