Protein

Protein accession
4dStU [EnVhog]
Representative
16EwI
Source
EnVhog (cluster: phalp2_35571)
Protein name
4dStU
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MADLNLSMPYVLQNEGGYTVDDGGPTNFGIVEQDLATYRGIPVSTITAEDIKNLTVAEATAIYAKEYWQPMGLSAVNDQSIATAIFDAGVNMGIGTGARLAQRVVGATGDGIIGPNTLLAINTWTRANFIPPFANAVLTHYQGIVASDPGKYSRYFPGWSARAKRLLTLA
Physico‐chemical
properties
protein length:170 AA
molecular weight:18100,3 Da
isoelectric point:4,98
hydropathy:0,05
Representative Protein Details
Accession
16EwI
Protein name
16EwI
Sequence length
242 AA
Molecular weight
26711,66410 Da
Isoelectric point
9,28449
Sequence
LRKVKSTRLNCPACGHLWSVKGRAAAPRPRWAKQRFIKVNGVVKRCRNCKAAENIRIHDFEFEGGGSMAKIEGSIDYVLENEGGFTIDDGGPTMWGITIPDVAQYRKVPESSITIEDMKNLPKMEACEIYRELYWQKLSLDQVAAQNVATAIFDIGVNRGVSTSAKYAQRACKTLGLSVAVDGVMGPNTIAAVNMAKPSQMVRTLEGLDMAGYLAIVAIDPEKYGRYLKGWEARAQRLLTLV
Other Proteins in cluster: phalp2_35571
Total (incl. this protein): 3 Avg length: 195,7 Avg pI: 6,43

Protein ID Length (AA) pI
16EwI 242 9,28449
6XVSF 175 5,01868
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_12184
6C2VC
988 35,4% 175 2.215E-30
2 phalp2_28839
4XJZ6
27 35,8% 170 6.719E-29
3 phalp2_13016
426A1
31 37,5% 173 3.767E-27
4 phalp2_27064
2jB8e
50 36,4% 181 5.133E-27
5 phalp2_37851
4LyyS
5845 36,0% 172 2.410E-26
6 phalp2_40381
3TbBc
289 34,6% 179 6.091E-26
7 phalp2_27351
4yIsn
3 31,9% 172 1.130E-25
8 phalp2_23735
PTzZ
309 37,1% 175 1.539E-25
9 phalp2_13619
5tgzY
5 33,8% 177 1.539E-25
10 phalp2_30611
6uqQp
1606 34,6% 173 4.585E-24

Domains

Domains
Disordered region
GH108
Representative sequence (used for alignment): 16EwI (242 AA)
Member sequence: 4dStU (170 AA)
1 242 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05838|PF09374

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4dStU
Method AlphaFoldv2
Resolution 96.01
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (16EwI) rather than this protein.
PDB ID
16EwI
Method AlphaFoldv2
Resolution 77.15
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50