Protein

Protein accession
4b2Zh [EnVhog]
Representative
T712
Source
EnVhog (cluster: phalp2_36417)
Protein name
4b2Zh
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
SAKYSAEATDIEVVQTAMAKSGEILVANTPVKSAPASEVPEDWTQDDAFIERIGKLSKSIGCDPVDMLACMAFETGRTFKPDQRNSIGATGLIQFLSKTATDLGTTTNELAALTRAQQCDWVEKYFKKGPLRNVSNPSLEDLYMAILWPAAVGKSNDYVIFTAGTKAYSQNNGLDIDNKGYITKQDAATKVREQIPYVIKQLAKAGINL
Physico‐chemical
properties
protein length:209 AA
molecular weight:22720,5 Da
isoelectric point:5,32
hydropathy:-0,27
Representative Protein Details
Accession
T712
Protein name
T712
Sequence length
227 AA
Molecular weight
24874,96230 Da
Isoelectric point
8,98839
Sequence
MSRVPMHEPWSQHENINPTQFSPFATDVLTGSTTAGRQNQLPTANSATSVTPANIPADWTKDDAFIKKVNSVAARLNVSPADLLACMAFETGRTFDPSLRNRIGATGLIQFIPKTAIGLGTTTDYLASLTRVQQMDWVEKYFLAGPIARVKNPTLEDLYMAILWPVAVGQSNDYVMWRAGSIQYQQNPLDVGGKGYITKADASSYVRRQIPYVQSQLATIKTGTAFA
Other Proteins in cluster: phalp2_36417
Total (incl. this protein): 4 Avg length: 231,0 Avg pI: 7,25

Protein ID Length (AA) pI
T712 227 8,98839
1eWKm 225 5,91622
31ah8 263 8,78796
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_38137
6S6gW
387 51,9% 152 8.565E-52
2 phalp2_544
AVmF
15 41,4% 176 1.367E-38
3 phalp2_27366
4D2kq
9 46,4% 153 1.869E-38
4 phalp2_31970
4XiKL
2 39,8% 188 3.795E-36
5 phalp2_37001
moMy
1 37,1% 221 1.178E-34
6 phalp2_38395
XYcA
8 39,7% 176 2.668E-33
7 phalp2_26552
7YsQE
123 37,1% 148 2.236E-29
8 phalp2_11161
5ECat
11 38,0% 171 5.685E-29
9 phalp2_34411
4p3Yl
41 34,5% 168 1.059E-28
10 phalp2_38131
6R13H
1 45,2% 146 6.834E-28

Domains

Domains
Disordered region
Unannotated
Representative sequence (used for alignment): T712 (227 AA)
Member sequence: 4b2Zh (209 AA)
1 227 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4b2Zh
Method AlphaFoldv2
Resolution 86.32
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (T712) rather than this protein.
PDB ID
T712
Method AlphaFoldv2
Resolution 80.59
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50