Protein

Protein accession
2ab5Z [EnVhog]
Representative
5tgzY
Source
EnVhog (cluster: phalp2_13619)
Protein name
2ab5Z
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VTNLDVAFHYLIEFSEGWEYVNDPDDPGGPTKFGITKKTYENFFGVCVPDREIELMSPAIAKQIYAAEYWAPLALGCVQNLGFAVAFFDTCVLYGVRTTGLMIQRALSRGGATLKLDGIIGDKSLGFINLAGGGSPSAQLQLMKAFQGQIMDRIDAVIVAKPTSEKYRRGWTNRADRLLNLLDEEYVNKLKNQVTQELS
Physico‐chemical
properties
protein length:199 AA
molecular weight:22023,0 Da
isoelectric point:5,26
hydropathy:-0,08
Representative Protein Details
Accession
5tgzY
Protein name
5tgzY
Sequence length
182 AA
Molecular weight
20188,55460 Da
Isoelectric point
5,04971
Sequence
MNEFFERAFVYLFQNEGSTFTDDPSDSGGPTKFGVTQKAYEHWLGHSVDVSEIKNMSLDMAKQFYFECYWKAVSCDKLTSLAISTAIFDSAVLYGIANAALMAQKAANSLGCTLKIDGILGDKSTESLNSLGDEDFIRAFSAMVFARIEWIVQVNPKNEKYRDGWVNRGTRLLTLTGGDGKT
Other Proteins in cluster: phalp2_13619
Total (incl. this protein): 5 Avg length: 186,8 Avg pI: 6,40

Protein ID Length (AA) pI
5tgzY 182 5,04971
6TJMm 182 8,46136
6XOF0 185 4,99992
j4yh 186 8,20594
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_20399
37U0b
4 34,1% 167 4.597E-30
2 phalp2_23735
PTzZ
309 34,9% 169 1.057E-28
3 phalp2_35571
16EwI
3 34,4% 177 1.446E-28
4 phalp2_12184
6C2VC
988 30,5% 170 5.066E-28
5 phalp2_13016
426A1
31 33,9% 171 5.066E-28
6 phalp2_38528
1E3i8
441 26,9% 189 1.589E-26
7 phalp2_25028
ZsUq
3460 29,5% 176 9.285E-25
8 phalp2_27351
4yIsn
3 32,3% 167 1.269E-24
9 phalp2_28839
4XJZ6
27 28,8% 170 1.735E-24
10 phalp2_1444
1DO8y
138 29,7% 185 3.243E-24

Domains

Domains
Representative sequence (used for alignment): 5tgzY (182 AA)
Member sequence: 2ab5Z (199 AA)
1 182 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05838

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
2ab5Z
Method AlphaFoldv2
Resolution 96.32
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (5tgzY) rather than this protein.
PDB ID
5tgzY
Method AlphaFoldv2
Resolution 93.81
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50