Protein

Protein accession
6QUM2 [EnVhog]
Representative
8DxGz
Source
EnVhog (cluster: phalp2_28187)
Protein name
6QUM2
Lysin probability
88%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MFRYLPARTLAVCLASLGVILVGVTVAAEPRPIHEGIQGPFHHAERGEQVQVNPPPPPPTTVSERVWTGSIPAYYGGGSGCTQANANLIARAMWDVGANDQQVSRMLNIVSRESGCDSSQRNLNSRTRDDSWGFCQINVLAGFFRSNGLLAGFNRWAFADDPRSNARACAALYAQCGFGPWNYGNYYCRRP
Physico‐chemical
properties
protein length:191 AA
molecular weight:20882,3 Da
isoelectric point:8,78
hydropathy:-0,27
Representative Protein Details
Accession
8DxGz
Protein name
8DxGz
Sequence length
168 AA
Molecular weight
18948,43740 Da
Isoelectric point
7,66448
Sequence
mtdrtvsilcvlwavlaiiliadiahaeprpihegihgpfaqaerglqrqvsppptqpvppptdwdcaswkplldkygmpyelfqpvmyresrctnarnynprtrddsygplqvnrygsldagwnsvgisrsymatpegavhaasmlyhsceslgpwtkpysckkwlf
Other Proteins in cluster: phalp2_28187
Total (incl. this protein): 9 Avg length: 215,1 Avg pI: 7,49

Protein ID Length (AA) pI
8DxGz 168 7,66448
23n3x 189 7,74331
2Lct4 254 7,59013
2nIps 245 6,50365
6Nat0 228 6,70384
6OMbj 200 8,21148
7ZU4l 234 5,38813
8EBjg 227 8,80756
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_22319
daJf
1 45,1% 124 5.594E-25
2 phalp2_35584
1aKpM
156 27,3% 117 7.508E-06
3 phalp2_37443
2Ld6Z
8 31,1% 122 4.599E-05
4 phalp2_13239
2ZQKU
4 25,2% 115 1.533E-04
5 phalp2_21926
4Mbf8
14 27,1% 114 3.772E-04

Domains

Domains
Disordered region
Unannotated
Representative sequence (used for alignment): 8DxGz (168 AA)
Member sequence: 6QUM2 (191 AA)
1 168 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (8DxGz) rather than this protein.
PDB ID
8DxGz
Method AlphaFoldv2
Resolution 81.09
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50