Protein

Protein accession
8GhPp [EnVhog]
Representative
3rGbW
Source
EnVhog (cluster: phalp2_32902)
Protein name
8GhPp
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
mlddtniikdlqpdlqkiingiysmceldfkivegkvskeresylwykgevqqrqyariygaavtavvyidnipcfskdiyqeladgfrysaqnsgmgiywggainssgfvdiskdgrliqdiyeecynslintgkefnpslqyfqlaee
Physico-chemical properties
protein length: 150 AA
molecular weight: 17191.1 Da
isoelectric point: 4.56
hydropathy: -0.34
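The properties listed above can be approximated directly from the sequence. The following is a minimal sketch, assuming that "hydropathy" refers to the GRAVY score (mean Kyte-Doolittle hydropathy) and that molecular weight is the sum of average residue masses plus one water molecule. The record does not state which scales or mass tables were used, so small deviations from the reported values are possible. The function names `gravy` and `molecular_weight` are hypothetical.

```python
# Kyte-Doolittle hydropathy index per residue (assumed scale; not confirmed by the record).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (free amino acid minus one water).
MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # mass of one water molecule, added once per chain


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    return sum(KD[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Sum of average residue masses plus one water for the chain termini."""
    seq = seq.upper()
    return sum(MASS[aa] for aa in seq) + WATER


# Member sequence 8GhPp as given in the record.
member = (
    "mlddtniikdlqpdlqkiingiysmceldfkivegkvskeresylwykgevqqrqyariy"
    "gaavtavvyidnipcfskdiyqeladgfrysaqnsgmgiywggainssgfvdiskdgrli"
    "qdiyeecynslintgkefnpslqyfqlaee"
)

print("length:", len(member), "AA")
print("molecular weight:", round(molecular_weight(member), 1), "Da")
print("hydropathy (GRAVY):", round(gravy(member), 2))
```

The isoelectric point is left out because it depends on the chosen pKa set, which the record does not specify.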
Representative Protein Details
Accession
3rGbW
Protein name
3rGbW
Sequence length
152 AA
Molecular weight
17421.26230 Da
Isoelectric point
4.70140
Sequence
MTETEAEIVKQLQPNLQEVLKEAKRLSEIEFLIVEGKRTKERHDKLGYLGEDWSSGEARFFGCAVTLLLYSDDIPCFSEKPYWDLADSIRYGAQNAGDIGMNWGGAKDRNGYVNITKWDGTIEDLTYGCYGDLLEASKDYRPVYQYFQLHAE
Other Proteins in cluster: phalp2_32902
Total (incl. this protein): 11
Avg length: 153.4
Avg pI: 4.66

Protein ID  Length (AA)  pI
3rGbW       152          4.70140
1UIiA       152          5.03869
3CvMH       151          4.56903
3rK1p       159          4.48138
4RDaa       152          4.90222
7p9zp       152          5.07052
8DwXk       158          4.48081
8aGNh       149          4.44290
8ylvP       149          4.44290
e3C4        163          4.61660
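The cluster averages can be checked against the table. This is a small sketch, assuming the averages cover the ten listed proteins plus this protein (8GhPp, 150 AA, pI 4.56 as reported in its own property list); the record does not say how the averages were computed.

```python
from statistics import mean

# (protein ID, length in AA, pI): the ten listed members plus 8GhPp itself.
cluster = [
    ("3rGbW", 152, 4.70140),
    ("1UIiA", 152, 5.03869),
    ("3CvMH", 151, 4.56903),
    ("3rK1p", 159, 4.48138),
    ("4RDaa", 152, 4.90222),
    ("7p9zp", 152, 5.07052),
    ("8DwXk", 158, 4.48081),
    ("8aGNh", 149, 4.44290),
    ("8ylvP", 149, 4.44290),
    ("e3C4", 163, 4.61660),
    ("8GhPp", 150, 4.56),  # this protein; pI only known to two decimals
]

avg_length = mean(length for _, length, _ in cluster)
avg_pi = mean(pi for _, _, pi in cluster)

print(len(cluster), round(avg_length, 1), round(avg_pi, 2))  # 11 153.4 4.66
```

Under that assumption the result agrees with the totals reported for the cluster.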
Similar Clusters (pHMM search)
#  Cluster               # Members  Identity (%)  Alignment Length  E-value
1  phalp2_39844 (X0XM)   14         35.3          150               3.400E-23
2  phalp2_32333 (8DxzU)  1          38.2          102               4.655E-23
3  phalp2_17891 (B74m)   41         28.2          156               3.784E-21
4  phalp2_13949 (8Fkan)  8452       28.5          105               5.788E-11
5  phalp2_1400 (1mRwO)   571        30.9          110               6.929E-10
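When working with the similar-cluster hits programmatically, a common step is to keep only those below an E-value cutoff. The sketch below is illustrative: the `Hit` type, the `significant` helper, and the 1e-20 threshold are all hypothetical choices, not something the database prescribes.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    cluster: str
    members: int
    identity_pct: float
    aln_length: int
    evalue: float


# Rows copied from the pHMM search table.
hits = [
    Hit("phalp2_39844", 14, 35.3, 150, 3.400e-23),
    Hit("phalp2_32333", 1, 38.2, 102, 4.655e-23),
    Hit("phalp2_17891", 41, 28.2, 156, 3.784e-21),
    Hit("phalp2_13949", 8452, 28.5, 105, 5.788e-11),
    Hit("phalp2_1400", 571, 30.9, 110, 6.929e-10),
]


def significant(rows, max_evalue=1e-20):
    """Return hits at or below the cutoff, ordered from strongest to weakest."""
    kept = [h for h in rows if h.evalue <= max_evalue]
    return sorted(kept, key=lambda h: h.evalue)


strong = significant(hits)
for h in strong:
    print(h.cluster, h.evalue)
```

With this arbitrary cutoff, the first three clusters in the table are retained.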

Domains

Unannotated
Representative sequence (used for alignment): 3rGbW (152 AA)
Member sequence: 8GhPp (150 AA)
Axis: positions 1 to 152 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

       Name                     Taxonomy ID     Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3rGbW) rather than this protein.
PDB ID: 3rGbW
Method: AlphaFoldv2
Resolution: 95.39
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
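The confidence bands above map a pLDDT score to a label. Below is a minimal sketch of that mapping. The legend uses strict inequalities and does not say where the exact boundary values (90, 70, 50) fall, so assigning them to the lower band is an assumption, as is the function name `plddt_band`. Applying it to 95.39 further assumes that the value shown under "Resolution" is a mean pLDDT, which the record does not state.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence label used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:   # assumption: exactly 90 is treated as "High"
        return "High"
    if score > 50:   # assumption: exactly 70 is treated as "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 is treated as "Very low"


print(plddt_band(95.39))  # Very high
```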