Protein

Protein accession
4eR4S [EnVhog]
Representative
4EiET
Source
EnVhog (cluster: phalp2_30345)
Protein name
4eR4S
Lysin probability
90%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MRLFTALLGAILLAPVAFVLPARADQFENPWIETRLHPENGPARIGRGYRIHRPTYQPTELRPAAQLLAEAQRFEGSGKVTRYRGPWCRDFINLVASRAGYRLANYSRRAIDALHLGRRVADPQPGDLVVMRHHVTIFAGRAGGKIVGLGGNQGRRVRYSHFAPGRVVAFVRL
Physico-chemical properties
protein length: 173 AA
molecular weight: 19343.1 Da
isoelectric point: 11.52
hydropathy: -0.22
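The length and hydropathy values above can be recomputed from the protein sequence. The following is a minimal sketch, assuming the hydropathy figure is a GRAVY score (mean per-residue hydropathy) on the Kyte-Doolittle scale; the page does not state which scale it uses, and the names `KYTE_DOOLITTLE`, `SEQUENCE`, and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the page does not name one).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 4eR4S as listed above, split across lines only for readability.
SEQUENCE = (
    "MRLFTALLGAILLAPVAFVLPARADQFENPWIETRLHPENGPARIGRGYRIHRPTYQPTE"
    "LRPAAQLLAEAQRFEGSGKVTRYRGPWCRDFINLVASRAGYRLANYSRRAIDALHLGRRV"
    "ADPQPGDLVVMRHHVTIFAGRAGGKIVGLGGNQGRRVRYSHFAPGRVVAFVRL"
)


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


if __name__ == "__main__":
    print(f"protein length: {len(SEQUENCE)} AA")
    print(f"hydropathy (GRAVY): {gravy(SEQUENCE):.2f}")
```

Under this assumption the score rounds to the listed -0.22. Molecular weight and isoelectric point depend on the mass table and pKa set used, so they are left out of the sketch.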
Representative Protein Details
Accession
4EiET
Protein name
4EiET
Sequence length
167 AA
Molecular weight
17811.37480 Da
Isoelectric point
11.88618
Sequence
MPDPRLPRGALARLGFTLAIGALAVFAAAGPRIAHAGGRRGFSRHDAPAPIERAMRGGILAEASRWLGGGNPTPFREPWCRDFVNFVLRRAGHPLGDRSHLAITALRLGPRVADPAPGDLAVMHGHVAFFAGWDGADAFLALGGNQSRRVTIARFARRAVIAFVRPT
Other Proteins in cluster: phalp2_30345
Total (incl. this protein): 14; Avg length: 168.2 AA; Avg pI: 10.89

Protein ID Length (AA) pI
4EiET 167 11.88618
1AVTE 165 11.25007
1kPd3 167 11.63424
3g44Z 159 9.68575
3zjd2 170 9.84079
4I7ae 180 9.72694
4Tf15 156 11.18747
5IfmY 160 10.34977
5jv4 176 10.39761
6DFE4 172 10.13844
6R0uz 168 11.79889
6TE1S 168 11.41911
96rM 174 11.55539
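The cluster summary (total, average length, average pI) can be checked against the member table. This is a rough sketch that assumes the averages are plain arithmetic means over the 13 listed proteins plus this protein (4eR4S, 173 AA, pI 11.52); the variable names are illustrative.

```python
# (protein ID, length in AA, pI) for the 13 other cluster members listed above.
OTHER_MEMBERS = [
    ("4EiET", 167, 11.88618), ("1AVTE", 165, 11.25007),
    ("1kPd3", 167, 11.63424), ("3g44Z", 159, 9.68575),
    ("3zjd2", 170, 9.84079), ("4I7ae", 180, 9.72694),
    ("4Tf15", 156, 11.18747), ("5IfmY", 160, 10.34977),
    ("5jv4", 176, 10.39761), ("6DFE4", 172, 10.13844),
    ("6R0uz", 168, 11.79889), ("6TE1S", 168, 11.41911),
    ("96rM", 174, 11.55539),
]

# This protein, taken from the physico-chemical properties on this page.
THIS_PROTEIN = ("4eR4S", 173, 11.52)

members = OTHER_MEMBERS + [THIS_PROTEIN]
total = len(members)
avg_length = sum(length for _, length, _ in members) / total
avg_pi = sum(pi for _, _, pi in members) / total

if __name__ == "__main__":
    print(f"Total: {total}")
    print(f"Avg length: {avg_length:.1f} AA")
    print(f"Avg pI: {avg_pi:.2f}")
```

With these inputs the means round to 168.2 AA and 10.89, in line with the summary line.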
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_39960 (1CQ4o) 6 43.4% 161 8.334E-45
2 phalp2_21894 (4FzDB) 5 47.2% 108 2.032E-37
3 phalp2_38022 (5Ighg) 1 48.6% 113 3.814E-37
4 phalp2_4016 (13g00) 5 51.6% 122 9.806E-37
5 phalp2_30664 (6NcXd) 1 49.0% 110 2.596E-27
6 phalp2_966 (2cNUu) 3 36.0% 111 2.278E-20
7 phalp2_6010 (4N7su) 3 27.5% 127 7.195E-14
8 phalp2_28323 (16UXy) 9 31.9% 141 4.798E-11
9 phalp2_13899 (6YBY) 3 25.2% 123 1.348E-05

Domains

Representative sequence (used for alignment): 4EiET (167 AA)
Member sequence: 4eR4S (173 AA)
Domain positions follow the representative sequence above (axis: 1 to 167 AA); the member sequence bar is scaled to the same axis.
Regions shown: Disordered region, Unannotated
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4EiET) rather than this protein.
PDB ID
4EiET
Method AlphaFoldv2
Resolution 79.03
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
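The confidence bands above can be expressed as a small lookup. This is a sketch only: the legend uses strict inequalities, so the handling of scores that fall exactly on 90, 70, or 50 is an assumption here, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: exact boundary values (90, 70, 50) go to the band starting
    at that value from below the next threshold, since the legend leaves
    them undefined.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt >= 70:
        return "High"
    if plddt >= 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 79.03, 60.0, 42.0):
        print(score, plddt_band(score))
```

If the 79.03 listed under Resolution is in fact a mean pLDDT for the AlphaFold model (an assumption; the page does not say so), it would fall in the High band.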