Protein

Protein accession
4eD8P [EnVhog]
Representative
4eGzk
Source
EnVhog (cluster: phalp2_39292)
Protein name
4eD8P
Lysin probability
99%
PhaLP type
endolysin
Probability: 90% (predicted by ML model)
Protein sequence
MSKTPAFDLAPTPVARRLLELYQRDLGVHEEPSGANKGPLIDKYLQGYKGRYGNRYLKGAMWCALACEYLAREAYDQLDITPCPLDEWHGLGGASGWLRFAEKFGAMVPLSKALPGDVAVIYNPQTDRGHVCVIALPTSATSIQTMDGNHGDTLSWADRPPRDFAGFVRLPFAG
Physico-chemical properties
protein length: 174 AA
molecular weight: 19098,6 Da
isoelectric point: 6,41
hydropathy: -0,26
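The length and hydropathy values above are derived from the protein sequence. The following is a minimal sketch of how such values could be computed, assuming the hydropathy figure is a GRAVY score (the mean Kyte-Doolittle hydropathy per residue); the record does not state which scale the database uses, and the function names `sequence_length` and `gravy` are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (assumption: the record does not say
# which scale was used for its "hydropathy" value).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def sequence_length(seq: str) -> int:
    """Number of residues in a one-letter amino acid sequence."""
    return len(seq.strip())


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (GRAVY score)."""
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence has a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the protein sequence, for illustration only.
    fragment = "MSKTPAFDLA"
    print(sequence_length(fragment), round(gravy(fragment), 2))
```

Passing the full sequence from the record to both functions gives values that can be compared with the reported length and hydropathy; a mismatch would suggest the database uses a different scale.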
Representative Protein Details
Accession
4eGzk
Protein name
4eGzk
Sequence length
173 AA
Molecular weight
18703,29090 Da
Isoelectric point
6,03041
Sequence
MPATLVAPGWRCAPSEAHPLLVQVLLDAVACIGMHEEPLGSNQGPQVSTWLRLASATPGDPWCASFATALYQRIDPAPIPRLASAYKIYQWAKERGLLVPDGAPVLPGDICGLFHEDNPATLARENFRGHVGVVTADLGDAKIATVEGNVHSFVLGLVHLRADWQWFARPVRL
Other Proteins in cluster: phalp2_39292
Total (incl. this protein): 3; Avg length: 182,7 AA; Avg pI: 6,70

Protein ID Length (AA) pI
4eGzk 173 6,03041
8iDlL 201 7,66823
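The cluster averages (182,7 AA and pI 6,70) can be checked against the three member values in this record. The sketch below assumes the averages are plain arithmetic means over all three proteins, including this one (4eD8P, 174 AA, pI 6,41); the helper `parse_decimal_comma` is hypothetical and only converts the decimal-comma notation used on this page.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma number such as '6,41' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as written in the record)
members = [
    ("4eD8P", 174, "6,41"),
    ("4eGzk", 173, "6,03041"),
    ("8iDlL", 201, "7,66823"),
]

avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(parse_decimal_comma(pi) for _, _, pi in members) / len(members)

print(f"{avg_length:.1f}")  # 182.7
print(f"{avg_pi:.2f}")      # 6.70
```

Both results agree with the reported averages, which is consistent with the assumption that the means include the protein described on this page.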
Similar Clusters (pHMM search)
#   Cluster (protein)       # Members  Identity (%)  Alignment Length  E-value
1   phalp2_13408 (4tHhZ)    205        36,7%         128               7.508E-13
2   phalp2_31622 (4B8wf)    29         34,4%         122               4.791E-12
3   phalp2_28163 (fkdY)     23         28,7%         132               2.625E-10
4   phalp2_38301 (gnom)     27         26,9%         141               1.655E-09
5   phalp2_29792 (1Qqqg)    22         33,1%         163               2.248E-09
6   phalp2_21220 (1bkE9)    22         29,3%         150               2.596E-08
7   phalp2_21470 (8aO9w)    1          31,0%         164               4.777E-08
8   phalp2_743 (1NQdC)      55         30,0%         143               5.446E-07
9   phalp2_33133 (4NTMg)    12         28,8%         135               7.376E-07
10  phalp2_9091 (5CU5d)     533        32,7%         119               9.988E-07

Domains

Unannotated
Representative sequence (used for alignment): 4eGzk (173 AA)
Member sequence: 4eD8P (174 AA)
Axis range: 1 to 173 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

        Name                     Taxonomy ID            Lineage
Phage   Unknown from Metagenome  UNKNOWN_ENVHOG [NCBI]  No lineage information
Host    No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4eD8P
Method AlphaFoldv2
Resolution 88.43
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
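The confidence bands in the legend above can be expressed as a small lookup. This is a minimal sketch; the function name `plddt_category` is hypothetical, and the handling of the exact boundary values (50, 70, 90) is an assumption, since the legend only gives strict inequalities.

```python
def plddt_category(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: boundary values fall into the lower band, e.g. a score of
    exactly 90 is reported as "High". The legend does not specify this.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.0, 80.0, 60.0, 30.0):
        print(value, plddt_category(value))
```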

The structures below correspond to the cluster representative (4eGzk) rather than this protein.
PDB ID
4eGzk
Method AlphaFoldv2
Resolution 93.62
Chain position -