Protein

Protein accession
4EG7I [EnVhog]
Representative
15ILp
Source
EnVhog (cluster: phalp2_10252)
Protein name
4EG7I
Lysin probability
96%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTPKAYLYATYPGQARRLDCMIQGESSWLPWARSGPYLGLAQFDLTTWYETPQGKAGASRTDPIASIDAMAWGVSHLGYGRWPQTSRRC
Physico-chemical properties
protein length: 89 AA
molecular weight: 9973,2 Da
isoelectric point: 9,10
hydropathy: -0,48
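The page does not say how the hydropathy value was derived. As a minimal sketch, assuming it is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues), the length and hydropathy listed above can be recomputed from the sequence. The names `KD`, `SEQ` and `gravy` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy index per amino acid (assumed scale).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 4EG7I as listed on this page.
SEQ = (
    "MTPKAYLYATYPGQARRLDCMIQGESSWLPWARSGPYLGLAQFDLTTWYETPQGKAGASRTDPIASIDAM"
    "AWGVSHLGYGRWPQTSRRC"
)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues of seq."""
    return sum(KD[aa] for aa in seq) / len(seq)


print(len(SEQ), "AA")
print(round(gravy(SEQ), 2))
```

Under this assumption the result agrees with the listed length and hydropathy. Molecular weight and isoelectric point depend on the residue mass table and pKa set used, so they are left out of the sketch.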
Representative Protein Details
Accession
15ILp
Protein name
15ILp
Sequence length
126 AA
Molecular weight
13908,70460 Da
Isoelectric point
6,16501
Sequence
MSVLLSLSLLLAAVDVDGQQPVPDLATPVIEEVTYTSPRTYLYATYPSLARRLDCMIRLESEWVWAASGAGGRYIGLAQFDRLTWASTPQGQAGLSRTDPYASIDAMAWGVRNLGWGRWPISSRRC
Other Proteins in cluster: phalp2_10252
Total (incl. this protein): 6; Avg length: 117,7; Avg pI: 5,81

Protein ID Length (AA) pI
15ILp 126 6,16501
1NDwE 140 4,86368
1gvXO 122 5,11752
6AkPW 116 5,10149
6HXTk 113 4,51020
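The cluster summary can be cross-checked against the member table. The sketch below assumes the averages are plain arithmetic means over the five listed members plus this protein (4EG7I, 89 AA, pI 9,10), and that values use a decimal comma. `parse_decimal` and `members` are illustrative names.

```python
def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '6,16501' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as printed on the page)
members = [
    ("15ILp", 126, "6,16501"),
    ("1NDwE", 140, "4,86368"),
    ("1gvXO", 122, "5,11752"),
    ("6AkPW", 116, "5,10149"),
    ("6HXTk", 113, "4,51020"),
    ("4EG7I", 89, "9,10"),  # this protein; its pI is only given to two decimals
]

avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(parse_decimal(pi) for _, _, pi in members) / len(members)

print(round(avg_length, 1))
print(avg_pi)
```

Because the pI of 4EG7I is rounded on the page, the recomputed average pI can differ from the listed value in the last digit.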
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_16381 (5J8Yf) 65 36,8% 144 2.060E-22
2 phalp2_6179 (5IJQz) 1 35,7% 98 3.346E-08
3 phalp2_23113 (4Bvql) 9 33,3% 84 6.252E-08
4 phalp2_39927 (1oh6N) 2 33,3% 87 5.559E-07
5 phalp2_20227 (8owGA) 9 27,7% 83 6.715E-06
6 phalp2_27275 (49OSl) 3 28,0% 82 1.097E-04
7 phalp2_15355 (1BWpH) 3 24,0% 100 1.495E-04
8 phalp2_17929 (YAbX) 32 26,3% 110 2.776E-04
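When working with the pHMM hits outside the page, the E-values parse directly as floats and can be filtered against a cutoff. This is a sketch only: the cutoff `E_CUTOFF = 1e-5` is an arbitrary illustrative choice, not a threshold used by the database.

```python
# (cluster, E-value as printed) for the eight hits listed above.
hits = [
    ("phalp2_16381", "2.060E-22"),
    ("phalp2_6179", "3.346E-08"),
    ("phalp2_23113", "6.252E-08"),
    ("phalp2_39927", "5.559E-07"),
    ("phalp2_20227", "6.715E-06"),
    ("phalp2_27275", "1.097E-04"),
    ("phalp2_15355", "1.495E-04"),
    ("phalp2_17929", "2.776E-04"),
]

E_CUTOFF = 1e-5  # hypothetical cutoff chosen for illustration

# Keep hits at or below the cutoff, sorted from most to least significant.
significant = sorted(
    (name for name, e in hits if float(e) <= E_CUTOFF),
    key=lambda name: float(dict(hits)[name]),
)

print(significant)
```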

Domains

Disordered region
Unannotated
Representative sequence (used for alignment): 15ILp (126 AA)
Member sequence: 4EG7I (89 AA)
Axis: 1 to 126 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome [NCBI] UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (15ILp) rather than this protein.
PDB ID 15ILp
Method AlphaFoldv2
Resolution 86.73
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
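The confidence legend maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows; the legend uses strict inequalities, so assigning the exact boundary values (90, 70, 50) to the lower band is an assumption made here, and `plddt_band` is an illustrative name.

```python
def plddt_band(score: float) -> str:
    """Return the confidence band for a pLDDT score in the range 0 to 100."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:  # boundary value 90 falls here by assumption
        return "High"
    if score > 50:  # boundary value 70 falls here by assumption
        return "Low"
    return "Very low"


for value in (95.0, 80.0, 60.0, 30.0):
    print(value, plddt_band(value))
```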