Protein

Protein accession
k1GS [EnVhog]
Representative
1Iy94
Source
EnVhog (cluster: phalp2_33874)
Protein name
k1GS
Lysin probability
97%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MEVILRSGQVISLKDYQKVNKLDSTKLSDNFSVSEFDCNGELVIAEPLIHFLELFRATIKRPVKINSGYRTKDYQEDLKKTNKGAATNSPHTVGMAADIDTYTRKESEEFAKIILDLASKHSYDVRVGWKQYSRLGSTFVHIDICPMYYGEGKPYAKDKVADAWRIKNLVW
Physico‐chemical
properties
protein length:171 AA
molecular weight:19573,1 Da
isoelectric point:8,72
hydropathy:-0,47
Representative Protein Details
Accession
1Iy94
Protein name
1Iy94
Sequence length
209 AA
Molecular weight
23830,22030 Da
Isoelectric point
9,19430
Sequence
MSDKIKMMLRDGSVIPLDQWQKRNGLQAGSNEIGRYFRSGESRFRQDIEEYGEVIINEMLLKVLDGYRQAIGQPVFLNSFNRNEAKQAQLKKAGFKAASKSPHVVKLAADVDTPGLNEIIHKRGYNAVVMTPELQKEARLEMMRINYDSAITMKQVAAALQIKVRIGHKQYLEIGQTFIHVDVCPEYYAPGKPFHKLGHPLAWEESITW
Other Proteins in cluster: phalp2_33874
Total (incl. this protein): 25 Avg length: 185,2 Avg pI: 7,81

Protein ID Length (AA) pI
1Iy94 209 9,19430
1IJVe 184 9,04590
1IbUh 194 6,45261
1Iv67 182 6,65666
1KrKP 169 7,68454
233T 181 9,02488
2TsKl 172 6,58795
2afPm 187 8,58630
2tHuo 227 5,08558
3eNfn 167 9,36759
3nssP 191 9,41769
3tQl 183 9,21261
4Apj 182 7,10598
4BZS6 186 9,76646
4Mu6l 180 8,51674
4fZBq 186 6,24367
4jUVN 179 5,32379
5cn1q 180 5,51835
5sQuh 175 8,62834
6JNd 168 8,98465
7czu4 217 9,27992
8fLic 186 6,70992
bszZ 193 7,73700
hL60 181 6,51065
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_4607
4inks
50 23,2% 168 7.731E-17
2 phalp2_30673
6Po7g
16 23,5% 225 2.886E-11
3 phalp2_38763
1dS3s
1408 21,8% 169 2.110E-04

Domains

Domains
Unannotated
Representative sequence (used for alignment): 1Iy94 (209 AA)
Member sequence: k1GS (171 AA)
1 209 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1Iy94) rather than this protein.
PDB ID
1Iy94
Method AlphaFoldv2
Resolution 92.88
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50