Protein

Protein accession
4I1DQ [EnVhog]
Representative
5sNZI
Source
EnVhog (cluster: phalp2_16340)
Protein name
4I1DQ
Lysin probability
99%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
VIKQIVLGLFILCTLAPAIPFTYYIPNRLYTVELPAKKIIIKYPIPPPWFNKELLSYISKECEKYNVPVVLVHKLIEQESQWHSNSPSPNYDKHHRILSIDYGLMQINSDNIERFSHNYKEPWRSEKSYNPIHNSWDNVHLGICYLRDLYTQFGNWKNAVAAYNGGTRRVINNTLKKSTQEYVNIVCPVNDWWLTLPTNYVCLAGNQDSLQQKTE
Physico‐chemical
properties
protein length:215 AA
molecular weight:25173,5 Da
isoelectric point:8,69
hydropathy:-0,40
Representative Protein Details
Accession
5sNZI
Protein name
5sNZI
Sequence length
196 AA
Molecular weight
23021,98940 Da
Isoelectric point
8,98259
Sequence
MTLAPAIPFTYYIPNRLYTVELPVKKIIIKYPIPPAWFNKDLLNYISKECKKNNVPVLLVHKLIEQESQWHSNSPSPNYDRHHNLLSIDYGLMQINSDNLERFAHAYKEPWRSEKSYNPMHNSWDNVHFGICYLRDLYLQFGNWKDAVAAYNGGTKRVKNNTLKKSTQEYVNTVCPVTDWWLTLPTNYVSPTTQIN
Other Proteins in cluster: phalp2_16340
Total (incl. this protein): 2 Avg length: 205,5 Avg pI: 8,84

Protein ID Length (AA) pI
5sNZI 196 8,98259
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28314
15Ckr
15 35,9% 142 2.902E-17
2 phalp2_20578
4B877
3 31,4% 140 2.930E-11
3 phalp2_24168
2jDYd
5 35,7% 123 9.905E-11
4 phalp2_40442
4hYv9
4 33,5% 140 1.523E-09
5 phalp2_7270
3WQrb
2 30,3% 132 2.078E-06
6 phalp2_9430
lDQe
5 26,5% 143 2.078E-06
7 phalp2_14554
4Uuve
2 24,6% 142 2.078E-06
8 phalp2_35020
7Cajn
1 30,0% 130 5.082E-06
9 phalp2_4037
19be8
11 28,0% 146 4.062E-05
10 phalp2_29294
tD4R
30 28,6% 129 1.781E-04

Domains

Domains
Disordered region
SLT
Representative sequence (used for alignment): 5sNZI (196 AA)
Member sequence: 4I1DQ (215 AA)
1 196 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5sNZI) rather than this protein.
PDB ID
5sNZI
Method AlphaFoldv2
Resolution 85.59
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50