Protein

Protein accession
2SEok [EnVhog]
Representative
1p4NQ
Source
EnVhog (cluster: phalp2_39934)
Protein name
2SEok
Lysin probability
88%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTRLRLSLAALVAVVALAACTPDEVQFWVERAQEAEAAGAWCPNAAGGIIAYGLPPEFDVIAWRESRCDPEAVNSSSGALGLVQIMPFWLAALCPAGIACTADDLLRGDVNLEAARYVFDRQGFDAWSQTAP
Physico‐chemical
properties
protein length:132 AA
molecular weight:14068,9 Da
isoelectric point:4,27
hydropathy:0,30
Representative Protein Details
Accession
1p4NQ
Protein name
1p4NQ
Sequence length
132 AA
Molecular weight
14120,13740 Da
Isoelectric point
4,49343
Sequence
MRRALGALAACLVLSACTPELAALVAWSEQQEADAAAAGYPCAIYADDLAWRGLPAHFMWVIQRESMCQPWAVNHSSGALGLTQILPMWLPSLCGAGIACTRDDLLMPEANLDAAAYVFAIQGPQAWSQTWG
Other Proteins in cluster: phalp2_39934
Total (incl. this protein): 5 Avg length: 135,6 Avg pI: 4,47

Protein ID Length (AA) pI
1p4NQ 132 4,49343
1jp06 133 4,20173
1p7D4 131 4,65303
4ktTq 150 4,75625
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_11954
2SESI
12 33,9% 168 1.787E-18
2 phalp2_8391
CgjB
207 36,1% 94 7.808E-17
3 phalp2_10602
5BOvL
251 36,8% 103 1.070E-16
4 phalp2_35584
1aKpM
156 28,0% 121 1.304E-09
5 phalp2_8459
165nm
626 28,5% 112 3.330E-09
6 phalp2_13239
2ZQKU
4 24,0% 179 4.551E-09
7 phalp2_26606
8tBUU
34 27,5% 149 6.218E-09
8 phalp2_26521
7XVtu
369 29,2% 113 1.029E-07
9 phalp2_39939
1qdcG
3 24,5% 106 1.689E-06
10 phalp2_39980
1L23W
13 32,6% 92 3.142E-06

Domains

Domains
Representative sequence (used for alignment): 1p4NQ (132 AA)
Member sequence: 2SEok (132 AA)
1 132 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
2SEok
Method AlphaFoldv2
Resolution 90.11
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (1p4NQ) rather than this protein.
PDB ID
1p4NQ
Method AlphaFoldv2
Resolution 90.01
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50