Protein

Protein accession
23us2 [EnVhog]
Representative
5D1s
Source
EnVhog (cluster: phalp2_1136)
Protein name
23us2
Lysin probability
98%
PhaLP type
endolysin
Probability: 94% (predicted by ML model)
Protein sequence
MKHFDVINIILWTLVLLVLSFSLGRSLRAEPVAEESVPVVTEVADTLDVWQQLIMAIAFTESRFTTDALGTAGDTGILQLREIYVKEVNRLYGTEYTIQDAYDPEKSLEIFSLMQEHYNPDRDLATGIKYHNKSPFYAATVKQNMALIQRYEEFRKLLMNTEL
Physico‐chemical
properties
protein length:163 AA
molecular weight:18754,3 Da
isoelectric point:4,86
hydropathy:-0,06
Representative Protein Details
Accession
5D1s
Protein name
5D1s
Sequence length
186 AA
Molecular weight
21726,71460 Da
Isoelectric point
7,86090
Sequence
MGIKKTVIPLIGALTLGLISVYELGERQGKVSQQRLYQDELAIQQEIIVELQKWQKDSLDEWMMLEMAIMMTESRYNPNAVGKSKDQGVFQQTPIYVKEVNRILEKVGVSQRYTHEDSFNIKKSIEMFNIIQNYYNPQHSPSVAIQKQNPGGESIGYSKKVYENLIFIERMETARRELINYHKERH
Other Proteins in cluster: phalp2_1136
Total (incl. this protein): 18 Avg length: 177,4 Avg pI: 5,89

Protein ID Length (AA) pI
5D1s 186 7,86090
1ctLt 163 5,09484
1khRz 169 4,91995
3ZP1T 171 8,92682
3gTBt 187 5,75025
3iVwZ 172 5,03471
4076R 171 8,53447
40a6N 187 5,17015
40dZ9 172 5,25666
40kNl 206 5,99728
419FQ 163 4,85538
4kkQX 200 6,65723
5PmbO 165 4,77046
71jT9 156 5,68114
7DEXe 206 6,54867
7DSlk 187 5,04022
8lzbn 169 5,11695
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_20115
23BJh
1 41,1% 124 3.466E-44
2 phalp2_2751
7yxpI
42 45,9% 122 6.503E-44
3 phalp2_12582
1k8RQ
2 36,5% 126 3.177E-31
4 phalp2_11808
8aeor
7 34,8% 135 7.732E-15
5 phalp2_5485
3er8i
3 25,8% 112 6.706E-14

Domains

Domains
Representative sequence (used for alignment): 5D1s (186 AA)
Member sequence: 23us2 (163 AA)
1 186 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5D1s) rather than this protein.
PDB ID
5D1s
Method AlphaFoldv2
Resolution 86.66
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50