Protein

Protein accession
8s0mk [EnVhog]
Representative
40lRy
Source
EnVhog (cluster: phalp2_18955)
Protein name
8s0mk
Lysin probability
99%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MAAIDTAEECAGQIPYTFGGSRYDFWEGTDCSGLVCSALYKTYGIDPYSFGDWTGAQWSDTNYTDQIWWGTTADLPWDDMQRGDLIFTSNCSEDFSTGEGSHIGFYTGDPSNPFYSHFATYGPRWTAVNGVYGNECYYGVKRLKVEAGEQSSNDDENPDDVWNFDQNGVLMRDRIQGTDEATNAANSAANVMREQLTRTDDVSGRGTEANLYERMCWMGARTAEISDKLDELINLLKE
Physico‐chemical
properties
protein length:238 AA
molecular weight:26641,6 Da
isoelectric point:4,15
hydropathy:-0,66
Representative Protein Details
Accession
40lRy
Protein name
40lRy
Sequence length
243 AA
Molecular weight
27549,25120 Da
Isoelectric point
4,70714
Sequence
MSKGSELIALARSKAGQWRYTNDYPDRMYPEEYGGTDCSGFVRWCYAQFGYDVGTWTGDESYAGYEVARGHYPSEIPWDDMQPGDLILMTATYYDDYSFSHYLCHIELYCGGGTMIGHPGGYGPQEKWAQAWMQAYGCITWMVRRVFEGDDDVSAADVWSYKNESMNGKNDAYQLLTDIHTQLVRTDNAGWGTPAGHDIFGRINAIEQTVDKLKPTSQNVTVEVDYDKLANKVADVIYARMKQ
Other Proteins in cluster: phalp2_18955
Total (incl. this protein): 6 Avg length: 247,8 Avg pI: 4,80

Protein ID Length (AA) pI
40lRy 243 4,70714
21KQm 239 4,72306
23T4w 208 4,69242
40gB7 309 5,68017
7wjfz 250 4,82383
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_6363
7w9Kz
1 56,4% 195 4.069E-72
2 phalp2_21377
24Hqg
1 62,7% 172 6.280E-67
3 phalp2_5495
3gRD8
3 31,9% 216 1.079E-29
4 phalp2_6349
7qJkP
2 28,0% 164 2.512E-13
5 phalp2_35176
5Yy0H
89 27,1% 236 3.392E-13
6 phalp2_24854
7xCMB
4 29,4% 173 5.480E-11
7 phalp2_30104
38LMY
4 26,1% 180 9.515E-05

Domains

Domains
Representative sequence (used for alignment): 40lRy (243 AA)
Member sequence: 8s0mk (238 AA)
1 243 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00877

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (40lRy) rather than this protein.
PDB ID
40lRy
Method AlphaFoldv2
Resolution 74.95
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50