Protein

Protein accession: 4sJqI [EnVhog]
Representative: 4sJqI (this protein)
Source: EnVhog (cluster: phalp2_27329)
Protein name: 4sJqI
Lysin probability: 89%
PhaLP type: VAL (probability: 98%, predicted by ML model)
Protein sequence
MPQGFIPLSKQMDFLWNEVQGKEKSGFGKFLTANASSPEDYATLWDKYYERSGGAGDEKARNYASSVYAAMADGTSNEALISPNAKFAYGYLTQKGLTPQQAAGITGRLMAESYEDMNPDARNTLAGGKGTYGIAQWRGSRLQDLADFTGADIGDITSLPATKPSGGLLTSNQGGQDMAISNKPPYMMGGEQTYNAPNMRQPAPQQAQQGGMRGLLSTLKDAATAVDPNTGLTGFQTFASALDPLILPELRGGGEAIRKSGAARVARGQLNKTIEWLSNNGYPEMAAAVRANPGAASNIMSAVLSKKLTPKKDDGTTAMQNYQFLISRGMSEEEALKQAFGKGGPNINLGSGAQIAGDYVIVEDPTSEAGVRFVPIPGGKADQAAQQAQEREQVSEQQSAQKEAVVSNSIGNLISMIDKGGIFDLPEAGIVGNVLGSLGVNQEAVDFRNELASIQANIAFDRLQQMREASKTGGALGAVSERELDLLMNAYGNINQSTSPEKLKENLINIRNIMTKIENDPVASSFYYGSPAQGAQASPQSGGTQVGEAY
Physico-chemical properties
protein length: 550 AA
molecular weight: 58206,3 Da
isoelectric point: 4,98
hydropathy: -0,44
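The record does not state how the hydropathy value is calculated. A common choice is the GRAVY score, the mean Kyte-Doolittle hydropathy index over all residues. The sketch below assumes that definition. The function name `gravy` is hypothetical and not part of any PhaLP tooling.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy (GRAVY) of a protein sequence.

    Assumption: the database's "hydropathy" field is a GRAVY score.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # Reject ambiguous or non-standard residue codes rather than guessing.
    unknown = set(seq).difference(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Calling `len(seq)` and `gravy(seq)` on the sequence above gives values to compare with the reported length and hydropathy. Agreement with the reported hydropathy would support the GRAVY assumption, but it would not confirm the database's actual method.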
Other Proteins in cluster: phalp2_27329
Total (incl. this protein): 38; Avg length: 529,2; Avg pI: 5,58

Protein ID Length (AA) pI
1amnS 445 5,63129
1vpS5 537 5,53455
22K5x 543 5,25132
23ml8 531 5,29565
296mg 545 5,44992
2JAXT 537 5,41894
2kXFu 549 5,56206
2mVBP 526 5,04670
3Qqo4 541 5,11769
4WAla 527 5,31237
4v2KJ 527 5,04539
5dIIG 469 5,66283
6AJWc 543 5,56138
6B3Sc 537 5,39086
6B4Va 526 5,22756
6BaXO 536 5,54546
6JDiT 536 5,67886
6JvX9 536 5,42837
6NMQI 542 6,01939
6NNxQ 544 5,40263
6NO68 546 5,55018
6NqRb 544 6,72891
6Oxpz 534 5,25979
7Ux7Z 527 5,16532
8BA1G 479 6,56578
8Btae 481 6,69480
8EAP2 538 6,56447
8EIDe 545 5,27116
8bcWZ 532 5,41223
8wmhu 538 5,29565
8xqZZ 536 5,27116
8z1pG 499 4,91841
AdkQ 550 6,58658
WGXF 547 6,58869
WmwL 545 5,30242
Wy8a 514 6,02700
qw8P 526 5,30077
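The cluster averages can be recomputed from the rows above. This minimal sketch assumes each row has three whitespace-separated fields (ID, length, pI) and that pI uses a decimal comma, as in this listing. The helper names `parse_row` and `cluster_averages` are hypothetical.

```python
from statistics import fmean


def parse_row(line: str) -> tuple[str, int, float]:
    """Split one 'ID length pI' row; the pI uses a decimal comma."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


def cluster_averages(lines: list[str]) -> tuple[float, float]:
    """Return (mean length, mean pI) over all non-empty rows."""
    rows = [parse_row(line) for line in lines if line.strip()]
    if not rows:
        raise ValueError("no rows to average")
    avg_length = fmean(length for _, length, _ in rows)
    avg_pi = fmean(pi for _, _, pi in rows)
    return avg_length, avg_pi
```

The table lists the other cluster members only, while the stated totals include this protein. To compare against the reported averages, append a row for 4sJqI with its own length and pI before calling `cluster_averages`.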
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24384 (4c7ox) 12 27,7% 526 2.692E-26
2 phalp2_18377 (5BYZz) 1 25,4% 464 4.318E-13

Domains

No domain annotations available.

Taxonomy

Phage: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG [NCBI]
Lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 4sJqI
Method: AlphaFold v2
Resolution: 70.24
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
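The confidence legend maps directly onto a small lookup. The sketch below is illustrative, and the function name `confidence_band` is hypothetical. The legend uses strict inequalities and so does not say where scores of exactly 90, 70 or 50 fall. Assigning them to the lower band is an assumption made here.

```python
def confidence_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence label used in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) go to the lower band.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"
```

Applied per residue, for example to the B-factor column where AlphaFold model files store pLDDT, this reproduces the colour bands of the structure viewer.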