Protein

Protein accession: oez4 [EnVhog]
Representative: 3A3co
Source: EnVhog (cluster: phalp2_40360)
Protein name: oez4
Lysin probability: 99%
PhaLP type: endolysin (probability: 98%, predicted by ML model)
Protein sequence
MRKNWKKLMGMLCCCGMLCMFPVTSYADLEPQEVQMVEANPEEESIKRAEIEGDEPEEDIYEDSSEAPKVLNRTRAAVAEQEAIAEAKTEAKKQTENEAVRQAWLRAEEESSEERARSLAAAESAGKRQKVVDFALTFLGGPYRYGGNDPRTGVDCSGFTRYVLSNAACVHITRTAASQSTEGVPVSDVTMQPGDLLFYSGGGRINHVAMYIGGGKVVHASSVKTGIKISPWNYRTPVKIVNVLGD
Physico-chemical properties
Protein length: 246 AA
Molecular weight: 26931,1 Da
Isoelectric point: 5,23
Hydropathy: -0,43
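
The length and hydropathy values above can be recomputed from a sequence. The following is a minimal sketch, assuming the hydropathy figure is a GRAVY score (mean Kyte-Doolittle hydropathy per residue); the page does not state which scale it uses, so that is an assumption. The helper names `clean`, `sequence_length` and `gravy` are illustrative and not part of any PhaLP or EnVhog tool.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
# Assumption: the page's "hydropathy" is the mean of these values (GRAVY).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def clean(seq: str) -> str:
    """Uppercase the sequence and drop any whitespace or line breaks."""
    return "".join(seq.split()).upper()


def sequence_length(seq: str) -> int:
    """Number of residues in the sequence."""
    return len(clean(seq))


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue."""
    residues = clean(seq)
    if not residues:
        raise ValueError("empty sequence")
    unknown = set(residues) - set(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError(f"non-standard residues: {sorted(unknown)}")
    return sum(KYTE_DOOLITTLE[r] for r in residues) / len(residues)


# First ten residues of oez4, used here only to keep the example short.
fragment = "MRKNWKKLMG"
print(sequence_length(fragment), round(gravy(fragment), 2))
```

Passing the full protein sequence instead of the fragment gives values to compare against the reported length and hydropathy.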
Representative Protein Details
Accession: 3A3co
Protein name: 3A3co
Sequence length: 353 AA
Molecular weight: 38090,44990 Da
Isoelectric point: 4,58960
Sequence
LKNWMKYGACALSAALLLSEPMCANAVEISGMDTTKQIEESTELLTAGIGDLFTDVLTENEYVEIAKRAEGALWGYTKLGICNVEENNLNVRQEPDESGKLVGKLPKNAACEIISSENGWAFISSGKVEGYVKEEYLLTGYDAKKKGEELASAMAVATADALNIREEPSTDAEVVTQVAAGETLDIVEIQNDGWIKVYLDDEEVYVSSDYVEVKSDLSTAITMTELLYGQGVSDVRVDLCQYAKQFLGNPYVWGGTSLTSGADCSGFVLSVFAKYGVSLPHSSRAQANLGTSISASELQPGDLVFYAKGGTINHVAIYIGGGQVIHASSPKTGIKISSYNYRTPYKMVRILQD
Other Proteins in cluster: phalp2_40360
Total (incl. this protein): 9; Avg length: 289,8; Avg pI: 7,05

Protein ID Length (AA) pI
3A3co 353 4,58960
5Kd9i 251 9,24884
7NHLh 306 8,58405
7OOpk 306 8,58405
7OfJr 304 8,29065
7OqqI 306 8,57560
7Ykh0 304 5,55774
7efD8 232 4,80729
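
The cluster summary can be checked against the member table. The sketch below assumes that the ninth member is this protein (oez4, 246 AA, pI 5,23, taken from the physico-chemical properties) and that the averages are plain arithmetic means; `parse_decimal_comma` is a hypothetical helper for the decimal-comma notation used on this page.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma number such as '4,58960' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as printed on the page)
members = [
    ("3A3co", 353, "4,58960"),
    ("5Kd9i", 251, "9,24884"),
    ("7NHLh", 306, "8,58405"),
    ("7OOpk", 306, "8,58405"),
    ("7OfJr", 304, "8,29065"),
    ("7OqqI", 306, "8,57560"),
    ("7Ykh0", 304, "5,55774"),
    ("7efD8", 232, "4,80729"),
    ("oez4", 246, "5,23"),  # this protein; assumed to be the ninth member
]

lengths = [length for _, length, _ in members]
pis = [parse_decimal_comma(pi) for _, _, pi in members]

avg_length = sum(lengths) / len(lengths)
avg_pi = sum(pis) / len(pis)

print(f"{len(members)} members, avg length {avg_length:.1f}, avg pI {avg_pi:.2f}")
# prints: 9 members, avg length 289.8, avg pI 7.05
```

Under these assumptions the computed means agree with the reported averages (289,8 and 7,05).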
Similar Clusters (pHMM search)
# Cluster Protein ID # Members Identity (%) Alignment Length E-value
1 phalp2_12021 7OoVx 10 28,3% 303 2.299E-36
2 phalp2_14412 4yftP 16 25,8% 375 8.621E-33
3 phalp2_11039 7DN8Z 1 31,6% 253 1.723E-26

Domains

Disordered region
SH3_3
NLPC_P60
Representative sequence (used for alignment): 3A3co (353 AA)
Member sequence: oez4 (246 AA)
Axis: 1 to 353 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF00877, PF08239

Taxonomy

Phage: Unknown from Metagenome [NCBI]
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3A3co) rather than this protein.
PDB ID: 3A3co
Method: AlphaFoldv2
Resolution: 79.29
Chain position: -
Model confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
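
The confidence legend above maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows; the function name `plddt_band` is illustrative. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so assigning them to the lower band is an assumption made here. Reading the 79.29 listed under Resolution as a mean pLDDT is also an assumption, since the page does not label it as such.

```python
def plddt_band(score: float) -> str:
    """Return the confidence band for a pLDDT score on the 0 to 100 scale.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Assumption: 79.29 (shown under "Resolution") is a mean pLDDT for the model.
print(plddt_band(79.29))  # prints: High
```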