Protein

Protein accession
4fO87 [EnVhog]
Representative
2Uk1y
Source
EnVhog (cluster: phalp2_21645)
Protein name
4fO87
Lysin probability
99%
PhaLP type
endolysin
Probability: 88% (predicted by ML model)
Protein sequence
MKIASTPVKGQTGADVKALQAALKGKGFDPGSIDGIFGSKTEKAVSKFQKSIGLPGSGVIGPKTLAGLGLEIGDVVQPGAPITTGMDPDEGTQPWYRRMWAAIAFDPGFEERVANSARVVLKGKERYAAVAQKVFKDCFMTDNNGVKHAVIGSDLWWVIGLIHMKEASCNFAGVLHNGEKIIGTGKKTSIVPIGRGPFATWEEAAVDALNGESLGKLKDFEIGELFRAIERYNGTGYITGAGKTENSPYLWALSNINDDKGKYVSDGKWDPNASTQSAAGFATMLKWLVDNAGVVVVSKAAAVAVQPKPEAPKSGSKLTRQMVADKIIEVINRDIAAKLRETHGKNRSPRIDSFNKRAGVYMGAPYCASAGTCAIADAVAELSEILGLKLKNPVRITAASQDMRRTSYVPAKYIRKEGSLGKKGDVGVLQVNGDPAHGHYTTLSKDQESQPSFDTVEYNTDSGGSRDGDGAYARVRSTVDGSRANSGKLFICFTDVPQWIVDANA
Physico‐chemical
properties
protein length:505 AA
molecular weight:53822,5 Da
isoelectric point:9,04
hydropathy:-0,28
Representative Protein Details
Accession
2Uk1y
Protein name
2Uk1y
Sequence length
458 AA
Molecular weight
51350,08230 Da
Isoelectric point
9,10972
Sequence
MKIYSIPQPGDVNEDVRIFQAALNAQKVVKPKLVEDDNFGPKTRTAASQFQKSIGLEGTGIPGPKTIEALGLILDKAGQPVETPQVPFRLSWDNKAERLPWTGYIFSRLYEMYDSHIVKIKDMERFRIDWVALTKDQRIYVMAEIIVQMAKHESGWNPDSASVDVGNKDKKDTWSIGLLQLSVADQSWVKPRDKTRYTYEELIKPIPNLDLAFSILKRQIEKDGRLVLPNKSPLRYWAVMLDGNKYSKVDSIISVIKKIKIPEAVNEKLPSKDKKIDREVIARKIVAIIQADIDANLRETHGKNRSPRIDSFNKRAHAYPGDPYCASGGWCAIDDACKELGLKNPVAPTASSQAFRKTSFVPAKYIRPEGSKGKIGDVGVLQQVSDPGKGHYVTLREDQVSQPLFKTVEYNTDGSGSRDGDGAYAMTRSTVDRSAENSGKIFVCFTDIPQWIADHNKL
Other Proteins in cluster: phalp2_21645
Total (incl. this protein): 5 Avg length: 478,2 Avg pI: 9,07

Protein ID Length (AA) pI
2Uk1y 458 9,10972
1lKJt 461 9,52657
4WrM5 473 9,18231
4fMdr 494 8,48360
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24607
5ffnv
1 39,8% 296 2.984E-51

Domains

Domains
PG_1
SLT
Unannotated
Representative sequence (used for alignment): 2Uk1y (458 AA)
Member sequence: 4fO87 (505 AA)
1 458 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464, PF01471

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4fO87
Method AlphaFoldv2
Resolution 85.14
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (2Uk1y) rather than this protein.
PDB ID
2Uk1y
Method AlphaFoldv2
Resolution 88.13
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50