Protein

Protein accession
5K5L4 [EnVhog]
Representative
6kqeo
Source
EnVhog (cluster: phalp2_11210)
Protein name
5K5L4
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MHIRVKITRDKSGRVVTMDLEDYLKGVVPSEIKAETCPMEAQKAQAIAARTYAIRKTIDRRSKPYDVDDTAGYQAFGARPRHKNSDAAVEATRGMVLMYGGKLIDAVYTDSNGGRCVSSLERWGSAVPYLIDQPDPYDNSGKVRGHGVGLSQTGAAARGKAGQTCAEILGFYYLGAAIKKLKEADKMNLDVRLSENFTLREFYDPANYVDVLKGKAAPIVQPSDIDHRIIALLEKLRAKYRQKWPGVVIRIRPHGGYRPDPLNKLVGGAPGSQHRKGNAADYSVVVLGKAIDAPTLAVWTEHYMQEMGIKGGIGMYKATDNYIHVDARGKNVAWYDSYSSAGCPGQGGRPCTYRKGSKGAGVVLIQRYLGVPADGKYGPQTMAAVKAWQAEHGCTPDGIFGRETNRKMGYVLPWEV
Physico‐chemical
properties
protein length:416 AA
molecular weight:45518,5 Da
isoelectric point:9,43
hydropathy:-0,44
Representative Protein Details
Accession
6kqeo
Protein name
6kqeo
Sequence length
435 AA
Molecular weight
48692,14520 Da
Isoelectric point
9,24627
Sequence
MIIRVKISRQDNPDYGKVLNLDLEEYLRDVVPSEVRAGYDHMEALKAQAVAARTYAMYRAHKNRYLSYDVTDSSGRDQAYKSRPRHPRSDQAVAETAGWVLAYNNTTIDCKYTNSNGGRVRSSLERWGTDLPYYRGFDDPYDTTTEVSGHGVGMSQTGARAMARAGKNYKEILAYYYPGTRLIKREELNRVSFDYEEHLSPHFKRKEFFDPANYTNVTNKRTKRPYTYSEAEQALNGKVLVEKKLLDLLEAMREDLRKTYPGATIVLTPHGGYRPTELNAAVGGAAGSQHRYGRAADFRVVYGGKKVNAADLAVYTEKFMADHGYKGGVGMYHADDDYIHVDVRGVNVHWYASYKSAGCPGQGGTPCVYKHGYKSAGIVLVQRKLKELGYDPGTADGIYGVKTLNAVRKFQESVGLKPDGIYGKATNAKLGALPW
Other Proteins in cluster: phalp2_11210
Total (incl. this protein): 4 Avg length: 420,8 Avg pI: 9,38

Protein ID Length (AA) pI
6kqeo 435 9,24627
2vkY 416 9,43406
5Kte9 416 9,40821
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_34532
4UelR
2 27,3% 475 4.502E-46

Domains

Domains
Representative sequence (used for alignment): 6kqeo (435 AA)
Member sequence: 5K5L4 (416 AA)
1 435 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF08291, PF08486

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6kqeo) rather than this protein.
PDB ID
6kqeo
Method AlphaFoldv2
Resolution 88.12
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50