Protein

Protein accession
2vkY [EnVhog]
Representative
6kqeo
Source
EnVhog (cluster: phalp2_11210)
Protein name
2vkY
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNIRVKITRDRSGRVVTMDLEDYLKGVVPSEIKAETCPMEAQKAQAIAARTYAIRKTIDRRSKPYDVDDTAGYQAYGARPRHRNSDAAVEATRGLVLMYGGKLIDAVYTDSNGGRCVSSLERWGSAVPYLIDQLDPYDKSGKVRGHGVGLSQTGAAARGKAGQTCAEILGFYYPGAAIKKLKEEDKMNLDARLSENFTLREFYDPANYVDVLKGKAAPIVQPSDIDHRIVTLLEKLRAKYRQKWPGVAIRIRPHGGYRPDPLNKLVGGAPGSQHRKGNAADFSVVVLGKAIDAPTLAVWTEHYMQELGIKGGIGMYKATDNYIHVDARGKNVAWYDSYSSAGCPGQGGRPCTYRKGSKGAGVVLIQRYLGIPADGKYGSQTAVAVKAWQAEHGCTPDGIFGRETNRKMGYVLPWEV
Physico‐chemical
properties
protein length:416 AA
molecular weight:45519,3 Da
isoelectric point:9,43
hydropathy:-0,46
Representative Protein Details
Accession
6kqeo
Protein name
6kqeo
Sequence length
435 AA
Molecular weight
48692,14520 Da
Isoelectric point
9,24627
Sequence
MIIRVKISRQDNPDYGKVLNLDLEEYLRDVVPSEVRAGYDHMEALKAQAVAARTYAMYRAHKNRYLSYDVTDSSGRDQAYKSRPRHPRSDQAVAETAGWVLAYNNTTIDCKYTNSNGGRVRSSLERWGTDLPYYRGFDDPYDTTTEVSGHGVGMSQTGARAMARAGKNYKEILAYYYPGTRLIKREELNRVSFDYEEHLSPHFKRKEFFDPANYTNVTNKRTKRPYTYSEAEQALNGKVLVEKKLLDLLEAMREDLRKTYPGATIVLTPHGGYRPTELNAAVGGAAGSQHRYGRAADFRVVYGGKKVNAADLAVYTEKFMADHGYKGGVGMYHADDDYIHVDVRGVNVHWYASYKSAGCPGQGGTPCVYKHGYKSAGIVLVQRKLKELGYDPGTADGIYGVKTLNAVRKFQESVGLKPDGIYGKATNAKLGALPW
Other Proteins in cluster: phalp2_11210
Total (incl. this protein): 4 Avg length: 420,8 Avg pI: 9,38

Protein ID Length (AA) pI
6kqeo 435 9,24627
5K5L4 416 9,42542
5Kte9 416 9,40821
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_34532
4UelR
2 27,3% 475 4.502E-46

Domains

Domains
Representative sequence (used for alignment): 6kqeo (435 AA)
Member sequence: 2vkY (416 AA)
1 435 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF08291, PF08486

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
2vkY
Method AlphaFoldv2
Resolution 89.61
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6kqeo) rather than this protein.
PDB ID
6kqeo
Method AlphaFoldv2
Resolution 88.12
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50