Protein

Protein accession
A0A6J7VK34 [UniProt]
Representative
5sv59
Source
UniProt (cluster: phalp2_25860)
Protein name
Lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MDKLGPIKFLTVHCAATPEGREVSAEQISSWDKARFGQTSYHWVVELDGTAIRTLEDDEKGAHTGGHNTGNIGVCYVGGVCKDNKTPKDTRTEAQKVALRKIVADYQAKRPKMIVRGHRDWPGVAKACPSFDVAKDL
Physico‐chemical
properties
protein length:137 AA
molecular weight:15050,9 Da
isoelectric point:8,68
hydropathy:-0,56
Representative Protein Details
Accession
5sv59
Protein name
5sv59
Sequence length
242 AA
Molecular weight
26026,79340 Da
Isoelectric point
9,85311
Sequence
MLINDWRQDWRLWSVRVCAAAATIYAFMLAAPDQVLALWQALPPQIQEMIPNRTEIAAALFFAAAIARVLRQQGAAPAAATPAPLSLMARVEPWRAAALAPISRAITSIAVHCTATREGQPFKAADIRAWHKAQGWQDIGYHFIVGLDGTIEIGRPKARPGAHVAGYNGSSVGIVYVGGVASDGKTPKDTRTPAQKTAMLDLLRALRQAHPAAVIKGHRDYPRVAKACPSFDAKGEYTGMSA
Other Proteins in cluster: phalp2_25860
Total (incl. this protein): 16 Avg length: 186,5 Avg pI: 9,38

Protein ID Length (AA) pI
5sv59 242 9,85311
1dvac 253 9,56654
4nPGw 251 9,56783
7dzLZ 230 9,56977
7dzS7 254 9,72539
A0A7D7KJ61 133 8,97047
A0A7G9W3D3 133 8,97047
A0A7G9W3H1 133 8,97047
A0A7S6R6Y4 139 9,38893
A0A6J5MYZ2 140 9,19514
A0A8S5LD09 142 9,01540
A0A8S5UDB5 143 9,29842
A0A8S5V5A1 144 9,35174
A0AAE9KSH0 255 9,99642
A0AAX4QLC5 255 9,99642
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_12261
71VXz
7 35,7% 179 7.663E-30
2 phalp2_3660
7J5qp
4 30,0% 173 1.488E-27
3 phalp2_38135
6RJt9
7 24,5% 183 2.966E-05

Domains

Domains [InterPro]
Unannotated
Ami2
Representative sequence (used for alignment): 5sv59 (242 AA)
Member sequence: A0A6J7VK34 (137 AA)
1 242 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage uncultured Caudovirales phage
[NCBI]
2100421 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
LR796136 [NCBI]
CDS location
range 21719 -> 22132
strand -
CDS
ATGGACAAGCTCGGGCCCATTAAGTTTTTGACGGTGCATTGCGCCGCTACGCCAGAGGGCCGCGAGGTCTCCGCTGAGCAGATCTCCTCCTGGGACAAAGCGCGATTTGGCCAGACAAGCTATCACTGGGTTGTTGAGCTTGACGGCACGGCTATCAGGACACTGGAAGACGATGAAAAGGGCGCGCACACAGGCGGCCACAACACCGGCAACATTGGCGTTTGCTATGTTGGCGGTGTCTGCAAGGACAATAAGACGCCCAAGGACACGCGTACCGAGGCTCAGAAGGTAGCCCTGCGCAAGATCGTCGCTGACTACCAGGCCAAGCGTCCCAAGATGATTGTGCGCGGTCACCGCGACTGGCCGGGCGTTGCCAAGGCTTGCCCTAGCTTCGACGTAGCCAAAGACCTTTAA

CDS Source ID
CDS Source
LR797822 [NCBI]
CDS location
range 19967 -> 20380
strand +
CDS
ATGGACAAGCTCGGGCCCATTAAGTTTTTGACGGTGCATTGCGCCGCTACGCCAGAGGGCCGCGAGGTCTCCGCTGAGCAGATCTCCTCCTGGGACAAAGCGCGATTTGGCCAGACAAGCTATCACTGGGTTGTTGAGCTTGACGGCACGGCTATCAGGACACTGGAAGACGATGAAAAGGGCGCGCACACAGGCGGCCACAACACCGGCAACATTGGCGTTTGCTATGTTGGCGGTGTCTGCAAGGACAATAAGACGCCCAAGGACACGCGTACCGAGGCTCAGAAGGTAGCCCTGCGCAAGATCGTCGCTGACTACCAGGCCAAGCGTCCCAAGATGATTGTGCGCGGTCACCGCGACTGGCCGGGCGTTGCCAAGGCTTGCCCTAGCTTCGACGTAGCCAAAGACCTTTAA

CDS Source ID
CDS Source
LR798189 [NCBI]
CDS location
range 37690 -> 38103
strand +
CDS
ATGGACAAGCTCGGGCCCATTAAGTTTTTGACGGTGCATTGCGCCGCTACGCCAGAGGGCCGCGAGGTCTCCGCTGAGCAGATCTCCTCCTGGGACAAAGCGCGATTTGGCCAGACAAGCTATCACTGGGTTGTTGAGCTTGACGGCACGGCTATCAGGACACTGGAAGACGATGAAAAGGGCGCGCACACAGGCGGCCACAACACCGGCAACATTGGCGTTTGCTATGTTGGCGGTGTCTGCAAGGACAATAAGACGCCCAAGGACACGCGTACCGAGGCTCAGAAGGTAGCCCTGCGCAAGATCGTCGCTGACTACCAGGCCAAGCGTCCCAAGATGATTGTGCGCGGTCACCGCGACTGGCCGGGCGTTGCCAAGGCTTGCCCTAGCTTCGACGTAGCCAAAGACCTTTAA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5sv59) rather than this protein.
PDB ID
5sv59
Method AlphaFoldv2
Resolution 89.45
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50