Protein

Protein accession
Q4ZD58 [UniProt]
Representative
7vCZc
Source
UniProt (cluster: phalp2_36896)
Protein name
Endolysin 2638A
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MLTAIDYLTKKGWKISSDPRTYDGYPKNYGYRNYHENGINYDEFCGGYHRAFDVYSNETNDVPAVTSGTVIEANDYGNFGGTFVIRDANDNDWIYGHLQRGSMRFVVGDKVNQGDIIGLQGNSNYYDNPMSVHLHLQLRPKDAKKDEKSQVCSGLAMEKYDITNLNAKQDKSKNGSVKELKHIYSNHIKGNKITAPKPSIQGVVIHNDYGSMTPSQYLPWLYARENNGTHVNGWASVYANRNEVLWYHPTDYVEWHCGNQWANANLIGFEVCESYPGRISDKLFLENEEATLKVAADVMKSYGLPVNRNTVRLHNEFFGTSCPHRSWDLHVGKGEPYTTTNINKMKDYFIKRIKHYYDGGKLEVSKAATIKQSDVKQEVKKQEAKQIVKATDWKQNKDGIWYKAEHASFTVTAPEGIITRYKGPWTGHPQAGVLQKGQTIKYDEVQKFDGHVWVSWETFEGETVYMPVRTWDAKTGKVGKLWGEIK
Physico‐chemical
properties
protein length:486 AA
molecular weight:55494,5 Da
isoelectric point:8,44
hydropathy:-0,76
Representative Protein Details
Accession
7vCZc
Protein name
7vCZc
Sequence length
254 AA
Molecular weight
28957,78350 Da
Isoelectric point
9,50627
Sequence
MKHIYSKHIEGSKLTGKKDSIAGVVIHNDYGRMTPSQYLSWLYQREANGTHVDGFASVYANKDECLWYHPTDYVEWHCANRWANANLIGIEVVQSYPGILTDEQFKLNEEACFEIAADILKSYNLPVNRDTVNLHREYYATACPHRSWDIHVGKNAPNTRANQVKLLDYFISRIKFYYNGGSTKTVTKQKAPVKKKVVKKMSKPKSSTSTYKVKSGDSLWGIATSNKMTVAELKKLNGLKTNDIFVNQVLKIKK
Other Proteins in cluster: phalp2_36896
Total (incl. this protein): 30 Avg length: 317,5 Avg pI: 9,32

Protein ID Length (AA) pI
7vCZc 254 9,50627
1bALT 289 9,62154
1bAVv 284 9,43522
1cmfx 289 9,43522
1cmhd 284 9,43522
2qUiq 284 9,58447
78HwB 312 9,54811
7lKHR 284 9,39441
7rBDx 284 9,62154
7rBH7 284 9,51871
7rBHX 284 9,50801
7rBI6 284 9,47287
7rBLG 284 9,02759
7rBWw 289 9,38403
7ryoQ 284 9,43522
7zobs 286 9,58447
7zogE 284 9,63785
7zohR 209 9,61096
8IppW 289 9,55120
8IppX 289 9,51871
QOjl 284 9,54166
A0A1Q1PVY4 289 9,55120
A0A1J0MFY3 486 8,44853
A0A4P6QXB6 486 8,85443
A0A1S6KVC6 295 9,23273
A0A499SHK0 486 8,27260
A0A499SJP3 486 8,84489
A0AAX4BIS3 299 9,33601
A0AAX4BIZ1 299 9,33601
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_7790
7qCWn
51 40,4% 198 1.990E-52
2 phalp2_35500
DDEb
68 43,6% 181 1.166E-50
3 phalp2_2897
yopH
100 33,1% 208 1.018E-46
4 phalp2_36839
6Z1Os
11 30,8% 185 2.067E-44
5 phalp2_32258
7xeAb
253 34,8% 201 7.440E-39
6 phalp2_15488
8diNj
63 34,9% 183 2.022E-36
7 phalp2_20237
8tSRS
1 34,5% 188 7.817E-32
8 phalp2_15359
1El58
3 30,5% 193 1.453E-31
9 phalp2_6798
8987b
91 27,5% 229 4.853E-11
10 phalp2_5182
3wjGA
273 24,0% 183 2.351E-08

Domains

Domains [InterPro]
Representative sequence (used for alignment): 7vCZc (254 AA)
Member sequence: Q4ZD58 (486 AA)
1 254 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01476, PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Staphylococcus phage 2638A
[NCBI]
320836 Fibralongavirus > Fibralongavirus fv2638A
Host Staphylococcus aureus
[NCBI]
1280 Firmicutes > Bacilli > Bacillales > Staphylococcaceae > Staphylococcus >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
AY954954 [NCBI]
CDS location
range 20695 -> 22155
strand +
CDS
ATGCTAACTGCTATTGACTATCTTACGAAAAAAGGTTGGAAAATATCATCTGACCCTCGCACTTACGATGGTTACCCTAAAAACTACGGCTACAGAAATTACCATGAAAACGGCATTAATTATGATGAGTTTTGTGGTGGTTATCATAGAGCTTTTGATGTTTACAGTAACGAAACTAACGACGTGCCTGCTGTTACTAGCGGAACAGTTATTGAAGCAAACGATTACGGTAATTTTGGTGGTACATTCGTTATTAGAGACGCTAACGATAACGATTGGATATATGGGCATCTACAACGTGGCTCAATGCGATTTGTTGTAGGCGACAAAGTCAATCAAGGTGACATTATTGGTTTACAAGGTAATAGCAACTATTACGACAATCCTATGAGTGTACATTTACATTTACAATTACGCCCTAAAGACGCAAAGAAAGATGAAAAATCACAAGTATGTAGTGGTTTGGCTATGGAAAAATATGACATTACAAATTTAAATGCTAAACAAGATAAATCAAAGAATGGGAGCGTGAAAGAGTTGAAACATATCTATTCAAACCATATTAAAGGTAACAAGATTACAGCACCAAAACCTAGTATTCAAGGTGTGGTCATCCACAATGATTATGGTAGTATGACACCTAGTCAATACTTACCATGGTTATATGCACGTGAGAATAACGGTACACACGTTAACGGTTGGGCTAGTGTTTATGCAAATAGAAACGAAGTGCTTTGGTATCATCCGACAGACTACGTAGAGTGGCATTGTGGTAATCAATGGGCAAATGCTAACTTAATCGGATTTGAAGTGTGTGAGTCGTATCCTGGTAGAATCTCGGACAAATTATTCTTAGAAAATGAAGAAGCGACATTGAAAGTAGCTGCGGATGTGATGAAGTCGTACGGATTACCAGTTAATCGCAACACTGTACGTCTGCATAACGAATTCTTCGGAACTTCTTGTCCACATCGTTCGTGGGACTTGCATGTTGGCAAAGGTGAGCCTTACACAACTACTAATATTAATAAAATGAAAGACTACTTCATCAAACGCATCAAACATTATTATGACGGTGGAAAGCTAGAAGTAAGCAAAGCAGCAACTATCAAACAATCTGACGTTAAGCAAGAAGTTAAAAAGCAAGAAGCAAAACAAATTGTGAAAGCAACAGATTGGAAACAGAATAAAGATGGCATTTGGTATAAAGCTGAACATGCTTCGTTCACAGTGACAGCACCAGAGGGAATTATCACAAGATACAAAGGTCCTTGGACTGGTCACCCACAAGCTGGTGTATTACAAAAAGGTCAAACGATTAAATATGATGAGGTTCAAAAATTTGACGGTCATGTTTGGGTATCGTGGGAAACGTTTGAGGGCGAAACTGTATACATGCCGGTACGCACATGGGACGCTAAAACTGGTAAAGTTGGTAAGTTGTGGGGCGAAATTAAATAA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0004222 metalloendopeptidase activity molecular function None (UniProt)
GO:0006508 proteolysis biological process None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0046872 metal ion binding molecular function None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.5.1.28 None Hydrolyzes the link between N-acetylmuramoyl residues and L-amino acid residues in certain cell-wall glycopeptides. experimental evidence used in manual assertion
ECO:ECO:0000305
PubMed:22777279

Tertiary structure

PDB ID
6YJ1
Method PDB
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
7AQH
Method PDB
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
Q4ZD58
Method AlphaFoldv2
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi00005058de_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7vCZc) rather than this protein.
PDB ID
7vCZc
Method AlphaFoldv2
Resolution 90.25
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50