Protein

UniProt accession: Q4ZE52 [UniProt]
Protein name: ORF004
PhaLP type: endolysin
  evidence: ML prediction
  probability: 99% (predicted by ML model)

Protein sequence
MILKRVITMNDQEKIDKFTHSYINDDFGLTIDQLVPKVKGYGRFNVWLGGNESKIRQVLKAVKEIGVSPTLFAVYEKNEGFSSGLGWLNHTSARGDYLTDAKFIARKLVSQSKQAGQPSWYDAGNIVHFVPQDVQRKGNADFAKNMKAGTIGRAYIPLTAAATWAAYYPLGLKASYNKVQNYGNPFLDGANTILAWGGKLDGKGGSPSDSSDSGSSGDSGSSLLALAKQAMQELLKKVQDALQWDVHSIGSDKFFSNDYFTLQKTFNNTYNIKMTIGLLDSLKKLIDSVQVDSGGSSSNPTDDDGDHKAISGKSVKPNGKSGRVIGGNWTYAQLPEKYKKAIGVPLFKKEYLYKQGNIFPQTGNAGQCTELTWAYMSQLHGKRQPTDDGQITNGQRVWYVYKKLGAKTTHNPTVGYGFSSKPPYLQATAYGIGHTGVVVAVFDDGSFLVANYNVPPYVAPSRVVLYTLINGVPHNAGDNIVFFSGIA
Physico-chemical properties
protein length: 487 AA
molecular weight: 53104.00000 Da
isoelectric point: 9.30803
aromaticity: 0.11088
hydropathy: -0.37310
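
The record does not say how these values were computed. As a minimal sketch, assuming aromaticity is the fraction of Phe, Trp and Tyr residues and hydropathy is the mean Kyte-Doolittle score (GRAVY), the two properties could be recomputed from the protein sequence as follows. The function names `aromaticity`, `gravy` and `summarize` are illustrative and not part of PhaLP.

```python
# Kyte-Doolittle hydropathy scale (assumption: the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Residues counted as aromatic (assumption: Phe, Trp, Tyr only).
AROMATIC = frozenset("FWY")


def aromaticity(seq: str) -> float:
    """Return the fraction of residues that are Phe, Trp or Tyr."""
    return sum(aa in AROMATIC for aa in seq) / len(seq)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def summarize(seq: str) -> dict:
    """Collect length, aromaticity and hydropathy, rounded to 5 decimals."""
    return {
        "length": len(seq),
        "aromaticity": round(aromaticity(seq), 5),
        "hydropathy": round(gravy(seq), 5),
    }
```

Calling `summarize` on the protein sequence above allows a comparison with the listed values. Under this definition, an aromaticity of 0.11088 for 487 residues would correspond to 54 aromatic residues (54 / 487).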

Domains

Domains [InterPro]
Domain map of protein sequence Q4ZE52 (positions 1 to 487)
Legend: Pfam, SMART, CDD, TIGRFAM, HAMAP, SUPFAM, PRINTS, Gene3D, PANTHER, Other

Taxonomy

Phage
  Name: Staphylococcus phage 66 [NCBI]
  Taxonomy ID: 320832
  Lineage: Rountreeviridae > Rosenblumvirus > Rosenblumvirus rv66
Host
  Name: Staphylococcus aureus [NCBI]
  Taxonomy ID: 1280
  Lineage: Bacteria > Firmicutes > Bacilli > Bacillales > Staphylococcaceae > Staphylococcus

Coding sequence (CDS)

GenBank protein accession: AAX90655.1 [NCBI]
GenBank nucleotide accession: AY954949 [NCBI]
CDS location: range 10270 -> 11733, strand +
CDS
ATGATACTGAAAAGAGTGATAACAATGAACGATCAAGAGAAGATAGATAAATTTACGCATTCCTATATTAATGATGATTTTGGTTTAACGATAGACCAGTTAGTCCCTAAAGTAAAAGGATATGGGCGCTTTAATGTATGGCTTGGTGGTAATGAAAGTAAAATCAGACAAGTATTAAAAGCAGTAAAAGAGATAGGTGTTTCACCTACTCTTTTTGCCGTATATGAAAAAAATGAGGGTTTTAGTTCTGGACTTGGTTGGTTAAACCATACGTCTGCACGTGGTGATTATTTAACAGATGCTAAATTCATAGCAAGAAAGTTAGTATCACAATCAAAACAAGCTGGACAACCGTCTTGGTATGACGCAGGTAACATCGTCCACTTTGTACCTCAAGACGTACAAAGAAAAGGTAATGCAGATTTTGCAAAAAATATGAAAGCAGGTACAATTGGACGTGCATATATTCCATTAACAGCAGCTGCTACTTGGGCGGCATATTATCCTTTAGGTTTGAAAGCATCATATAACAAAGTACAAAACTATGGTAACCCATTTTTAGACGGTGCGAATACTATTCTAGCTTGGGGTGGTAAATTAGACGGTAAAGGTGGATCACCTAGTGATTCGTCTGACAGTGGTAGTAGTGGTGACAGTGGTAGTTCACTACTCGCTTTAGCAAAACAAGCCATGCAAGAATTATTAAAAAAAGTACAAGACGCATTACAATGGGATGTGCATAGTATTGGTAGTGATAAATTTTTTAGTAATGATTATTTTACATTACAAAAAACATTTAACAACACATATAATATTAAAATGACAATTGGTTTACTTGATTCATTAAAAAAACTGATTGATAGCGTGCAAGTGGATAGTGGGGGTAGTAGTTCTAATCCTACTGATGATGACGGTGACCATAAAGCAATTAGTGGTAAATCAGTTAAACCAAATGGAAAAAGTGGACGTGTGATTGGTGGTAACTGGACGTATGCACAGTTACCAGAAAAATATAAAAAAGCGATTGGTGTACCTTTATTCAAAAAAGAATATTTATACAAACAAGGTAACATATTTCCTCAAACGGGAAATGCAGGACAGTGTACAGAATTAACATGGGCGTATATGTCACAACTACATGGAAAAAGACAACCTACCGACGACGGTCAAATAACAAACGGTCAACGTGTATGGTACGTCTATAAAAAGTTAGGTGCAAAAACAACACATAATCCAACAGTTGGTTACGGTTTTTCTAGTAAACCACCATACTTACAAGCAACAGCATATGGTATTGGTCACACTGGTGTTGTTGTAGCAGTATTTGACGATGGTTCGTTTTTAGTCGCAAACTATAATGTACCACCATATGTTGCACCATCACGTGTGGTATTATATACACTCATTAATGGTGTACCACATAATGCAGGTGATAATATTGTATTCTTCAGTGGTATTGCATAA
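
The CDS coordinates and the protein length can be cross-checked with simple arithmetic. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the standard codon assignments apply; the helper names `cds_length` and `translate` are illustrative only.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a plus-strand CDS codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


n_nt = cds_length(10270, 11733)   # 1464 nucleotides
n_codons = n_nt // 3              # 488 codons: 487 residues plus one stop codon
print(n_nt, n_codons)
print(translate("ATGATACTGAAAAGAGTG"))  # first six codons of the CDS above
```

The range 10270 -> 11733 spans 1464 nucleotides, that is 488 codons, which matches the 487-residue protein plus the terminal TAA stop codon. The first six codons translate to MILKRV, matching the start of the protein sequence.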

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available.