Protein

Protein accession
A0AAE9W706 [UniProt]
Representative
4RjQQ
Source
UniProt (cluster: phalp2_15957)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MLTEQLDTSKFKIYKELDDLVVVDYRDNLKRLRGYGKRPEKVSLVVYHHSAGNENCDWTKDPQWQEFVDGVNHWHTKGLGWPACGYHIMVPYSALEHNGKNCVILANALGTMSYHTGRGRNAYGIGVCFQGNFAGPVNDSGLRPSDFQKECAHSIWLSWLQPDFNLADYQLTGHFNHGKQACPGEDLADIIYAHQMGFTLGGMLVQ
Physico‐chemical
properties
protein length:206 AA
molecular weight:23233,9 Da
isoelectric point:6,23
hydropathy:-0,43
Representative Protein Details
Accession
4RjQQ
Protein name
4RjQQ
Sequence length
367 AA
Molecular weight
41037,97740 Da
Isoelectric point
8,60842
Sequence
VDLFVGSNFVVRDLRPQIDRIRGDGYPYGWRGKGLAWSRAPRTIRGVEYHQTAGSLNPGEAGPITTARFVTANPWFRCPTCGRTWEGTVMYPYEACSSCKDDEGQPVLGKDLGRGRGWPKMCYHLFVPWAPLLDDAHRFIVYQCLDWYERSWHSNAEGNTYNVAVAFQGLYRSRHNPHFVPWPETDGQPSVAQQAIARPLWYEYLRIELKLAAAGLRGHFQYGKPTCPGDMLESIVVEVSGAKVETLTLPEIAPSAFPTWESRQRFLLSLGFDLGSYGPAKDGVDGKPGIRTRAAIEVIQRMHAIQVTGNWDPETDRVASVLQPLPGEIKQYVSSALPPSPVKASDELEEPAVSGSKPFGGRRRHRR
Other Proteins in cluster: phalp2_15957
Total (incl. this protein): 2 Avg length: 286,5 Avg pI: 7,42

Protein ID Length (AA) pI
4RjQQ 367 8,60842
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_19673
4B2DP
5 33,8% 310 3.358E-38
2 phalp2_37090
174wy
2 30,4% 243 8.068E-25

Domains

Domains [InterPro]
Unannotated
PG_1
Disordered region
Representative sequence (used for alignment): 4RjQQ (367 AA)
Member sequence: A0AAE9W706 (206 AA)
1 367 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471

Taxonomy

  Name Taxonomy ID Lineage
Phage Rhodobacteraceae phage LS06-2018-MD06
[NCBI]
3003839 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
OP947162 [NCBI]
CDS location
range 12162 -> 12782
strand +
CDS
ATGCTTACTGAACAACTAGACACTAGCAAATTTAAAATCTACAAAGAACTTGACGACCTTGTCGTTGTTGACTATCGAGACAATCTAAAGAGACTGCGCGGGTACGGCAAAAGACCAGAAAAGGTTTCTCTCGTTGTATACCATCACAGCGCCGGCAATGAAAACTGTGACTGGACCAAGGACCCGCAATGGCAAGAGTTCGTAGACGGTGTTAATCACTGGCATACCAAGGGTCTTGGCTGGCCGGCCTGCGGATATCACATTATGGTGCCCTACTCTGCGCTGGAGCACAACGGCAAAAACTGTGTCATTCTGGCTAACGCTCTCGGTACAATGTCGTACCATACAGGCCGTGGCAGAAATGCTTATGGAATCGGTGTATGCTTCCAGGGTAACTTTGCAGGACCGGTGAACGACAGTGGCTTGCGGCCATCGGACTTCCAGAAGGAATGCGCTCACAGTATTTGGCTTAGCTGGCTACAACCAGACTTCAACTTGGCGGATTATCAGCTGACAGGTCATTTTAACCACGGCAAGCAGGCATGCCCAGGAGAGGATCTAGCTGATATTATCTATGCTCATCAGATGGGTTTTACGCTTGGCGGGATGTTGGTGCAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
A0AAE9W706
Method SMR
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4RjQQ) rather than this protein.
PDB ID
4RjQQ
Method AlphaFoldv2
Resolution 81.12
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50