Protein

Protein accession
A0AAX4BHX2 [UniProt]
Representative
7r9gE
Source
UniProt (cluster: phalp2_36222)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKNIYSNHIKGQKLTSQKPSIDGVVIHNDYGSMTPSQYLNWLYTREQNGTYTQGWASVYINRNEVLWYHPTNFVEWHCGNQYANQHLIGFEVCESYPNHISDETFMKNEEATFKVVADVMKSYNLPINRNTVHLHREYFSTSCPHRSWDIHIGVNAPNTRANQLKLIDYFISRIKHYANGGKTPDKPQVSENKYVKYNWRGTFTAHKTNTLPIVPRYDYGMSAKEVDKDSYIQPNEYVPFYQIIKDKQAKLWWIKFKYAKKGSSDKYFYMPIGHIEDKDEKILNEKHLWGKLEVEKHGK
Physico‐chemical
properties
protein length:299 AA
molecular weight:35290,4 Da
isoelectric point:9,06
hydropathy:-0,82
Representative Protein Details
Accession
7r9gE
Protein name
7r9gE
Sequence length
490 AA
Molecular weight
55682,98120 Da
Isoelectric point
6,47091
Sequence
MLTAIDYLTKKGWKISSDPRKYEGYPNNYGYRNYQENGVNYDSFCNGYHRAFDLYSNATNDIPTVTSGTVVTSETHGNFGGTVEIRDANGNDWIYGHLQRDSLRFTKGDKVNQGDIVGLQGSSNYYDNPMNAHLHLQLRPKGTDLNDEKAEVCSGLPMEKYDISKLNAEQDKSKNGSEKQLKHICSNHIKGNKITAPKPSVQGVVIHNDYGSMTPAQYLPWLYARENNGTHVNGWASVYANRNEVLWYHPTDYVEWHCGHQWANANLIGFEVCESYPGRSSDKLFLENEEATLKVAADVMRSYSLPVNRNTVRLHNEFFGTSCPHRSWELHVGKGAKYTEANQNKMKDYFISRIKYYYNGGKLQAGNVQVINEKDVKNEVAKHTEKQAVKTTDWKQNNYSTWWKNEQATFENGNEPIQVWHVGPFRINGNEAGKLPAGASINYDEVMLQDGHVWVGYDSFEGERLYLPVRTWNGVAPPNHGIGDLWGSIH
Other Proteins in cluster: phalp2_36222
Total (incl. this protein): 32 Avg length: 446,4 Avg pI: 8,23

Protein ID Length (AA) pI
7r9gE 490 6,47091
1iZpL 487 8,98684
1k5WR 487 6,22639
5tkh 487 8,45621
79o1B 488 8,05302
7gPWZ 489 7,66874
7kQHm 486 8,26796
7qqbA 502 8,67179
7qsDm 489 7,65334
7qtNR 489 6,97246
7qtRA 489 6,60829
7qugZ 489 6,47597
7quld 489 8,02317
7r9jI 490 7,69483
7r9oR 490 7,69346
7r9qI 490 7,10097
7rqxU 487 8,45679
7wlzs 481 8,95983
7ycFz 486 8,26325
7ycI1 486 9,02308
7zG9Z 486 8,83077
8JNsH 486 8,43706
8JNsI 486 8,44853
8MGJQ 486 8,85443
8fv1N 487 8,28195
A0A1Q1PW31 289 9,51871
A0A1S6L1H5 299 9,35199
A0A3G2YSK6 299 9,18612
A0A3G2YSM9 299 9,18612
A0A3G2YSP8 299 9,28346
A0AAX4BIV5 299 9,24575
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_16525
72rfr
1 27,4% 506 1.523E-35

Domains

Domains [InterPro]
Representative sequence (used for alignment): 7r9gE (490 AA)
Member sequence: A0AAX4BHX2 (299 AA)
1 490 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510, PF01551, PF08460

Taxonomy

  Name Taxonomy ID Lineage
Phage Staphylococcus phage S-CoN_Ph29
[NCBI]
3076586 Rountreeviridae > Andhravirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
OR354852 [NCBI]
CDS location
range 5655 -> 6554
strand +
CDS
ATGAAAAATATTTATTCAAACCACATTAAAGGTCAAAAGTTAACAAGTCAAAAACCTAGTATTGACGGGGTTGTCATTCATAATGATTATGGAAGTATGACACCTAGCCAATATTTAAACTGGTTATATACACGTGAACAAAATGGTACGTATACACAGGGTTGGGCTTCAGTTTATATCAATCGTAATGAGGTTTTATGGTATCACCCTACAAATTTTGTTGAGTGGCATTGTGGAAATCAATACGCAAACCAACATTTAATTGGTTTTGAAGTCTGTGAAAGTTATCCAAATCATATATCTGATGAAACATTTATGAAAAATGAAGAAGCTACTTTTAAAGTTGTTGCCGATGTGATGAAATCATACAATTTACCAATTAATCGTAATACGGTTCACCTACATCGTGAATACTTTTCAACGTCTTGTCCTCATCGTAGTTGGGATATACACATAGGGGTTAATGCACCAAATACAAGAGCAAATCAATTGAAATTGATTGATTACTTTATTTCACGTATTAAACATTATGCAAACGGAGGTAAAACACCTGATAAACCACAAGTGAGTGAAAATAAATACGTTAAATATAATTGGCGTGGTACATTTACTGCTCATAAAACAAATACATTACCGATTGTTCCTAGATATGATTATGGTATGAGTGCAAAAGAGGTTGATAAAGATTCATATATACAACCTAATGAATATGTACCGTTTTATCAAATCATAAAAGATAAACAAGCAAAATTATGGTGGATTAAGTTTAAATACGCTAAAAAAGGTTCAAGCGATAAATATTTCTATATGCCAATCGGTCATATTGAAGATAAAGACGAAAAAATACTAAATGAAAAACATCTTTGGGGAAAATTGGAGGTTGAAAAGCATGGCAAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7r9gE) rather than this protein.
PDB ID
7r9gE
Method AlphaFoldv2
Resolution 88.50
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50