Protein

Protein accession
S5YC98 [UniProt]
Representative
7PQjE
Source
UniProt (cluster: phalp2_2393)
Protein name
Putative lysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MYYHNRNLANLEKLAPHTRQKAKQWYQYCVENGIEVLIYETTRTIEQQRENVRKGASQTMKSYHLVGQALDFVPAREAEVYWDGYYRNDIQKAIGYAKSIGFEWGGDWKGFVDSPXLQYNYNGYGTDKGNVSDEPVHVTGNTGVVRVVVDSALVRREPTTQSPINTDAGENGRLYRGTEWQAWGSTIGEGGYTWYPLGNEMWVRGDLVSWRNA
Physico‐chemical
properties
protein length:213 AA
molecular weight: Da
isoelectric point:6,11
hydropathy:
Representative Protein Details
Accession
7PQjE
Protein name
7PQjE
Sequence length
276 AA
Molecular weight
31709,03470 Da
Isoelectric point
9,65925
Sequence
MTMYYEERSRNNIAKLAANTRAKALEWFNWCCKNGIEVLVYETIRTKEQQAANVANGKSQTMRSYHIVGQAFDFVMAKGKTVDWGGYKTAKAKKVIAKAKALGFSWGGDWSGFVDCPHMQYEYKGYGTDKFTADKLVANNKTGKQGVYARDFLNIRTKATWDSPVAFKVPIYYYAQILWDTKNGDWVQIEFQGKKGWYKPSFKEYWYEKDPCTSYICVADVNFRKSSKWDSPVAQKKKKGETVRMVKKAKNGWLEFGLTNGVIGYIPNSAKYVKKK
Other Proteins in cluster: phalp2_2393
Total (incl. this protein): 101 Avg length: 277,7 Avg pI: 9,76

Protein ID Length (AA) pI
7PQjE 276 9,65925
3BTGY 200 9,67614
3UE8 281 9,84157
71VDB 283 9,82700
73Ut5 276 9,64333
73cjI 281 9,81965
73cm9 281 9,86213
74epd 281 9,73004
75T68 281 9,84782
76l89 281 9,75982
76l8t 283 9,89733
76ojD 281 9,84157
7803F 291 9,89675
78MIa 281 9,85362
78N6u 293 9,58105
7B4nT 281 9,84131
7BpXp 283 9,86239
7QDzm 281 9,90126
7QE5S 276 9,68072
7baxg 281 9,80514
7bnOX 281 9,89707
7cOc7 283 9,87722
7cYzK 280 9,81965
7cZQz 283 9,75434
7czOq 287 9,63959
7czPg 276 9,61116
7dYHQ 280 9,87722
7danT 281 9,76801
7datF 204 9,75518
7dnia 296 9,58208
7e4tb 281 9,82700
7gKz7 281 9,81945
7gQx3 281 9,87722
7gQy4 280 9,78206
7hEzm 281 9,75982
7hG00 280 9,85665
7hGOa 281 9,87696
7hGuo 281 9,82700
7hGxG 281 9,82700
7hw7g 280 9,82700
7hw85 273 9,61554
7iEEL 281 9,89707
7kM71 281 9,82700
7kXft 280 9,84157
7kdu0 281 9,83370
7lH2h 245 9,53495
7lgAN 281 9,84782
7lgB7 281 9,87722
7lgO1 281 9,79102
7lgWK 279 9,83995
7lgt7 280 9,80514
7lgu0 281 9,79102
7lh5K 281 9,76801
7lh5i 283 9,84157
7lhaZ 282 9,84157
7mJow 283 9,73010
7myol 283 9,82674
7oIh3 280 9,82700
7oIjs 281 9,91635
7oJ8m 281 9,72288
7oOqu 281 9,78206
7oOv7 281 9,84782
7oXHu 282 9,87748
7oXLY 286 9,64900
7oXMC 282 9,91609
7oXNi 283 9,84131
7oz46 280 9,80514
7ozhu 281 9,82700
7qYxN 280 9,84157
7r61U 276 9,69219
7tN2Z 281 9,84782
7tZFF 281 9,84157
7tZHS 247 9,80392
7uILM 281 9,73004
7uWID 281 9,84782
7uWIm 283 9,82674
7uWOD 280 9,81965
7uuMB 281 9,81965
7vbaV 282 9,81288
7vbba 281 9,74112
7vp9J 283 9,86239
7vpd3 283 9,86213
7w0Gq 283 9,76801
7w0L0 281 9,82700
7wKGa 281 9,89308
7wKHK 281 9,77736
7wd3Y 280 9,79663
7wd85 281 9,76801
7wdaP 281 9,90101
7wpBR 281 9,80514
7wpn7 283 9,76801
7x43C 280 9,80514
7y5ZN 281 9,84969
8IiQT 280 9,85665
8IiQU 281 9,88199
8IiQW 283 9,78206
8IiR0 281 9,89707
8IiR2 283 9,73010
A8ATB8 276 9,65925
W8EEW7 215 9,57899
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_8189
7bcYD
17 38,8% 286 6.623E-61
2 phalp2_32215
72gmg
11 41,1% 221 1.664E-57
3 phalp2_12448
qevc
108 30,3% 201 1.559E-25
4 phalp2_34998
4MASs
7 25,0% 228 9.103E-21
5 phalp2_33303
6crLy
183 26,1% 252 7.592E-20
6 phalp2_1844
3xTtn
8 21,6% 231 1.283E-17

Domains

Domains [InterPro]
PET_M15
Unannotated
Unannotated
Representative sequence (used for alignment): 7PQjE (276 AA)
Member sequence: S5YC98 (213 AA)
1 276 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF13539

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage Wip1
[NCBI]
663237 Kalamavirales > Tectiviridae > Betatectivirus > Betatectivirus Wip1
Host Bacillus anthracis
[NCBI]
1392 Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus > Bacillus cereus group

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KF188458 [NCBI]
CDS location
range 10344 -> 10985
strand +
CDS
ATGTATTATCACAATCGAAATCTAGCAAACCTAGAGAAGTTAGCACCGCATACAAGACAGAAAGCAAAACAGTGGTATCAATATTGTGTAGAAAACGGTATTGAGGTACTAATCTATGAAACAACTCGTACCATCGAACAACAACGTGAGAACGTACGTAAAGGCGCGTCACAAACTATGAAGTCATACCACTTAGTAGGACAAGCTTTGGATTTCGTACCAGCAAGAGAAGCCGAGGTATATTGGGATGGATACTATAGAAACGACATTCAAAAGGCTATCGGGTATGCGAAGTCTATCGGTTTTGAATGGGGTGGTGACTGGAAAGGGTTTGTTGATAGTCCACRCTTACARTACAACTATAACGGTTACGGGACGGACAAAGGAAACGTGTCAGATGAGCCTGTACACGTGACAGGGAATACAGGGGTTGTAAGAGTTGTTGTTGATAGCGCACTTGTCAGAAGAGAGCCTACAACACAATCACCTATCAATACGGATGCTGGAGAAAATGGTCGCTTATATCGTGGTACTGAATGGCAAGCGTGGGGAAGTACAATTGGTGAAGGTGGTTATACATGGTATCCATTAGGAAATGAAATGTGGGTACGTGGTGATTTAGTAAGTTGGAGAAATGCATAA

Gene Ontology

Description Category Evidence (source)
GO:0008233 peptidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000387537d_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7PQjE) rather than this protein.
PDB ID
7PQjE
Method AlphaFoldv2
Resolution 93.89
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50