Protein

Protein accession
A0A7G9W222 [UniProt]
Representative
6W4yG
Source
UniProt (cluster: phalp2_14782)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MTYTFQKACSLVTDVLPAAENKIQRGGMPSVVTGLVVHQFRADELRFDVHIGSLINSFTQPGDRIASAHFAVEGKRVVQFVDLKDRAFHAGAAGNDHWSIEVFGGMDETTLATVAALIRELERIAGRELTLYRHNALMATSCGTHVDLAKIRALVGATQGKEAATMVRPVAAVHKTSQPFGSYATAGVVGNPNGSTVQQLVAMYGNYQPFGHAGEDIACPIGTPVYAIADGEVVWADWATNLPGDDSDLGYRRRWYFYKGFPGILTVIKHPQLGPNVYTAYAHLSDNNMAPVGTKVRAGQLIAKSGNTGGVAPHLHVEYLVDPTYSSSGGFIYGRKNPALLYGIPAATTQGGSTLSSAEVTEIKNYIHALLIGGYTSGGKAHPGVAMVAEENQRRIGRVLTAVEGVPAATSEAVWSDRKVRRAGGDVAAIQELANVNTKLDALAPAISGLPAALSAAVAQAVEANLPEVIAAEIPDDLAQGVIDALAARLTLTKDAPPTA
Physico‐chemical
properties
protein length:500 AA
molecular weight:52710,1 Da
isoelectric point:6,31
hydropathy:0,01
Representative Protein Details
Accession
6W4yG
Protein name
6W4yG
Sequence length
206 AA
Molecular weight
21725,84470 Da
Isoelectric point
4,50889
Sequence
MRPVAEQFAVTQAFGSYATGGVAADPNGTEVQQLVAQYGNYQPYGHAGQDIGCPIGTPVHAIAAGTVLWADWGTNLPGDETDAGYRERWYLYKTFPGIVTVIQHDGWISVYAHLSEAPLNPGDTVTEGQQIALSGNTRSPGVSLGAHLHVEALIDLSFATGNGLIYGRTDPTPFFNPGAITVQSTTPPAPGVGPDQQFLIDLFGSL
Other Proteins in cluster: phalp2_14782
Total (incl. this protein): 33 Avg length: 265,0 Avg pI: 5,96

Protein ID Length (AA) pI
6W4yG 206 4,50889
19ZkI 276 5,13150
1hMsR 166 5,07529
1hO7q 280 5,13480
1hRhl 197 4,82543
3PtD3 217 6,06776
4LMIj 170 6,02376
6T8Kj 206 5,85296
756jy 186 5,13923
7dApe 265 5,94811
7hGjQ 197 4,83731
7uimb 216 5,78794
7vXMo 205 5,33249
7wWEf 197 5,27991
7wcpr 211 6,02842
7x5Gs 181 5,14964
7zTA9 205 4,82543
fTxI 204 5,51738
fsqL 212 5,41172
A0A3G2KJH4 322 6,24271
A0A6B9JL67 330 6,65973
A0A6G8R2M7 330 7,06375
A0A6G8R3L3 330 6,65973
A0A7M1CLB3 326 5,71808
A0A7T3N1B1 328 6,65695
A0A9E7LTH3 321 7,13252
A0AA48Y3F8 328 6,65695
A0AA48Y3Y1 327 7,82577
A0AA92N4F3 330 7,06431
A0AAF0GHT5 321 7,83222
A0AAF0K7D6 330 7,06494
A0AAF0K7N2 325 5,90508
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_276
6CCkj
17 57,1% 189 6.525E-77
2 phalp2_33817
1joRs
29 38,2% 188 4.425E-57
3 phalp2_33390
6Q97d
9 30,9% 194 4.418E-25
4 phalp2_8157
6Tpcb
26 29,0% 193 1.124E-24
5 phalp2_2494
5Ecd6
8 35,2% 176 3.615E-21
6 phalp2_40532
4HCiG
1 41,3% 138 6.720E-21
7 phalp2_24154
2cn0r
3 32,1% 146 6.971E-19
8 phalp2_38873
1OvVW
15 30,5% 193 1.293E-18
9 phalp2_18623
3ebV
83 33,8% 130 4.446E-18
10 phalp2_36749
6BXWz
16 30,1% 179 3.849E-17

Domains

Domains [InterPro]
Representative sequence (used for alignment): 6W4yG (206 AA)
Member sequence: A0A7G9W222 (500 AA)
1 206 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Arthrobacter phage Tweety19
[NCBI]
2768133 Galvastonvirus > Galvastonvirus tweety19
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MT897906 [NCBI]
CDS location
range 17561 -> 19063
strand +
CDS
ATGACCTACACCTTCCAGAAGGCATGCTCCCTCGTGACGGACGTCCTCCCGGCCGCCGAGAACAAGATCCAGCGAGGCGGCATGCCTTCGGTCGTGACGGGGCTCGTCGTGCATCAGTTCCGCGCCGACGAGCTCCGCTTCGACGTCCATATCGGCTCGCTTATCAACTCGTTTACACAGCCCGGCGACCGGATCGCCTCGGCTCACTTCGCCGTCGAGGGTAAGCGCGTCGTCCAGTTCGTCGACCTGAAGGACCGGGCGTTTCACGCCGGGGCGGCCGGTAACGATCACTGGTCGATCGAGGTCTTCGGCGGCATGGACGAGACAACCCTCGCGACCGTCGCCGCCCTGATCCGGGAGCTCGAACGGATCGCCGGGCGCGAGCTCACGCTCTACCGCCACAACGCCCTTATGGCGACCTCGTGTGGAACTCACGTCGACCTCGCCAAGATCCGCGCCCTCGTCGGCGCTACCCAAGGAAAGGAGGCCGCTACTATGGTCCGTCCCGTCGCCGCCGTCCATAAGACGTCGCAACCCTTCGGCTCCTATGCGACGGCCGGAGTCGTCGGCAACCCGAACGGCTCGACCGTTCAGCAACTCGTCGCCATGTACGGCAACTATCAGCCCTTCGGCCACGCCGGAGAGGATATCGCTTGCCCGATCGGGACGCCCGTCTACGCGATCGCCGACGGCGAGGTCGTCTGGGCCGACTGGGCTACCAACCTGCCCGGCGACGACTCCGACCTCGGCTACCGCCGCCGCTGGTACTTCTACAAGGGCTTCCCGGGCATCCTGACGGTTATCAAGCATCCTCAGCTTGGACCGAACGTTTACACGGCCTACGCCCATCTGTCCGATAACAACATGGCCCCCGTCGGGACGAAGGTCCGGGCCGGTCAGCTTATCGCCAAGTCCGGCAACACCGGAGGCGTCGCCCCTCACCTTCACGTCGAGTATCTCGTCGACCCGACTTACTCCTCGTCCGGAGGCTTCATCTACGGCCGGAAGAACCCGGCCCTTCTCTATGGCATCCCTGCCGCTACTACCCAAGGAGGCTCGACCTTGTCGTCCGCTGAAGTAACCGAGATCAAGAACTACATTCACGCCCTCCTGATCGGCGGCTACACGTCCGGGGGCAAGGCCCATCCCGGCGTCGCTATGGTCGCCGAGGAGAATCAGCGCCGCATCGGCCGCGTGCTGACCGCCGTCGAGGGCGTACCGGCCGCGACGTCCGAGGCGGTCTGGTCCGACCGCAAGGTCCGCCGTGCCGGAGGCGACGTCGCGGCGATTCAGGAGCTCGCGAACGTGAACACGAAGCTCGACGCCCTCGCCCCGGCGATCTCCGGCCTTCCTGCCGCGCTGAGCGCCGCCGTCGCTCAGGCCGTCGAGGCGAACCTCCCGGAAGTGATCGCGGCCGAGATCCCGGACGACCTCGCTCAGGGCGTGATCGACGCCCTCGCGGCCCGGCTGACCCTGACGAAGGACGCCCCGCCCACCGCGTAA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0004222 metalloendopeptidase activity molecular function None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6W4yG) rather than this protein.
PDB ID
6W4yG
Method AlphaFoldv2
Resolution 83.68
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50