Protein

Protein accession
A0A1D7SEW2 [UniProt]
Representative
2QG9f
Source
UniProt (cluster: phalp2_37450)
Protein name
Peptidase M23 domain-containing protein
Lysin probability
69%
PhaLP type
VAL
Probability: 98% (predicted by ML model)
Protein sequence
MAAGTESYEAPEYGNLAGAIGEKIGGALTMAAGARRRRNEEIDELNSKEDKTDEEKQRLKDLQSQGFGFIAKKAMGAEFGGDMKRRTKGFFQMNPDDQNDPALDKKKRFEALLRAQPVGNQTAPGAPPEAPKASPDGGALGSFATGIIEKISLLSKKVDDLKNVEQKDQTPKTVVNLSKNVGSIRRFFSKNNKIEEEQVKISEQQLEQQKEEAADAKQARAEAVAEGRNRTAGGSEIDNSRKGSTLKGLLGGALDFAGDLLGFGRRGRRGGRRRGGRRGGGIGLGGFGKRGRRRGASRGMSGGRGRTQYTAPVGPQPMNSATPWARKGAGDRGGQFGQGGFAPRMEASPIKFASGGIVDNPTTGQAVIPKNKLTAAVKTNQDNVKKADPFAKVMQLPTMAAGALLMSTVGNVINNMGGISKLFRPVLQRMFVPAATAFGLPANLISAFFGSSGASAKGLGGIGKGKGKGKGKNGGGGEQTSSNVTPGSTIGGGTVTGGGSVDGYSISSPFGARNTGIPGASTNHLGVDYRTPQGTKLSIKRPGKVIDTTTPAMGNNGEVYIQHDDGSKSRYLHMSAVAVSPGERVDTGAFLGKTGGEPGTPGAGPTSGAHLHFEYYPPGASGPVDGSGVASTVFSVGGTLTPTAPQAVQPTAAPVPSSQTPAPAAATPQSNQPTTLEPIVIPRPAPAAAAPAAADNTGNGGANLPVRNPNASMSLLGGMP
Physico‐chemical
properties
protein length:720 AA
molecular weight:73875,1 Da
isoelectric point:10,00
hydropathy:-0,53
Representative Protein Details
Accession
2QG9f
Protein name
2QG9f
Sequence length
248 AA
Molecular weight
24035,20410 Da
Isoelectric point
7,20289
Sequence
NGGGGSETTPGVTPGSTIGGGTITGGGSVDGYGISSGFGHRNTGIPGASTYHLGVDYRTPQGTKLSIKRPGKVIATTAPAIGNNGEVYIQHDDGSKSRYLHMSAVAVSAGQRVDAGAFLGKTGGEPGTPGAGPTSGAHLHFEYYPPGASGPVDGSGVASSVFSVGGTLTPTAPPTAVQPTAAATPSAQTPRPAGATPESNQPATLPPIVIPAPAAPAAAAEPDNSGNGGANLPVRNPNARMSLLGGMP
Other Proteins in cluster: phalp2_37450
Total (incl. this protein): 4 Avg length: 602,0 Avg pI: 9,31

Protein ID Length (AA) pI
2QG9f 248 7,20289
A0A1D7SE66 720 10,01673
A0A1D7SGC7 720 10,01673
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26176
8BAD7
1 37,6% 162 1.004E-23
2 phalp2_39770
jCI5
31 29,1% 199 9.973E-12

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Cyanophage S-RIM44
[NCBI]
1278485 Kyanoviridae > Vellamovirus > Vellamovirus rhodeisland44
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KX349294 [NCBI]
CDS location
range 8831 -> 10993
strand +
CDS
ATGGCAGCAGGCACCGAAAGTTACGAAGCACCTGAATATGGAAACCTCGCTGGTGCTATTGGCGAGAAGATTGGTGGTGCTCTTACGATGGCAGCAGGAGCAAGACGCCGCCGTAATGAGGAGATAGATGAATTAAATAGTAAAGAAGATAAAACAGACGAAGAAAAACAAAGACTAAAAGACTTACAGTCTCAAGGATTTGGGTTCATTGCAAAGAAAGCAATGGGTGCTGAGTTCGGTGGAGACATGAAGAGAAGAACTAAAGGGTTCTTCCAAATGAATCCTGATGATCAGAATGATCCAGCATTAGATAAGAAGAAAAGATTTGAAGCACTGCTGAGAGCACAACCAGTAGGAAATCAAACAGCACCTGGAGCACCACCAGAGGCACCAAAAGCATCTCCAGATGGCGGTGCCTTAGGATCATTTGCTACTGGTATTATTGAGAAGATTAGTCTTCTTTCTAAGAAAGTAGATGATCTAAAAAATGTAGAACAGAAAGATCAGACACCTAAAACTGTAGTAAATCTCAGTAAAAATGTAGGTAGCATTAGAAGGTTCTTCTCTAAGAACAATAAGATTGAAGAAGAACAAGTAAAGATTTCTGAGCAGCAACTAGAACAGCAGAAAGAAGAGGCTGCTGACGCAAAGCAAGCAAGAGCAGAAGCAGTAGCAGAGGGTAGAAATAGAACTGCTGGTGGTAGTGAAATTGATAACAGCAGGAAAGGATCCACTCTCAAAGGATTGCTTGGTGGAGCACTTGACTTTGCTGGTGACCTGCTTGGTTTTGGTCGTCGCGGTCGTCGTGGTGGCAGGAGAAGAGGTGGTCGTCGAGGGGGTGGTATTGGTTTAGGTGGGTTCGGCAAGCGTGGCAGGCGCAGAGGCGCGTCTCGCGGCATGAGTGGAGGTAGAGGTAGGACTCAATACACTGCTCCTGTCGGTCCACAACCGATGAACTCTGCCACACCATGGGCACGTAAAGGTGCTGGTGATCGTGGTGGTCAATTTGGACAGGGTGGATTTGCTCCAAGAATGGAGGCATCACCAATAAAGTTTGCAAGTGGTGGTATTGTTGACAATCCAACCACAGGACAAGCAGTTATTCCAAAGAACAAACTAACAGCAGCAGTCAAAACTAATCAAGATAATGTAAAGAAAGCAGATCCTTTTGCTAAGGTGATGCAACTACCTACCATGGCAGCAGGTGCTCTGCTCATGTCAACGGTTGGTAATGTCATCAACAACATGGGTGGTATATCTAAACTATTCCGTCCAGTTCTGCAGAGAATGTTTGTTCCTGCTGCTACAGCATTTGGTCTGCCTGCTAATCTAATTAGTGCATTCTTTGGTTCTAGTGGTGCTTCAGCGAAAGGACTTGGAGGGATTGGTAAAGGTAAAGGCAAAGGTAAAGGGAAAAATGGTGGCGGTGGTGAACAAACATCATCTAATGTCACACCTGGATCTACTATTGGCGGAGGAACAGTTACTGGTGGTGGTTCTGTTGATGGATATTCAATCTCATCACCTTTCGGTGCTCGTAATACTGGTATACCAGGCGCTTCTACTAATCACTTAGGCGTTGACTATCGCACACCACAAGGAACCAAACTTTCTATCAAGAGACCAGGAAAGGTTATTGATACTACTACCCCTGCTATGGGTAACAATGGTGAAGTATATATTCAGCATGATGATGGATCTAAATCTAGATACTTACACATGAGTGCTGTAGCAGTGTCTCCTGGTGAACGTGTTGATACTGGAGCATTCCTTGGTAAAACTGGTGGAGAACCTGGAACTCCTGGTGCTGGTCCTACTAGTGGTGCTCACCTACACTTTGAATACTATCCACCTGGAGCATCTGGTCCTGTTGATGGTTCTGGTGTTGCATCAACTGTCTTTAGTGTTGGTGGGACCCTCACGCCTACTGCTCCACAAGCAGTTCAACCAACCGCTGCTCCAGTTCCATCATCACAGACACCTGCACCTGCTGCAGCAACTCCACAATCAAATCAACCAACAACACTAGAACCTATTGTTATCCCTCGTCCTGCCCCTGCCGCTGCAGCACCAGCAGCTGCTGATAATACAGGTAATGGTGGTGCTAATCTTCCCGTAAGAAATCCTAACGCATCAATGTCATTGCTAGGAGGTATGCCGTAA

Gene Ontology

Description Category Evidence (source)
GO:0004222 metalloendopeptidase activity molecular function None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2QG9f) rather than this protein.
PDB ID
2QG9f
Method AlphaFoldv2
Resolution 72.27
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50