Protein

Protein accession
A0A3Q8HZI8 [UniProt]
Representative
4g1gr
Source
UniProt (cluster: phalp2_21811)
Protein name
Peptidase M23 domain-containing protein
Lysin probability
87%
PhaLP type
VAL
Probability: 98% (predicted by ML model)
Protein sequence
MDEREQEKKKETTEEPRQGEESPSPSSREGVGLVRSFLSGVVRSIKEAVPKAVVDSGGGHKVGDGGRESYQANLFAKTVAEGIKRQNAILGEISASLKKISSTLVYIRKTLKDWYERRNQLSLRDLLGLFGAVSLGAFRQFSDLARGAGSLLRDLLLYRLLGRGILGRGTGRIPRTGRGFFGRLFRREGTTRGQVIEVGGRRIARIVKPEPGKPVFPRAPMAPVPIPTQAPKPQPQRGLPRVREVLGRSATFLREVGTRVSRVPEGLLRVLSRAGSIATRGLARVLPVASLFLLGKDFVENLIEVRKLTGEGIGGALKAGLASALSTLSLGIVPPEVSASLVSKFTEWVSGVFGKAREKASELASKVSSFFQDSFLPVFISTKDSVVRSFSSVKERISEVASGLVDSISSVLSSIREKISSLPIVEWISHLSNSIKDSISSVSSLASEKISNALSSIGLFFDSVKDFFEEKIKGIGSSLSNLGKGVGDRLRGFGEGVVNFGKGVMDRLFGKPKEPGGGGVRGFNEKMQEPPVVGTAARPREPGSYQTLGPVAKSELSEIYEKVIVPSVGRHSITQLIGQNRPFVDPATGKVVNPMGYGRHGHMGIDVATPQGTPVRAPFDGVVVDASGGAWGRSVYLVGKNLAVRFSHLSEISVPSGTYVKAGQVIAKTGNTGNSTGPHLDITFYTANRGKIGTWIANYEAINRLVSSEITKTPLSAREVEEIRSKAKASSVSASSTFDTGQVPISSPFGLFVNQVAQAAGNLLGLDAQEVISSIQKRISDFADYLYRNSGMVDIRQIERRMNEIIGTPKRFQSKPYIEGVTYTSQANVKLGTPTPAPRPNVPPSPRPKAPATARPATPATPKPPIQASPTKKEGTQETPPQVKGTVKTTSESYTTPPTRPEDVGVGKTFYVPTVTTKTTPEGKEAKKRSVLMLLTSDLEANRQQGLRTLASLVARGELTEEEASELVKEAERIRSGKGAKATPASTLKPPPTPKPSGQEGGIGERRGETPPQGQVVGERGGEGGQKSGEKEVKVSQVPPVSPITPKTSMKSPEVKVPEVKVVVPQQAPQVSPYQVPRQTTSPNALETLILINTLVNR
Physico‐chemical
properties
protein length:1098 AA
molecular weight:117537,9 Da
isoelectric point:10,04
hydropathy:-0,32
Representative Protein Details
Accession
4g1gr
Protein name
4g1gr
Sequence length
232 AA
Molecular weight
25096,56360 Da
Isoelectric point
7,81532
Sequence
MDDKLTKFRDAAIKSGYSENEVNSFLDMVKSAPQKKEEVSTQEKKEEVSTQVASFEPNFSSITGTVGYNQGGIYTPVMTAEEQKNLSNKVIPVETTLTQAFGNLNPIEKFSGGFNRGADFSIPKNTPLSVPAGDWMVSESFNGAKSGGVGNATNRGYGNSVVLVNRLTGERIRYSHLNSVNVKPGQLVKPGQTVAYSGSTGNSTGPHLDVEYYNSQGKLSNILTSKYSNQFI
Other Proteins in cluster: phalp2_21811
Total (incl. this protein): 3 Avg length: 540,0 Avg pI: 7,48

Protein ID Length (AA) pI
4g1gr 232 7,81532
2n74L 290 4,59790
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_16120
4gKTJ
4 43,9% 157 1.100E-40
2 phalp2_21998
4ZChj
22 29,2% 188 2.252E-12

Domains

Domains [InterPro]
Unannotated
PET_M23
Representative sequence (used for alignment): 4g1gr (232 AA)
Member sequence: A0A3Q8HZI8 (1098 AA)
1 232 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Thermus phage phiLo
[NCBI]
2301536 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MH673673 [NCBI]
CDS location
range 85248 -> 88544
strand +
CDS
TTGGACGAAAGGGAGCAAGAAAAGAAGAAGGAAACCACGGAGGAGCCTCGCCAGGGGGAAGAAAGCCCTTCTCCCTCATCTAGGGAGGGAGTAGGCCTGGTGAGGTCTTTCCTCTCTGGGGTGGTCCGTAGCATAAAGGAAGCCGTTCCCAAGGCGGTCGTGGATAGTGGCGGAGGCCACAAGGTCGGGGATGGAGGAAGAGAGTCCTACCAGGCCAACCTCTTCGCAAAGACAGTCGCCGAGGGGATTAAGAGGCAGAATGCCATTTTAGGAGAAATCTCTGCTAGCCTAAAGAAGATAAGTTCAACGCTCGTCTACATAAGGAAGACGCTAAAGGACTGGTACGAGAGGAGAAACCAGCTATCTCTTAGGGACCTGTTGGGGCTCTTTGGGGCGGTGTCCTTGGGAGCCTTCCGTCAGTTCTCCGACTTGGCCCGTGGAGCGGGGAGCTTACTAAGAGACCTACTACTCTACAGGCTATTGGGCCGTGGGATTCTTGGTAGGGGAACTGGAAGAATCCCAAGGACAGGAAGGGGGTTCTTTGGGAGGCTATTCCGTAGGGAAGGAACTACCAGGGGCCAGGTTATAGAGGTCGGGGGTAGGAGGATAGCCAGGATTGTCAAGCCTGAGCCAGGGAAGCCTGTCTTTCCTCGGGCTCCCATGGCCCCTGTTCCTATCCCAACGCAGGCCCCTAAGCCCCAACCTCAAAGAGGCCTTCCCAGGGTGAGGGAGGTTCTTGGGCGTTCAGCAACATTCCTGAGGGAAGTGGGAACCCGAGTGTCCAGGGTGCCCGAGGGGTTGTTGCGAGTCCTGTCCCGAGCAGGCTCCATTGCTACTCGGGGCTTAGCCAGGGTTCTTCCTGTAGCCTCCCTCTTCCTCCTGGGGAAGGACTTTGTGGAGAACCTGATAGAGGTCAGAAAGCTTACCGGGGAGGGTATAGGCGGTGCACTAAAGGCCGGGCTCGCATCGGCCCTCTCCACCCTCTCCCTTGGAATAGTTCCTCCAGAAGTGTCTGCCTCTCTGGTGAGCAAGTTCACGGAGTGGGTTTCCGGGGTGTTTGGCAAGGCCAGGGAGAAGGCTTCTGAGCTAGCCTCTAAGGTATCCTCCTTTTTCCAGGACTCCTTCCTACCTGTCTTCATAAGCACCAAAGACTCCGTTGTTCGGTCGTTTTCCTCAGTTAAGGAAAGGATTTCTGAAGTTGCTTCTGGGCTAGTGGATAGCATCTCCTCTGTCCTCTCCTCTATCCGTGAGAAGATATCTTCCTTGCCTATTGTAGAGTGGATTTCTCATCTCTCTAACTCTATAAAGGACTCTATTTCCTCCGTTTCTAGCCTGGCTTCTGAAAAGATTTCTAACGCCCTCTCCTCCATAGGGCTCTTCTTTGATTCCGTAAAGGACTTTTTTGAGGAAAAGATAAAGGGAATAGGCTCCTCCCTCTCTAACCTCGGAAAGGGAGTAGGGGACAGATTGAGAGGATTTGGAGAGGGGGTCGTAAACTTTGGCAAGGGGGTAATGGACAGGCTCTTTGGAAAACCCAAGGAGCCTGGAGGGGGAGGAGTCAGAGGCTTTAACGAGAAGATGCAAGAGCCCCCGGTGGTGGGTACGGCGGCGAGGCCCAGGGAGCCTGGTTCCTACCAAACCCTAGGGCCTGTGGCCAAGAGCGAGCTTTCCGAGATATACGAAAAGGTCATCGTTCCCTCCGTTGGAAGGCATAGCATAACGCAACTAATAGGCCAGAACAGGCCTTTCGTTGACCCGGCCACGGGCAAGGTGGTCAACCCAATGGGCTACGGGAGGCACGGCCACATGGGTATAGACGTGGCCACCCCACAGGGAACACCTGTCAGGGCTCCGTTTGACGGAGTAGTCGTGGACGCTTCGGGAGGGGCCTGGGGCCGTTCGGTCTACCTGGTTGGGAAGAACCTAGCCGTCAGGTTCTCCCACCTTTCGGAGATTTCCGTTCCCTCAGGAACCTACGTCAAGGCGGGCCAAGTCATCGCTAAGACCGGAAACACGGGGAACTCTACCGGGCCTCATCTGGACATTACCTTCTACACCGCTAACAGAGGCAAGATAGGAACCTGGATAGCCAACTATGAGGCGATAAATAGACTTGTCTCCTCTGAGATAACGAAGACCCCGTTGAGTGCTAGGGAAGTAGAAGAGATAAGGAGCAAGGCCAAGGCCTCCTCGGTATCCGCTTCTTCCACCTTCGATACTGGGCAAGTACCCATTTCCTCTCCCTTTGGCCTTTTTGTGAATCAGGTGGCCCAAGCCGCCGGAAACCTCTTGGGACTGGACGCTCAGGAAGTTATTTCCTCTATTCAAAAGAGGATTTCTGACTTCGCAGACTATCTTTACAGAAACTCCGGGATGGTTGACATAAGACAAATAGAAAGGAGGATGAACGAGATTATAGGAACACCTAAGAGGTTCCAGTCCAAGCCCTACATAGAGGGGGTGACGTACACCTCTCAGGCGAACGTGAAGCTTGGTACACCTACACCTGCTCCGAGGCCCAATGTTCCTCCTTCCCCTAGACCAAAGGCTCCGGCCACGGCCCGTCCTGCAACACCAGCTACGCCCAAGCCTCCTATACAGGCTTCTCCTACCAAAAAGGAGGGAACCCAAGAAACACCTCCTCAGGTGAAGGGAACAGTAAAGACAACAAGCGAGTCCTACACTACCCCACCGACTAGGCCAGAGGACGTGGGGGTGGGAAAAACCTTCTACGTTCCGACCGTGACCACGAAGACCACCCCTGAAGGCAAGGAGGCCAAGAAGCGCTCTGTCCTGATGCTCTTGACAAGTGATTTAGAGGCTAACAGGCAACAAGGTCTAAGGACGCTAGCTTCCCTTGTGGCCCGGGGAGAGCTAACCGAAGAAGAGGCGAGTGAGCTTGTCAAGGAGGCTGAGAGGATAAGGAGTGGTAAGGGGGCCAAAGCGACCCCAGCCTCGACCCTAAAACCCCCTCCAACTCCAAAACCCTCAGGACAAGAGGGAGGTATTGGAGAGAGGAGAGGGGAAACTCCCCCTCAGGGCCAGGTAGTGGGAGAAAGGGGAGGCGAGGGAGGCCAAAAGAGTGGGGAGAAAGAGGTCAAGGTGAGCCAAGTTCCTCCCGTTTCCCCTATAACCCCTAAGACCTCTATGAAGTCCCCAGAAGTCAAAGTTCCCGAGGTCAAGGTGGTGGTTCCTCAGCAAGCTCCACAGGTTTCCCCATACCAGGTTCCTAGGCAGACCACTTCTCCCAACGCTCTGGAAACCCTTATACTCATAAACACCTTGGTCAATAGATAA

Gene Ontology

Description Category Evidence (source)
GO:0004222 metalloendopeptidase activity molecular function None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4g1gr) rather than this protein.
PDB ID
4g1gr
Method AlphaFoldv2
Resolution 74.17
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50