Protein
- Protein accession
- M4QQL7 [UniProt]
- Representative
- 8gQfh
- Source
- UniProt (cluster: phalp2_22754)
- Protein name
- Peptidase M23 domain-containing protein
- Lysin probability
- 97%
- PhaLP type
- VAL (probability: 99%, predicted by ML model)
- Protein sequence
-
MAKAAVQKGSKINVYKVVSIKEPDARMKKKDPNDYETAKGLNKTTVAINNLGATINGMNAMVADLKQVSLDRLEAANKDKPKMDPKYGTTKKGAGGILGSIAKGFVKTGGSFLGGLLKLLGGMFKLFVVLPILNWLSKEENQQKVAGALEVMAKIGKFIWEWAKFGITNTIDGLYKLLSDDSSWMDRIVGFGQAFLGFASIFIGMKGIAWLLNPIKVVKGITTAIRALIKFATNRGLSSVTGRRGRKGRGFAAGGALATTMTVAPMVMMPQMAKGGWISGPMSGYPVSLDGGRSTSFIGHGTEYVSRKAGGQAFVTPYDTPATRRNPGLTAMRQSEAKRKGFAEGGEVKESLNLTEDQFRNLAFGVSGEAQRGTDDEFGVAAAILNRVADPRYPNSIMQVLSAPDQYEAYHKGKMKFDDQLQDRLSSQKGQEGIIAALRELKGRTDFKGTAMYKYMGADDIKFSRKGNFYHYPEQKAKSDPPPDTIPTHYLRFIQNQEASDDQQRGESNDRAAVGVMGKLSSALSGVTNVLSNIFLGGPASAAENPGVVDPPPKKLESEANNKEGANASAGIESGSLGEKVFPLPKGRFQATARQVFGASRGGRSHAGVDLTEAPPWGSDPKIPVVAAIAGSVLAERYKAGQTYYSGMMIRGQDGHDQRYLHMEPAVKPGQEVNAGDQIGRLYDDGDNSHLHFEVYKNGKGGPLNPSLIYPSMFKAGTAGGGQMTNPTAFQSTSSVTSPGQNKPPQAAIQSMGDMSREEDTTFSGQDGIISQFSAEKTRATELQKATDDRDKERENFKAGVENAVMQASQQVQRSNQQSAAAIQQSQQGVQQAAQSGGGGKDVITGGLPGIGNVNINGVMKTTAYALNSNNNFMRGILR
- Physico-chemical properties
- protein length: 879 AA; molecular weight: 94087,5 Da; isoelectric point: 9,53; hydropathy: -0,45
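The page does not say how the hydropathy value is computed. One common convention is the GRAVY score, the mean Kyte-Doolittle hydropathy per residue. The sketch below assumes that convention, so its result for the full sequence need not match the listed value exactly. The function name `gravy` is illustrative and not part of any PhaLP tooling.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues.

    Assumption: the sequence contains only the 20 standard one-letter
    codes; any other character raises KeyError.
    """
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


if __name__ == "__main__":
    # First ten residues of the protein sequence listed above.
    print(round(gravy("MAKAAVQKGS"), 3))
```

Passing the full 879-residue sequence in place of the ten-residue prefix gives a value that can be compared with the hydropathy figure on this page.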
Representative Protein Details
- Accession
- 8gQfh
- Protein name
- 8gQfh
- Sequence length
- 725 AA
- Molecular weight
- 77769,77220 Da
- Isoelectric point
- 9,40911
- Sequence
-
MKDGEKPTQLEEGEEPEEPQQQLARGGKVKTPQKALGGFINGPQSGYPVSLDGGRSTAFIGHGREYVARKSDGGAFVVPINTPGTKTQPHLTQKRVQEAKSQGYSVPGMAIGGDYLKAVKENDRTQGDNSAKKIFLHWSAGHRDGTNFYKGHGYHTYIPSSGQPVRKAPFGKQGAPYHTYGRPQTQSAAIGVAGMSTANNENGKDWGSQAITPNQYKGMAKEAAAIATAWGWKPSDITDKRVRTHAEEYRDYPNWYHRNNSSHYRWDLSRLYAGEPAFSGGPKIRNMIKQEMGGAAGDAKDEPDAKDGARQVQQPRGIMSNFLGAIDAMTGGRTDFDGMGGGQRNITNPDQKGEKEANTREAQESKDKKQGMVFPLPNGRFAAGPRQVYGAGRSYGGHAGIDLTEMPPFGSDPKIPVVAAISGTVLKEKYKGGQTYYSGMMIQGEDGYDQRYLHMEPTVKPGQKVQAGQVIGKLYDDADNTHLHFEVYKRGKGGHLNPAQIYPDLFKPGATGGGGEIQGSVTPPGATNPPDAPGTDSAANPSGAANVAKTIIPKASSAMSFEERYGSTRYTNDNFELATKKEEERRQQLQAMLPNMSNAMGPGAPQAGQQQGASAKQLQKITKDRNNARQLVQNRTQEMIQQVMAQVAKQNGMNQQMAQQASLQISQMMQQSMAQQNAAKNAPPQVVGGGGGGAAGGRGTELVGGIVKTTASALNSNLNPLKGLF
Other Proteins in cluster: phalp2_22754
Total (incl. this protein): 42 | Avg length: 875,8 | Avg pI: 9,74

| Protein ID | Length (AA) | pI |
|---|---|---|
| 8gQfh | 725 | 9,40911 |
| 12e5g | 951 | 10,01628 |
| 1SvsA | 878 | 9,61767 |
| 1UKce | 878 | 9,58324 |
| 1UPxn | 936 | 9,82958 |
| 1VXYn | 746 | 9,35535 |
| 1kubg | 907 | 9,68259 |
| 1sg1J | 1025 | 9,85265 |
| 1t6Ej | 876 | 9,75763 |
| 1tBpm | 868 | 9,75821 |
| 1tdnp | 1013 | 9,89301 |
| 1yYDy | 954 | 10,06185 |
| 1zxHu | 1021 | 9,86071 |
| 26bbd | 936 | 9,69509 |
| 27pb5 | 934 | 9,67569 |
| 2h3Mf | 931 | 9,75621 |
| 3BUNg | 982 | 10,04000 |
| 3xdTT | 844 | 9,80617 |
| 4R93f | 880 | 9,64352 |
| 4RGyd | 1013 | 9,68297 |
| 4uoaU | 695 | 9,49073 |
| 4upd8 | 881 | 9,61148 |
| 4ux0B | 936 | 9,87419 |
| 4x7E6 | 727 | 9,72346 |
| 6O4lA | 955 | 10,07707 |
| 7AkSc | 868 | 9,85504 |
| 7PyJU | 879 | 9,53225 |
| 7p5S1 | 1012 | 9,68297 |
| 8CxDj | 982 | 10,04000 |
| 8Dnqa | 855 | 9,79818 |
| 8F2fa | 855 | 9,79818 |
| 8GEy8 | 662 | 9,55146 |
| 8GR1v | 935 | 9,83712 |
| 8HqvF | 867 | 9,86239 |
| 8LSV4 | 936 | 9,68265 |
| 8c04G | 855 | 9,79818 |
| 8gzqH | 935 | 9,83712 |
| 8xNJz | 669 | 9,51961 |
| dUmV | 947 | 10,06476 |
| r2y1 | 973 | 9,97521 |
| A0A8S0I168 | 183 | 8,83128 |
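The table above writes pI values with a decimal comma, which most parsers reject. The sketch below shows one way to read such rows and compute summary statistics. The helper names are hypothetical, and nothing here is claimed to reproduce how the page derives its own averages.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a number written with a decimal comma (e.g. '9,40911')."""
    return float(text.strip().replace(",", "."))


def parse_cluster_rows(table: str):
    """Return (protein_id, length, pI) tuples from a Markdown table.

    Header, separator and malformed lines are skipped: a data row must
    have exactly three cells and an all-digit length cell.
    """
    rows = []
    for line in table.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        if len(cells) != 3 or not cells[1].isdigit():
            continue
        rows.append((cells[0], int(cells[1]), parse_decimal_comma(cells[2])))
    return rows


def mean(values) -> float:
    values = list(values)
    return sum(values) / len(values)


# The first three rows of the cluster table, used as sample input.
SAMPLE = """\
| Protein ID | Length (AA) | pI |
|---|---|---|
| 8gQfh | 725 | 9,40911 |
| 12e5g | 951 | 10,01628 |
| 1SvsA | 878 | 9,61767 |
"""

if __name__ == "__main__":
    rows = parse_cluster_rows(SAMPLE)
    print(len(rows), mean(r[1] for r in rows), mean(r[2] for r in rows))
```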
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_21079 (8LRzV) | 133 | 33,1% | 763 | 4.774E-78 |
| 2 | phalp2_32545 (1rZh9) | 26 | 26,1% | 584 | 3.609E-22 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Cyanophage P-RSM1 [NCBI] | 536444 | Kyanoviridae > Emcearvirus > Emcearvirus gerard |
| Host | Prochlorococcus marinus str. MIT 9303 [NCBI] | 59922 | Cyanobacteria > Prochlorales > Prochlorococcaceae > Prochlorococcus > |
Coding sequence (CDS)
CDS Source ID
HQ634175 [NCBI]
CDS location
range 45785 -> 48424
strand -
CDS
ATGGCAAAGGCAGCAGTACAAAAAGGTAGTAAGATTAATGTTTACAAGGTAGTAAGCATTAAAGAACCTGATGCGAGGATGAAGAAGAAAGATCCTAACGATTATGAAACAGCAAAAGGTCTTAATAAAACTACAGTAGCAATCAATAACTTAGGTGCCACTATCAATGGCATGAATGCGATGGTTGCTGATCTTAAGCAAGTATCATTAGATCGTCTTGAAGCAGCGAATAAAGATAAACCTAAAATGGATCCCAAGTATGGGACAACGAAGAAGGGTGCGGGTGGTATACTAGGTTCGATTGCAAAAGGATTTGTAAAGACTGGTGGTAGTTTCCTTGGTGGTTTGTTAAAACTACTAGGTGGGATGTTCAAGTTATTTGTAGTTCTTCCCATCTTAAATTGGTTATCCAAAGAAGAGAACCAACAAAAAGTTGCAGGTGCCCTAGAGGTAATGGCAAAGATTGGTAAGTTCATTTGGGAGTGGGCAAAGTTTGGAATCACAAATACGATAGATGGACTATATAAATTACTCAGCGATGATTCATCATGGATGGATCGTATTGTTGGATTTGGCCAGGCGTTCCTAGGATTTGCTTCTATCTTCATTGGCATGAAGGGGATAGCATGGTTACTAAATCCAATCAAAGTCGTTAAGGGTATTACAACAGCAATCAGAGCATTAATTAAGTTCGCAACTAACAGAGGGTTATCAAGTGTGACTGGTAGAAGAGGAAGAAAAGGAAGAGGATTTGCAGCAGGTGGTGCTCTTGCCACAACAATGACAGTAGCACCTATGGTAATGATGCCACAAATGGCAAAAGGTGGATGGATTAGTGGACCTATGTCAGGTTATCCTGTATCACTGGATGGTGGCAGGTCAACATCATTCATTGGTCATGGTACAGAGTATGTCTCACGGAAGGCAGGTGGACAAGCATTTGTCACCCCGTATGATACTCCTGCAACCCGTAGGAATCCTGGTCTGACTGCTATGAGACAATCAGAAGCAAAGAGAAAAGGTTTTGCTGAGGGTGGAGAAGTAAAAGAATCGTTGAATCTAACTGAAGATCAGTTCAGAAATCTTGCATTTGGTGTGAGTGGTGAAGCACAACGGGGCACTGATGATGAGTTTGGTGTTGCAGCAGCAATCTTGAATAGAGTTGCAGATCCTAGGTATCCTAATAGTATCATGCAAGTTCTCTCTGCTCCTGATCAGTATGAAGCATACCATAAGGGTAAGATGAAATTTGATGATCAATTACAAGACAGACTCTCATCTCAAAAAGGACAAGAAGGAATCATTGCTGCACTGAGAGAACTTAAAGGTAGAACTGACTTTAAAGGAACTGCCATGTACAAATACATGGGTGCAGATGATATTAAGTTCTCTAGAAAAGGAAACTTCTATCACTATCCAGAACAGAAAGCAAAATCCGATCCTCCTCCAGATACTATTCCAACTCATTACCTAAGATTCATTCAGAATCAAGAGGCATCAGACGATCAGCAAAGAGGGGAATCAAATGATCGTGCTGCGGTGGGTGTGATGGGCAAACTAAGTTCAGCACTCTCTGGTGTTACTAACGTGCTTAGTAACATTTTCTTGGGAGGTCCTGCTTCTGCTGCAGAAAATCCAGGTGTCGTAGATCCTCCTCCAAAAAAATTAGAATCAGAAGCAAACAATAAAGAAGGTGCAAATGCTAGTGCAGGTATTGAATCTGGTAGTTTAGGAGAGAAAGTATTCCCACTTCCTAAAGGTAGATTCCAAGCAACAGCAAGACAAGTGTTTGGTGCATCAAGAGGAGGAAGAAGTCATGCAGGTGTAGACTTAACTGAAGCACCTCCTTGGGGATCAGATCCAAAGATTCCTGTTGTTGCTGCTATTGCAGGATCAGTGTTAGCGGAAAGATATAAAGCAGGTCAAACATACTACTCAGGTATGATGATCAGAGGTCAAGATGGTCATGATCAGAGATACTTGCACATGGAACCTGCTGT
CAAACCTGGGCAGGAAGTCAATGCAGGTGATCAGATTGGTAGACTTTATGATGATGGAGACAACAGTCACCTACACTTTGAGGTATACAAAAACGGTAAGGGTGGACCTCTAAATCCATCTCTCATCTATCCTAGTATGTTCAAAGCAGGAACTGCAGGTGGAGGACAGATGACTAATCCTACTGCATTCCAGAGCACTTCTAGTGTAACAAGTCCTGGTCAGAACAAACCTCCTCAAGCTGCCATTCAATCGATGGGTGATATGAGTAGGGAGGAGGATACGACTTTCTCTGGTCAAGATGGTATTATCAGTCAGTTCTCTGCTGAAAAGACAAGAGCAACAGAGTTGCAGAAGGCAACCGATGATAGAGATAAAGAAAGAGAAAACTTTAAAGCAGGAGTAGAGAATGCAGTAATGCAAGCATCACAACAGGTTCAACGATCTAACCAACAGAGTGCTGCTGCTATTCAACAGTCACAGCAAGGTGTTCAGCAAGCAGCACAATCGGGTGGTGGAGGTAAGGATGTAATCACTGGTGGTCTGCCTGGAATTGGTAATGTGAACATCAATGGTGTTATGAAAACTACAGCATATGCATTGAACTCTAATAATAACTTTATGCGAGGTATTCTTAGATGA
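The CDS range and the protein length can be checked against each other. The sketch below assumes the range is 1-based and inclusive, as in GenBank feature locations, and that the CDS includes the terminal stop codon (the sequence above ends in TGA). The function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_cds_length(protein_length: int, has_stop_codon: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    codons = protein_length + (1 if has_stop_codon else 0)
    return 3 * codons


if __name__ == "__main__":
    observed = cds_length(45785, 48424)   # range given on this page
    expected = expected_cds_length(879)   # protein length given on this page
    print(observed, expected, observed == expected)  # prints "2640 2640 True"
```

Under those assumptions the range 45785 -> 48424 spans 2640 nt, which matches 879 codons plus one stop codon.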
Gene Ontology
| GO ID | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0004222 | metalloendopeptidase activity | molecular function | None (UniProt) |
| GO:0016020 | membrane | cellular component | None (UniProt) |
| GO:0031640 | killing of cells of another organism | biological process | None (UniProt) |
| GO:0042742 | defense response to bacterium | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi0002c132b2_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
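The confidence bands listed above can be applied to per-residue pLDDT scores programmatically. The legend uses strict inequalities and leaves the exact values 90, 70 and 50 unassigned. The sketch below assumes a boundary value falls into the lower band, which is a choice made here and not something stated on this page. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence label used in the legend.

    Assumption: scores exactly at 90, 70 or 50 are placed in the lower band.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Made-up example scores, not values taken from the model on this page.
    for value in (95.2, 81.0, 63.4, 37.9):
        print(value, plddt_band(value))
```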
The structures below correspond to the cluster representative (8gQfh) rather than this protein.