Protein
- Protein accession
- A0A1D7RP15 [UniProt]
- Representative
- 8ETHI
- Source
- UniProt (cluster: phalp2_6443)
- Protein name
- Mannosyl-glycoprotein endo-beta-N-acetylglucosamidase-like domain-containing protein
- Lysin probability
- 97%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MATVQKSTKINFYKFVATKDVSPSAKGADEGTVATVKVLNKNTEALNNIGSVLNGIAKIAQDLKKVMLMQLDAQQRKNAGIFDAQYTKTKKEKKSGFVAGIVGKSVSFLEGLLTTFSNLFKLLVVIPALKWLSDPKNQETIATALRIIRSVVTFIFDWAKFGITNTIDGLYNLLRDDASWWDRIVGLGQAIAGIGAVVLGIRYLSNPLKLVKDITNGVRALVRFVLGGGRGGGGGGKKPRGRFGGAMRLLGGAAIVGGSAYAISQMNQPEEKSQGGSVKILPSRSQGGWISGPQSGYKVSLDGGRSTSFIGHGTEYVARKANGGAFVVPFNTPGTKTQPHLTQKRLGEAKRLGYFSNGGEITGNNEQKWSKVMGLGKAAGAKYPELVAAQFALESAWGTALSAKNNFFGIKATGNESATVSNTREVINGRSVYVDARFKNFSTPFDAINHLVTQWYKDYRGYTGVNNAPDAFAAASKLQKEGYATDPQYTKHLSRLMQQYSNLRGAQVTAPTQQPSGNPFMNFFGNLANSVLGIGGANAAEHGGNDNQGRAPEYTGGTADVVGISHPDTGAGYGIKDQKDQHGRPLAFSQPAAEQFSKALKASGMNLGAYIASTGRSDAKNNSVGGHPNSHHMYGEAIDMNGDGYEWMRKNGRKYGWQYVYNHGPGSAHFKYVGPGAGSTPKLGPPGSKPKVGPSVTGGSSSSTSTHSSGTREETKKKVTLADIMGGRSTRPVPLENYKTGNINSHGDDRSIIDATEERNRARQRANEKSSQMVQAAIEAVRLSNTSNEQIIMQAQQGIQQAMQMGSGNSQPQVIPTGGGRGAVKSVVSSLASSINPLRGIFK
- Physico‐chemical
properties -
protein length: 843 AA molecular weight: 89666,0 Da isoelectric point: 9,92 hydropathy: -0,42
Representative Protein Details
- Accession
- 8ETHI
- Protein name
- 8ETHI
- Sequence length
- 688 AA
- Molecular weight
- 73409,14040 Da
- Isoelectric point
- 7,66249
- Sequence
-
mtgiglpvavagllsslvigglagwgayeasyntlkalglrddnpelkrqgyqeggrvrkpikrtigggkkkpkakvrkfvrkptperlpelpqstekggkdrawwdflgwagtgekplgpggeqltqkvtdvgnnlgdndffgpilrvtskiildqevnnkdiknvglginflinkglsdkkisegikgyaqggmvsplgdglslekwveqafkpiakknystrytsggmktsdmdgdevstrttsstptsgsggtsltgdnqakwkavyamaekagakypelvaaqfglesawgtalaaknnffgikatssedatvsntrevvngqsvyvdarfknfntpqdcinhlvtqwykdykgysgvnnasdaqaaadqlkregyatdpqyasslkrimseysgirgtnadiasslvmsgdspdnpddagsggsvggdgkfiqgnsgasagvhfhvgpghqtdgtllqsqyfadarasakkvvdhflgkgstiydgrrgvyfkssdevaaaqrahtssgsaggidmqvdfetphkfplkvssmayrrngfgvsaniegsksfvshgrydeqgnvapqermklyhaglrlggkeqiakilkgetildedtskalgstllarlddastpagiqkvlqsvmgisefpsydsragqvimiddsdeapmeeqpapvmmgalggtsydnsmdfldyqg
Other Proteins in cluster: phalp2_6443
| Total (incl. this protein): 40 | Avg length: 828,8 | Avg pI: 9,79 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 8ETHI | 688 | 7,66249 |
| 8Aaz1 | 667 | 6,78808 |
| A0A1D7S4A6 | 840 | 9,93414 |
| A0A1D7RHQ7 | 843 | 9,92351 |
| A0A1D7RJK3 | 842 | 9,94330 |
| A0A1D7R817 | 840 | 9,90365 |
| A0A1D7S5X8 | 840 | 9,93414 |
| A0A1D7S2V6 | 840 | 9,93414 |
| A0A1D7RWU9 | 840 | 9,92351 |
| A0A1D7S1S6 | 840 | 9,93414 |
| A0A1D7RLN8 | 840 | 9,94330 |
| A0A1D7R8M5 | 843 | 9,93434 |
| A0A1D7RB33 | 843 | 9,94330 |
| A0A1D7RCA8 | 843 | 9,92351 |
| A0A1D7RDJ4 | 843 | 9,92351 |
| A0A1D7RE90 | 843 | 9,94330 |
| A0A1D7RER3 | 843 | 9,92351 |
| A0A1D7RFD3 | 843 | 9,94330 |
| A0A1D7RGH2 | 843 | 9,92351 |
| A0A1D7RH84 | 843 | 9,94330 |
| A0A1D7RIK8 | 843 | 9,94330 |
| A0A1D7RN73 | 843 | 9,92351 |
| A0A1D7RS06 | 843 | 9,93247 |
| A0A1D7RT28 | 843 | 9,89488 |
| A0A1D7RUR2 | 843 | 9,94330 |
| A0A1D7RVB9 | 843 | 9,92351 |
| A0A1D7RXZ4 | 843 | 9,94330 |
| A0A1D7S108 | 843 | 9,92351 |
| A0A1D7S5J0 | 843 | 9,96283 |
| A0A1D7S7H2 | 843 | 9,94330 |
| M4PLH8 | 735 | 9,88985 |
| M4PM11 | 735 | 9,88985 |
| A0A1D7RAG5 | 843 | 9,92351 |
| A0A1D7RSN2 | 843 | 9,92351 |
| A0A1D7RTS2 | 843 | 9,92351 |
| A0A1D7RVJ3 | 843 | 9,94330 |
| A0A1D7RYJ8 | 843 | 9,92351 |
| A0A1D7S020 | 843 | 9,92351 |
| A0A1D7S0M6 | 843 | 9,94330 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_29679
16oyO
|
4 | 41,1% | 457 | 1.113E-73 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Synechococcus phage S-RIM2 [NCBI] |
687800 | Kyanoviridae > Nerrivikvirus > Nerrivikvirus srim2 |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
KX349252
[NCBI]
CDS location
range 9120 -> 11651
strand +
strand +
CDS
ATGGCAACGGTACAAAAAAGTACTAAAATCAATTTCTACAAGTTTGTAGCAACAAAAGATGTCTCACCATCTGCGAAGGGTGCTGATGAGGGAACGGTTGCAACTGTAAAAGTCCTTAATAAAAATACTGAAGCACTTAATAATATTGGTAGTGTCTTGAATGGCATTGCTAAGATCGCTCAAGACCTTAAAAAGGTCATGTTGATGCAACTTGATGCACAACAAAGAAAGAATGCTGGAATTTTTGACGCACAATATACCAAAACCAAAAAAGAGAAAAAGAGTGGATTTGTTGCTGGTATTGTAGGCAAATCAGTATCCTTCTTGGAGGGATTATTAACGACATTTTCCAACCTCTTTAAGTTATTAGTTGTTATACCTGCTCTCAAATGGTTATCAGATCCTAAGAATCAAGAAACAATAGCAACAGCACTGAGAATTATTAGATCAGTAGTTACATTCATCTTTGATTGGGCAAAGTTTGGTATTACCAACACCATTGATGGACTGTATAATCTTCTAAGGGACGATGCGTCTTGGTGGGATAGGATAGTTGGTCTCGGTCAAGCAATTGCAGGTATTGGGGCGGTTGTCCTGGGTATCAGATATCTTTCAAATCCTCTGAAACTTGTAAAGGACATTACCAACGGCGTTAGAGCACTCGTCAGATTCGTCCTTGGTGGCGGTAGAGGGGGTGGAGGAGGTGGCAAGAAACCTCGCGGACGCTTTGGTGGAGCAATGCGCTTGCTCGGTGGCGCAGCGATCGTTGGTGGTTCTGCCTATGCCATCAGTCAGATGAACCAACCTGAGGAGAAATCCCAAGGAGGTTCAGTAAAGATTCTTCCAAGTAGATCACAAGGAGGTTGGATTAGTGGACCACAATCGGGATATAAGGTATCTTTGGACGGGGGGAGATCAACCTCGTTCATCGGACATGGAACTGAGTATGTTGCTAGAAAGGCAAATGGGGGAGCTTTCGTCGTTCCTTTTAATACTCCTGGAACAAAAACACAACCCCATCTGACTCAGAAGAGACTTGGTGAGGCAAAGAGACTTGGATATTTCTCTAATGGTGGAGAGATAACTGGTAATAATGAACAGAAATGGTCCAAGGTCATGGGTCTTGGCAAAGCAGCAGGTGCTAAGTATCCAGAACTTGTCGCTGCACAGTTTGCTCTAGAAAGTGCATGGGGCACAGCACTATCTGCTAAAAACAACTTCTTCGGTATCAAAGCAACTGGTAATGAATCCGCCACAGTTTCCAACACAAGAGAGGTTATTAATGGTCGAAGTGTTTATGTTGATGCAAGATTTAAGAACTTTAGCACACCTTTTGATGCTATCAATCATCTAGTAACACAATGGTATAAAGATTATAGGGGGTATACTGGTGTTAACAATGCACCAGACGCTTTTGCTGCTGCGTCTAAGTTGCAGAAAGAAGGTTATGCTACAGATCCTCAATATACCAAGCATCTATCCAGATTGATGCAACAATATTCTAATCTAAGAGGGGCACAGGTTACGGCACCAACACAACAACCTAGTGGTAATCCCTTTATGAACTTCTTTGGGAATCTAGCGAACTCTGTACTGGGTATCGGTGGCGCAAACGCAGCTGAGCATGGAGGTAATGATAATCAAGGTAGAGCACCTGAATATACTGGCGGTACGGCTGATGTGGTAGGTATTTCTCATCCTGATACAGGTGCTGGATATGGCATTAAAGATCAGAAGGATCAACATGGTAGACCCTTAGCATTCTCTCAACCTGCTGCAGAACAGTTTTCAAAGGCGTTGAAGGCATCTGGAATGAACCTTGGTGCATATATTGCTAGCACGGGTCGTAGTGATGCAAAGAACAACTCTGTTGGTGGGCATCCAAACTCACACCATATGTACGGTGAAGCGATTGACATGAATGGTGATGGATATGAATGGATGAGAAAGAATGGTAGAAAATATGGTTGGCAATACGTTTATAATCATGGACCTGGTAGTGCTCACTTCAAATATGTTGGACCTGGTGCAGGATCTACTCCTAAGTTAGGTCCTCCTGGCAGCAAACCCAAGGTGGGACCTTCCGTTACTGGGGGTTCTAGTAGTTCGACTTCGACGCATAGTAGTGGAACCAGAGAAGAAACTAAGAAGAAAGTTACACTTGCTGATATCATGGGTGGTAGATCTACTAGACCTGTACCACTAGAAAACTATAAAACTGGCAATATAAATTCACATGGTGATGATCGATCTATAATCGATGCGACTGAAGAGAGAAATCGAGCAAGACAAAGGGCAAATGAGAAGTCTTCCCAAATGGTTCAAGCGGCAATAGAAGCAGTTAGACTATCTAATACTAGTAATGAACAAATTATTATGCAGGCACAACAAGGTATTCAGCAAGCAATGCAAATGGGTAGTGGCAATAGTCAACCTCAGGTCATTCCAACAGGTGGTGGAAGAGGTGCTGTTAAGTCTGTTGTTTCCTCACTCGCGTCCTCTATTAATCCTCTGAGAGGTATCTTCAAATGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0004040 | amidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(8ETHI)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50