Protein
- Protein accession
- A0A1D7RLN8 [UniProt]
- Representative
- 8ETHI
- Source
- UniProt (cluster: phalp2_6443)
- Protein name
- Mannosyl-glycoprotein endo-beta-N-acetylglucosamidase-like domain-containing protein
- Lysin probability
- 63%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MATVQKSTKINFYKFVATKDVSPSAKGADEGTVATVKVLNKNTEALNNIGSVLNGIAKIAQDLKKVMLMQLDAQQRKNAGTFDAQYTKTKKEKKSGFVAGIVGKSVSFLEGLLTTFSNLFKLLVVIPALKWLSDPKNQETIATALRIIRSVVTFIFDWAKFGITNTIDGLYNLLRDDASWWDRIVGLGQAIAGIGAVVLGIRYLSNPLKLVKDITNGVRALVRFVLGGGRGGGKKPRGRFGGAMRLLGGAAIVGGSAYAISQMNQPEEKSQGGSVKILPSRSQGGWISGPQSGYKVSLDGGRSTSFIGHGTEYVARKANGGAFVVPFNTPGTKTQPHLTQKRLGEAKRLGYFSNGGEITGNNEQKWSKVMGLGKAAGAKYPELVAAQFALESAWGTALSAKNNFFGIKATGNESATVSNTREVINGRSVYVDARFKNFSTPFDAINHLVTQWYKDYRGYTGVNNAPDAFAAASKLQKEGYATDPQYTKHLSRLMQQYSNLRGAQVTAPTQQPSGNPFMNFFGNLANSVLGIGGANAAEHGGNDNQGRAPEYTGGTADVVGISHPDTGAGYGIKDQKDQHGRPLAFSQPAAEQFSKALKASGMNLGAYIASTGRSDAKNNSVGGHPNSHHMYGEAIDMNGDGYEWMRKNGRKYGWQYVYNHGPGSAHFKYVGPGAGSTPKLGPPGSKPKVGPSVTGGSSSSTSTHSSGTREETKKKVTLADIMGGRSTRPVPLENYKTGNINSHGDSRSIIDATEERNRARQRANEKSSQMVQAAIEAVRLSNTSNEQIIMQAQQGIQQAMQMGSGNSQPQVIPTGGGRGAVKSVVSSLASSINPLRGIFK
- Physico‐chemical
properties -
protein length: 840 AA molecular weight: 89454,8 Da isoelectric point: 9,94 hydropathy: -0,43
Representative Protein Details
- Accession
- 8ETHI
- Protein name
- 8ETHI
- Sequence length
- 688 AA
- Molecular weight
- 73409,14040 Da
- Isoelectric point
- 7,66249
- Sequence
-
mtgiglpvavagllsslvigglagwgayeasyntlkalglrddnpelkrqgyqeggrvrkpikrtigggkkkpkakvrkfvrkptperlpelpqstekggkdrawwdflgwagtgekplgpggeqltqkvtdvgnnlgdndffgpilrvtskiildqevnnkdiknvglginflinkglsdkkisegikgyaqggmvsplgdglslekwveqafkpiakknystrytsggmktsdmdgdevstrttsstptsgsggtsltgdnqakwkavyamaekagakypelvaaqfglesawgtalaaknnffgikatssedatvsntrevvngqsvyvdarfknfntpqdcinhlvtqwykdykgysgvnnasdaqaaadqlkregyatdpqyasslkrimseysgirgtnadiasslvmsgdspdnpddagsggsvggdgkfiqgnsgasagvhfhvgpghqtdgtllqsqyfadarasakkvvdhflgkgstiydgrrgvyfkssdevaaaqrahtssgsaggidmqvdfetphkfplkvssmayrrngfgvsaniegsksfvshgrydeqgnvapqermklyhaglrlggkeqiakilkgetildedtskalgstllarlddastpagiqkvlqsvmgisefpsydsragqvimiddsdeapmeeqpapvmmgalggtsydnsmdfldyqg
Other Proteins in cluster: phalp2_6443
| Total (incl. this protein): 40 | Avg length: 828,8 | Avg pI: 9,79 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 8ETHI | 688 | 7,66249 |
| 8Aaz1 | 667 | 6,78808 |
| A0A1D7S4A6 | 840 | 9,93414 |
| A0A1D7RHQ7 | 843 | 9,92351 |
| A0A1D7RJK3 | 842 | 9,94330 |
| A0A1D7R817 | 840 | 9,90365 |
| A0A1D7S5X8 | 840 | 9,93414 |
| A0A1D7S2V6 | 840 | 9,93414 |
| A0A1D7RWU9 | 840 | 9,92351 |
| A0A1D7S1S6 | 840 | 9,93414 |
| A0A1D7R8M5 | 843 | 9,93434 |
| A0A1D7RB33 | 843 | 9,94330 |
| A0A1D7RCA8 | 843 | 9,92351 |
| A0A1D7RDJ4 | 843 | 9,92351 |
| A0A1D7RE90 | 843 | 9,94330 |
| A0A1D7RER3 | 843 | 9,92351 |
| A0A1D7RFD3 | 843 | 9,94330 |
| A0A1D7RGH2 | 843 | 9,92351 |
| A0A1D7RH84 | 843 | 9,94330 |
| A0A1D7RIK8 | 843 | 9,94330 |
| A0A1D7RN73 | 843 | 9,92351 |
| A0A1D7RP15 | 843 | 9,92351 |
| A0A1D7RS06 | 843 | 9,93247 |
| A0A1D7RT28 | 843 | 9,89488 |
| A0A1D7RUR2 | 843 | 9,94330 |
| A0A1D7RVB9 | 843 | 9,92351 |
| A0A1D7RXZ4 | 843 | 9,94330 |
| A0A1D7S108 | 843 | 9,92351 |
| A0A1D7S5J0 | 843 | 9,96283 |
| A0A1D7S7H2 | 843 | 9,94330 |
| M4PLH8 | 735 | 9,88985 |
| M4PM11 | 735 | 9,88985 |
| A0A1D7RAG5 | 843 | 9,92351 |
| A0A1D7RSN2 | 843 | 9,92351 |
| A0A1D7RTS2 | 843 | 9,92351 |
| A0A1D7RVJ3 | 843 | 9,94330 |
| A0A1D7RYJ8 | 843 | 9,92351 |
| A0A1D7S020 | 843 | 9,92351 |
| A0A1D7S0M6 | 843 | 9,94330 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_29679
16oyO
|
4 | 41,1% | 457 | 1.113E-73 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Synechococcus phage S-RIM2 [NCBI] |
687800 | Kyanoviridae > Nerrivikvirus > Nerrivikvirus srim2 |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
KX349248
[NCBI]
CDS location
range 9135 -> 11657
strand +
strand +
CDS
ATGGCAACGGTACAAAAAAGTACTAAAATCAATTTCTACAAGTTTGTAGCAACAAAAGATGTCTCACCATCTGCGAAGGGTGCTGATGAGGGAACGGTTGCAACTGTAAAAGTCCTTAATAAAAATACTGAAGCACTTAATAATATTGGTAGTGTCTTGAATGGCATTGCTAAGATCGCTCAAGACCTTAAAAAGGTCATGTTGATGCAACTTGATGCACAACAAAGAAAGAATGCTGGAACTTTTGACGCACAATATACCAAAACCAAAAAAGAGAAAAAGAGTGGATTTGTTGCTGGTATTGTAGGCAAATCAGTATCCTTCTTGGAGGGATTATTAACGACATTTTCCAACCTCTTTAAGTTATTAGTTGTTATACCTGCTCTCAAATGGTTATCAGATCCTAAGAATCAAGAAACAATAGCAACAGCACTGAGAATTATTAGATCAGTAGTTACATTCATCTTTGATTGGGCAAAGTTTGGTATTACCAACACCATTGATGGACTGTATAATCTTCTAAGGGACGATGCGTCTTGGTGGGATAGGATAGTTGGTCTCGGTCAAGCAATTGCAGGTATTGGGGCGGTTGTCCTGGGTATCAGATATCTTTCAAATCCTCTGAAACTTGTAAAGGACATTACCAACGGCGTTAGAGCACTCGTCAGATTCGTCCTTGGTGGCGGTAGAGGAGGTGGCAAGAAACCTCGCGGACGCTTTGGTGGAGCAATGCGCTTGCTCGGTGGCGCAGCGATCGTTGGTGGTTCTGCCTATGCCATCAGTCAGATGAACCAACCTGAGGAGAAATCCCAAGGAGGTTCAGTAAAGATTCTTCCAAGTAGATCACAAGGAGGTTGGATTAGTGGACCACAATCGGGATATAAGGTATCTTTGGACGGGGGGAGATCAACCTCGTTCATCGGACATGGAACTGAGTATGTTGCTAGAAAGGCAAATGGGGGAGCTTTCGTCGTTCCTTTTAATACTCCTGGAACAAAAACACAACCCCATCTGACTCAGAAGAGACTTGGTGAGGCAAAGAGACTTGGATATTTCTCTAATGGTGGAGAGATAACTGGTAATAATGAACAGAAATGGTCCAAGGTCATGGGTCTTGGCAAAGCAGCAGGTGCTAAGTATCCAGAACTTGTCGCTGCACAGTTTGCTCTAGAAAGTGCATGGGGCACAGCACTATCTGCTAAAAACAACTTCTTCGGTATCAAAGCAACTGGTAATGAATCCGCCACAGTTTCCAACACAAGAGAGGTTATTAATGGTCGAAGTGTTTATGTTGATGCAAGATTTAAGAACTTTAGCACACCTTTTGATGCTATCAATCATCTAGTAACACAATGGTATAAAGATTATAGGGGGTATACTGGTGTTAACAATGCACCAGACGCTTTTGCTGCTGCGTCTAAGTTGCAGAAAGAAGGTTATGCTACAGATCCTCAATATACCAAGCATTTATCCAGATTGATGCAACAATATTCTAATCTAAGAGGGGCACAGGTTACGGCACCAACACAACAACCTAGTGGTAATCCCTTTATGAACTTCTTTGGGAATCTAGCGAACTCTGTACTGGGTATCGGTGGCGCAAACGCAGCTGAGCATGGAGGTAATGATAATCAAGGTAGAGCACCTGAATATACTGGCGGTACGGCTGATGTGGTAGGTATTTCTCATCCTGATACAGGTGCTGGATATGGCATTAAAGATCAGAAGGATCAACATGGTAGACCCTTAGCATTCTCTCAACCTGCTGCAGAACAGTTTTCAAAGGCGTTGAAGGCATCTGGAATGAACCTTGGTGCATATATTGCTAGCACGGGTCGTAGTGATGCAAAGAACAACTCTGTTGGTGGGCATCCAAACTCACACCATATGTACGGTGAAGCGATTGACATGAATGGTGATGGATATGAATGGATGAGAAAGAATGGTAGAAAATATGGTTGGCAATACGTTTATAATCATGGACCTGGTAGTGCTCACTTCAAATATGTTGGACCTGGTGCAGGATCTACTCCTAAGTTAGGTCCTCCTGGCAGCAAACCCAAGGTGGGACCTTCCGTTACTGGGGGTTCTAGTAGTTCGACTTCGACGCATAGTAGTGGAACCAGAGAAGAAACTAAGAAGAAAGTTACACTTGCTGATATCATGGGTGGTAGATCTACTAGACCTGTACCACTAGAAAACTATAAAACTGGCAATATAAATTCACATGGTGATTCTCGATCTATAATCGATGCGACTGAAGAGAGAAATCGAGCAAGACAAAGGGCAAATGAGAAGTCTTCCCAAATGGTTCAAGCGGCAATAGAAGCAGTTAGACTATCTAATACTAGTAATGAACAAATTATTATGCAGGCACAACAAGGTATTCAGCAAGCAATGCAAATGGGTAGTGGCAATAGTCAACCTCAGGTCATTCCAACAGGCGGTGGAAGAGGTGCTGTTAAGTCTGTTGTTTCCTCACTCGCGTCCTCTATTAATCCTCTGAGAGGTATCTTCAAATGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0004040 | amidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi0002c09626_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(8ETHI)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50