Protein
- Protein accession
- M1TV98 [UniProt]
- Representative
- 4upJo
- Source
- UniProt (cluster: phalp2_1947)
- Protein name
- Uncharacterized protein
- Lysin probability
- 99%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MQKSGKTSKIDFYKFISPKGIKAGDDSEGTAAANVIAVKTVKANNQIGKTLNGIGAVLQDIHKKMAVNAMMEADYHKELKKSIADDSKPKNTKPVAEKRSGITDFLAPVVGSFFEGLANLAGWFLKTFVARAVLEWLSNPENFEKLTNIVEGIKAVGMFIYNFFKGTIGGILDGIAKMWDPEASWWEKLLGFGQFFISLGGLLLGLRWLKNPLKLVKDFVWVLTTLYNNLLRGKKRMKARGRFGLVKGLVATTVIVGGAGLIINATNNASEEDMAPVDGGKNTVPVGDSGSKKYAIMTYGTDQHKDPDKAAEYVSDQLTKAKEMGYNTVFIPPSSQGDKFAEVSEATTNAAQAAGAIIENASFDPDKEYEKMMPSSMKAIQSKYNGAAVFGDKFARLADKPNRVGGANTLPKLEEKAAGGWITGPQSGYPVSLDGGRSVSFIGHGTEYVAQRSGGGFVVPFDTPATRKNPSLTGQRIGEASRGGFKLGGMLPGFDIGGALTKMLPHFAAGGQITAIQQKALDVLAKYESGAAGYNAVNQIGTNAGRGVKGFSGDITKMKQHGGKALTRFTIGDIKKLQHDDRSMSDDQWINAGKLHAVGRYQFIGNTLPGVAKRAGLKDSDLFSEKNQDIMAIQLMKERGISPWVGPSDKATKEERAIVAAVQQNRSGAGLLDYDLGTTTAAEGTNASEGAADTTAELTPEQKLNFALEKLVGGIKDVRGVMHGSEVAAATEDNLDAKDQAEKNEVSKTEEQMAAATAIATSVSKATAGKAVETAAGAGGGQKTIVVPTEEKEGLLTFMPGFGLFGGSS
- Physico‐chemical
properties -
protein length: 809 AA molecular weight: 85843,7 Da isoelectric point: 9,04 hydropathy: -0,27
Representative Protein Details
- Accession
- 4upJo
- Protein name
- 4upJo
- Sequence length
- 809 AA
- Molecular weight
- 85938,73330 Da
- Isoelectric point
- 9,13744
- Sequence
-
MQKSGKTSKIDFYKFISPKGIKAGDDSEGTAAANVIAVKTVKANNQIGKTLNGIGAVLQDIHKKMAVNAMMEADYHKELKKSIADDSKPKNTKPVAEKRSGITDFLAPVVGSFFEGLANLAGWFLKTFVARAVLEWLSNPENFEKLTNIVEGIKAVGMFIYNFFKGTIGGILDGIAKMWDPEASWWEKLLGFGQFFISLGTLLLGLRWLKNPLKLVKDFVWVLTTLYNNLLRGKKRMKARGRFGLVKGLVATTVIVGGTGLIINATNNAGDLENSVTNDGKNTVPVGDSGSKKYAIMTYGTDQHKDPEKAAEYVSDQLTKAKEMGYNTVFIPPSSQGDKFAEVNEATTNAAQAAGAIIENASFDPDKEYAKMMPSSMKAIQSKYNGAAVFGDKFARLADKPNRVGGANTLPKLEEKAAGGWITGPQSGYPVSLDGGRSVSFIGHGTEYVAQRSGGGFVVPFDTPATRKNPGLTGQRIGEASRSGFKLGGMLPGFDMGGALAKMLPRFSAGGKITAIQQKALDVLAKYESGAAGYNAVNQIGTNNGRGVEGFSGDFTKMRQHGGKALTSLTIGDIKKLQYDDRSMSDSQWINAGKLHAVGRYQFIGNTLPGVAKRAGLKDSDLFSEKNQDIMAIQLMKERGISPWVGPSDKATKEERAIVAAVQQNRSGAGLLDYDLGTTTAAEGTNASEGAADTTAELTPEQKLNFALEKLVGGIKDVRGVMHGSEVAAATEDNLDAKDQAEKNEVSKTEEQMAAATAIATSVSKATAGKAVETAAGAGGGQKTIVVPTEEKEGLLTFMPGFGLFGGSS
Other Proteins in cluster: phalp2_1947
| Total (incl. this protein): 19 | Avg length: 816,1 | Avg pI: 9,00 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 4upJo | 809 | 9,13744 |
| 1JAH | 879 | 8,73303 |
| 1T9Ue | 647 | 10,09931 |
| 1VWLd | 809 | 9,05060 |
| 1g6KY | 658 | 9,56268 |
| 1gi0B | 573 | 5,88007 |
| 2h3D9 | 845 | 9,27702 |
| 2njGy | 936 | 9,14486 |
| 4uUsn | 809 | 9,20455 |
| 7SBPl | 700 | 8,72936 |
| 7jQNV | 940 | 9,02656 |
| 81u8x | 840 | 9,39635 |
| 8HqvW | 940 | 9,02656 |
| 8Hqw5 | 962 | 9,13544 |
| 8Hqwc | 940 | 9,07239 |
| 8Hqwt | 940 | 9,02656 |
| 8LRyP | 809 | 9,04010 |
| 8zDG0 | 661 | 9,49054 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_9558
1gl5q
|
24 | 32,8% | 514 | 1.543E-76 |
| 2 |
phalp2_40218
2qjSX
|
23 | 27,5% | 790 | 1.949E-73 |
| 3 |
phalp2_31094
1UBAd
|
39 | 23,9% | 865 | 3.019E-66 |
| 4 |
phalp2_3091
3KnCh
|
11 | 30,4% | 548 | 5.964E-64 |
| 5 |
phalp2_35243
6Itht
|
25 | 23,6% | 778 | 7.370E-50 |
| 6 |
phalp2_13940
8CJpR
|
7 | 26,5% | 557 | 1.476E-43 |
| 7 |
phalp2_5745
4SbyE
|
114 | 23,6% | 939 | 1.417E-42 |
| 8 |
phalp2_8494
1gcVP
|
15 | 26,8% | 597 | 1.599E-34 |
| 9 |
phalp2_3839
6BqfU
|
33 | 25,3% | 632 | 2.577E-22 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Cyanophage S-SSM6b [NCBI] |
682651 | Kyanoviridae > Greenvirus > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
HQ316603
[NCBI]
CDS location
range 82616 -> 85045
strand -
strand -
CDS
ATGCAGAAATCAGGAAAGACATCTAAGATAGACTTCTATAAGTTTATATCGCCGAAGGGAATCAAAGCAGGCGATGACTCTGAAGGGACTGCTGCAGCTAATGTTATCGCAGTAAAAACTGTAAAGGCAAACAATCAGATTGGTAAAACTCTGAATGGTATTGGCGCAGTTTTACAGGATATTCATAAAAAAATGGCAGTAAATGCCATGATGGAGGCGGATTATCATAAGGAATTAAAAAAATCAATTGCAGATGATTCTAAACCTAAAAATACTAAACCAGTAGCAGAGAAAAGGAGTGGTATTACAGACTTTCTTGCTCCTGTAGTTGGTAGTTTCTTTGAAGGTCTTGCCAATCTTGCTGGTTGGTTTTTGAAGACCTTTGTTGCTAGAGCAGTTCTTGAATGGTTAAGTAATCCAGAAAATTTTGAGAAACTGACTAATATTGTTGAAGGTATAAAAGCAGTAGGAATGTTTATATACAACTTCTTCAAAGGAACCATAGGTGGTATCTTAGATGGCATCGCTAAGATGTGGGATCCAGAAGCATCATGGTGGGAGAAGTTATTAGGATTTGGACAGTTCTTCATCTCGTTAGGAGGACTATTACTTGGGCTCAGATGGCTCAAGAATCCTCTTAAACTAGTAAAAGATTTTGTTTGGGTACTTACAACTCTTTATAACAACTTACTTCGTGGTAAGAAACGAATGAAAGCGCGAGGGCGCTTTGGATTAGTCAAAGGATTAGTGGCGACTACAGTTATTGTTGGTGGTGCTGGTTTGATAATCAACGCTACTAATAATGCTAGTGAAGAGGATATGGCTCCTGTTGACGGTGGTAAAAACACAGTTCCTGTTGGTGATTCGGGATCTAAGAAGTATGCCATCATGACTTATGGTACTGATCAACATAAAGATCCCGACAAAGCAGCGGAATATGTGTCTGATCAGTTGACTAAAGCAAAGGAAATGGGATACAATACTGTATTCATTCCACCTTCTAGTCAAGGAGATAAGTTTGCGGAAGTAAGTGAAGCAACAACTAACGCGGCACAAGCAGCGGGTGCTATCATTGAAAATGCATCGTTTGATCCTGATAAGGAATATGAAAAGATGATGCCATCATCTATGAAGGCAATTCAGTCAAAATATAATGGTGCTGCTGTATTTGGAGATAAGTTTGCTCGTCTTGCAGATAAACCTAACCGTGTTGGAGGAGCGAATACTTTACCTAAACTAGAAGAAAAGGCAGCGGGTGGATGGATTACAGGTCCTCAATCTGGATATCCAGTATCACTAGACGGTGGTAGATCCGTATCATTCATCGGTCATGGAACTGAATATGTTGCACAGAGATCTGGTGGTGGATTTGTAGTTCCATTTGATACTCCTGCAACTAGAAAGAATCCTAGTCTAACAGGTCAAAGAATTGGAGAAGCATCCCGTGGTGGTTTCAAACTAGGTGGTATGTTACCTGGTTTTGACATAGGTGGAGCATTGACAAAGATGTTACCACATTTTGCTGCTGGTGGACAGATTACTGCTATTCAGCAGAAAGCATTAGATGTTCTTGCTAAGTATGAATCTGGTGCTGCTGGTTATAACGCAGTCAATCAAATTGGAACTAATGCTGGCCGCGGCGTTAAAGGATTCTCTGGTGACATTACAAAAATGAAACAGCATGGTGGCAAAGCACTTACCAGATTTACCATCGGTGACATTAAAAAATTACAACATGATGACAGATCAATGTCTGACGATCAGTGGATTAATGCTGGTAAGTTACATGCTGTAGGTAGATATCAGTTTATTGGTAATACATTACCTGGCGTTGCTAAGAGAGCAGGTCTTAAAGATTCTGATCTATTCAGTGAAAAGAATCAGGATATAATGGCAATTCAATTAATGAAGGAACGTGGTATTTCACCATGGGTAGGTCCGAGTGATAAAGCAACTAAAGAAGAAAGAGCAATTGTTGCTGCAGTTCAACAGAATCGTTCTGGTGCGGGTCTTCTTGATTATGATCTGGGAACTACTACTGCTGCAGAAGGCACTAACGCCAGTGAGGGTGCAGCCGATACAACTGCTGAACTAACTCCTGAGCAGAAATTAAATTTCGCCTTAGAAAAATTAGTAGGTGGTATCAAAGATGTTCGTGGTGTTATGCATGGTAGTGAAGTTGCTGCAGCAACGGAAGATAATTTAGATGCTAAAGATCAAGCAGAAAAGAATGAAGTTTCTAAAACAGAAGAACAGATGGCCGCGGCGACTGCAATAGCAACATCAGTAAGTAAGGCAACTGCAGGAAAAGCAGTAGAAACTGCAGCGGGTGCAGGTGGCGGACAAAAAACAATAGTAGTACCAACTGAAGAAAAAGAAGGACTATTAACATTCATGCCTGGTTTCGGATTATTCGGAGGTTCTTCATAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0016020 | membrane | cellular component | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi0002c0b1ff_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(4upJo)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50