Protein
- Protein accession
- A0A6J5KIS0 [UniProt]
- Representative
- 5dfIV
- Source
- UniProt (cluster: phalp2_40660)
- Protein name
- TtsA-like Glycoside hydrolase family 108 domain-containing protein
- Lysin probability
- 100%
- PhaLP type
- VAL (probability: 99%, predicted by ML model)
- Protein sequence
-
MASIPLIGQTTSVSDTTPSPQAQGMQVISPLGNAPGIAADGLDTLANAMSRKQSADAVANLSKSMSDAQVYWTKAQVDGQQNTSDGGNITMPNGTVIGYRQKMQNDFDNWSKTFLDGVKDPRAKNIATDQVQSLRTSVLTNAINFEAQAGIANRSDKLDEAVKGWASQAAGAKSLSDIDGLVQSAKIFIANSGFDEKTRNDKVRSAVSTIIQSANTGAMMRDPNGYKDAALRRYGIDQTPTTPASPNGNAAAIPGGFDGAVAFTLQHEGGYAAKDANGAAVNRGINQAAHPEVDVANLTQQQATDIYRRQYWNGINGDQMVAAGHGPLATAAFDTAVIAGPGKAKELLAASGGDVNKFMDLREAFLGGLVANNPDKYGRFKGAWDSRNADLRAQTGATSASPGHSTTPIDPQMQALVNQLPIDQLPGFIGAASTQQNQNQALYRSQLTVTEGDHIAAFMNGQPVPKPLSEPDYVKAYGPIEGPQRFVNYQKTQQLGADIGSMKIMPPAQMTAMVESYKPDPSKPGFELATTRYQAVAAAADQVNAARQADPVTYAIQAGIGGAKPLDFSSAANLSAGLTQRQGVAATMQGQFQAPFQMLSVPEAKTLNTAFQTMSTVQRLGYLNTIRQAVTDPTAYRTIMQQIAPDSPVTAMAGIIMSKQQPAVTPGGWFSSQASYAQQDVAGLMLEGEALINPNKMDKEDNGRGKTFPIPKEDDMRTVFTNAVGAAFAGDPNGASFAFQAVKAYYVGKAARMGVIDNSQVNSNIMQEALDAVIGGVTDINGKGEVIRPWGMQEDFFKNQAKAKFDAAMVATGLKGSTVDDFGLYGLQSAGDGKYLLKAGTGTLTDKQGNPIVIDVTPAPLPSSLFGDPQTLPAIVPPTSAVQPKTPKLNTQQPGTK
- Physico-chemical properties
- Protein length: 897 AA; molecular weight: 94520,9 Da; isoelectric point: 5,78; hydropathy: -0,34
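The page does not state how the hydropathy value is derived. A common choice is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues; the sketch below assumes that definition. The function names `gravy` and `parse_decimal_comma` are hypothetical helpers, not part of the database, and the second one only converts the decimal-comma notation used on this page.

```python
# Kyte-Doolittle hydropathy values, keyed by one-letter amino acid code.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence):
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the 'hydropathy' field on the page is a GRAVY score.
    Raises KeyError for residues outside the 20 standard amino acids.
    """
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


def parse_decimal_comma(text):
    """Convert a decimal-comma number such as '-0,34' to a float."""
    return float(text.strip().replace(",", "."))


if __name__ == "__main__":
    reported = parse_decimal_comma("-0,34")
    # Paste the full protein sequence here to compare against the page.
    print(reported, gravy("MASIPLIGQ"))
```

A negative score, as reported here, indicates a sequence that is hydrophilic on average under this scale.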
Representative Protein Details
- Accession
- 5dfIV
- Protein name
- 5dfIV
- Sequence length
- 849 AA
- Molecular weight
- 91185,48400 Da
- Isoelectric point
- 7,69363
- Sequence
-
MARIPVYERRIAQESNAPIARVGASPVAAALQNLGQAGMQAVDRVMAADQAAAERAKREAEEVDKAQVPNLLSNGQVYWQQREDERRQAWKVGDPDMREKAGQEFDKWVAESSKALNTDDGRRYFQQHAASMRARLLTDTYSYQRRSTVEKLNADNAVGEANDEILVARAWNNPAEVNAIIARRIEPLLARTDLSEAEKIVAAEKVKARMFLARERAFVENDPQGWLQANGGLPKPGPAAAGSPAPGFDVVVQQILKTEGGYNASDGNSGAPVNFGINQRANPDIDVKNLTREGAIKIYRDRYWNAIGADNLPGPLQATAMDAAVNQGVGWTKGALAEAGGDVAKFNELRRARYREIAENPDQAKFLNTWLARVKDPVAGPGDQAPAAAAGAPAAPTGWASQHMDPDKLYQLRGLAESRVAQVQTQVRSEAERTVGDALAMHKDGKVDPFNLTPDYFDRAFGADGPRRFTEYKAGQGMAQAIGNFASQSPEQIAAVIQAAAPAQGPGYAMADARQQVMVQAARQVIEERAKDPQAFAMRHGLATTRPLDMNQPALMAPELAKRQATAAAMRDRYATPLRVFTEAEAQQLSQVLAAFPTEGKIAYLDQMRRGLSDPQTFRAAMAQIAPDSPVTAVAATILTATDSVVMPGGMFSDSTVLRPRKVAATMLEGEAILNPTRGAKAQDGRGGKFPMPKEADLELAFTQAVGKAFRNDAAGFGTAYQAFKAYYAGAASQRGVLSEQIDNKLAKEAIAAATGGVIDFNGAGEVLKPWGTSDDQFKNEIAAEFNRLIEAAGYKGTSLDNLRAYGLLGLGQGRYAVVNGSDALRAPTGQAIILTLQARSTAQIPGAR
Other Proteins in cluster: phalp2_40660
Total (incl. this protein): 34 | Avg length: 863,6 AA | Avg pI: 6,76

| Protein ID | Length (AA) | pI |
|---|---|---|
| 5dfIV | 849 | 7,69363 |
| 1wVjk | 912 | 8,38716 |
| 1yyAs | 912 | 8,58501 |
| 2kIgW | 868 | 5,56899 |
| 48KDp | 911 | 6,89698 |
| 4AdHx | 905 | 5,97971 |
| 4GaXz | 1024 | 6,52167 |
| 4HGms | 662 | 5,94095 |
| 4g8hA | 860 | 6,48450 |
| 4gEmd | 884 | 6,63194 |
| 4h5fU | 879 | 8,57689 |
| 52btt | 881 | 6,50678 |
| 53irz | 876 | 7,06272 |
| 53txN | 875 | 7,09881 |
| 5Byzg | 889 | 7,16390 |
| 5DBjg | 715 | 6,59386 |
| 5HbI8 | 864 | 5,41871 |
| 5HdaD | 879 | 5,60747 |
| 5dCu9 | 639 | 6,51008 |
| 5jIHZ | 902 | 8,39148 |
| 5kZUi | 834 | 5,49698 |
| 5lb9W | 900 | 8,68404 |
| 5vGxD | 869 | 8,27318 |
| 6GHb7 | 830 | 5,11559 |
| 6SSaR | 860 | 6,14460 |
| 6TzrJ | 860 | 6,14568 |
| 6wZDu | 898 | 5,58190 |
| 87d54 | 865 | 8,49966 |
| 8mXDH | 895 | 6,67963 |
| 8sBPv | 875 | 7,56512 |
| br61 | 861 | 6,23702 |
| e6xL | 853 | 6,01802 |
| fIqL | 879 | 5,90724 |
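
The summary figures for the cluster can be recomputed from the member table. The sketch below is one way to do it, assuming rows are given in the `| Protein ID | Length (AA) | pI |` layout with decimal commas, and that the protein described on this page (897 AA, pI 5,78) is counted in addition to the listed rows. `parse_member_row` and `cluster_summary` are hypothetical names.

```python
from statistics import mean


def parse_member_row(row):
    """Parse '| id | length | pI |' into (id, length, pI).

    The pI column uses a decimal comma, so it is converted before float().
    """
    cells = [cell.strip() for cell in row.strip().strip("|").split("|")]
    if len(cells) != 3:
        raise ValueError(f"expected 3 cells, got {len(cells)}: {row!r}")
    protein_id, length, pi = cells
    return protein_id, int(length), float(pi.replace(",", "."))


def cluster_summary(rows, query_length, query_pi):
    """Return (member count, mean length, mean pI) including the query protein."""
    members = [parse_member_row(row) for row in rows]
    lengths = [m[1] for m in members] + [query_length]
    pis = [m[2] for m in members] + [query_pi]
    return len(lengths), mean(lengths), mean(pis)


# First three rows of the table only; extend with the remaining rows as needed.
rows = [
    "| 5dfIV | 849 | 7,69363 |",
    "| 1wVjk | 912 | 8,38716 |",
    "| 1yyAs | 912 | 8,58501 |",
]
total, avg_length, avg_pi = cluster_summary(rows, query_length=897, query_pi=5.78)
print(total, round(avg_length, 1), round(avg_pi, 2))
```

With all 33 listed rows plus this protein's length of 897 AA, the mean length comes to about 863,6 AA, which agrees with the summary line.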
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_22297 (3tFD) | 1 | 26,6% | 862 | 8.723E-132 |
| 2 | phalp2_13019 (42z0G) | 36 | 34,2% | 614 | 7.546E-129 |
| 3 | phalp2_38811 (1qguC) | 2 | 37,2% | 551 | 3.007E-125 |
| 4 | phalp2_37859 (4MCL5) | 1 | 34,0% | 537 | 4.067E-123 |
| 5 | phalp2_29633 (8el7O) | 1 | 25,4% | 754 | 2.267E-110 |
| 6 | phalp2_16490 (6Q3Q5) | 5 | 25,9% | 867 | 9.606E-106 |
| 7 | phalp2_39357 (4Dlu7) | 3 | 26,2% | 841 | 4.530E-89 |
| 8 | phalp2_34618 (5kpe5) | 3 | 24,1% | 891 | 1.944E-82 |
| 9 | phalp2_8044 (5HvAD) | 6 | 28,5% | 669 | 2.295E-76 |
| 10 | phalp2_20602 (4GdMQ) | 1 | 22,6% | 618 | 1.870E-61 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | uncultured Caudovirales phage [NCBI] | 2100421 | No lineage information |
| Host | No host information | | |
Coding sequence (CDS)
CDS source: LR796155 [NCBI]
CDS location: range 27133 -> 29826, strand +
CDS
ATGGCATCCATCCCGCTGATCGGTCAAACGACGTCCGTATCGGACACGACGCCGTCGCCGCAGGCGCAGGGCATGCAGGTCATCAGTCCGCTGGGCAATGCCCCCGGTATTGCGGCCGATGGTTTGGACACGCTTGCCAACGCAATGTCGCGCAAGCAGAGCGCGGACGCGGTGGCGAACCTGTCGAAATCCATGTCGGATGCGCAGGTCTATTGGACCAAGGCGCAAGTCGATGGGCAACAGAACACCAGCGACGGCGGCAATATCACCATGCCGAACGGAACCGTGATCGGTTACCGGCAGAAGATGCAGAACGATTTCGACAACTGGTCAAAAACCTTTCTGGACGGCGTCAAGGATCCTCGCGCCAAGAACATCGCCACGGATCAGGTGCAATCGCTGCGCACTTCGGTGCTGACAAACGCGATCAATTTCGAAGCGCAGGCCGGGATTGCCAACCGGTCCGACAAGCTGGACGAAGCGGTGAAGGGCTGGGCCAGTCAGGCGGCAGGCGCTAAAAGCCTGTCAGACATCGATGGCCTTGTGCAGTCGGCCAAGATCTTCATCGCCAATTCTGGTTTCGACGAAAAGACGCGCAACGACAAGGTGCGATCGGCCGTGTCGACGATCATTCAATCTGCCAATACCGGGGCGATGATGCGCGACCCGAACGGCTATAAGGACGCGGCCCTGCGGCGGTACGGGATCGACCAGACCCCAACGACGCCCGCCTCGCCCAATGGCAACGCGGCAGCGATCCCCGGCGGCTTCGACGGCGCCGTCGCGTTCACCCTGCAGCATGAAGGCGGCTATGCGGCCAAGGACGCCAACGGCGCGGCCGTCAATCGCGGCATCAACCAAGCCGCGCACCCCGAGGTCGACGTCGCCAATCTGACGCAACAGCAGGCGACCGACATCTATCGTCGCCAATACTGGAACGGCATCAACGGCGATCAGATGGTCGCAGCAGGCCATGGGCCGCTTGCCACGGCGGCTTTCGATACGGCGGTCATCGCAGGCCCGGGCAAGGCCAAGGAACTGCTTGCGGCCTCGGGCGGCGACGTCAACAAGTTCATGGATCTGCGCGAGGCGTTTCTAGGCGGTCTGGTCGCCAATAACCCGGACAAATACGGGCGCTTCAAAGGCGCATGGGACAGCCGAAACGCGGATCTGCGCGCCCAAACTGGTGCGACATCCGCATCGCCCGGTCACAGCACCACACCGATCGACCCGCAAATGCAGGCGCTGGTCAACCAGCTGCCGATCGACCAGCTGCCGGGCTTCATCGGCGCAGCCAGCACGCAGCAAAATCAAAATCAGGCGCTTTATCGGTCGCAGCTGACAGTCACAGAGGGCGATCATATCGCTGCCTTCATGAATGGGCAGCCGGTGCCCAAGCCGCTGTCAGAACCCGATTACGTCAAAGCCTATGGCCCGATCGAGGGGCCGCAGCGGTTCGTGAACTATCAGAAGACCCAGCAGCTGGGGGCTGACATTGGCAGCATGAAAATCATGCCGCCCGCTCAGATGACGGCAATGGTCGAAAGCTACAAGCCCGACCCGAGCAAGCCCGGATTTGAACTGGCGACGACGCGCTATCAGGCCGTGGCTGCGGCGGCGGATCAGGTGAACGCGGCACGGCAGGCGGATCCGGTGACCTATGCGATCCAAGCTGGCATCGGGGGCGCGAAGCCGCTCGATTTCTCATCCGCTGCCAATCTATCGGCAGGGCTGACGCAGCGCCAAGGCGTAGCTGCCACCATGCAGGGGCAGTTCCAAGCGCCGTTTCAGATGCTGTCAGTGCCAGAAGCCAAGACGTTGAACACCGCGTTCCAGACGATGTCGACGGTGCAGCGCCTCGGCTACCTGAACACGATCCGGCAGGCCGTCACCGACCCGACCGCCTATCGCACCATCATGCAGCAGATCGCCCCGGACAGCCCTGTCACGGCGATGGCTGGCATTATCATGTCCAAGCAGCAGCCTGCCGTGACGCCGGG
CGGATGGTTCAGCAGCCAAGCGTCCTATGCCCAGCAGGACGTCGCTGGCCTGATGCTTGAAGGCGAGGCGCTGATCAATCCCAACAAGATGGACAAGGAAGACAACGGCCGGGGCAAGACGTTCCCAATCCCCAAAGAAGACGACATGCGGACAGTCTTCACCAATGCGGTGGGGGCTGCCTTTGCCGGGGATCCTAATGGGGCAAGTTTCGCGTTTCAGGCGGTCAAAGCCTATTACGTCGGCAAAGCTGCGCGCATGGGCGTGATCGATAACAGCCAAGTCAATTCGAACATCATGCAAGAGGCGCTCGATGCTGTCATTGGCGGCGTGACCGACATCAACGGCAAAGGCGAAGTCATCCGGCCGTGGGGCATGCAGGAAGACTTTTTCAAAAATCAAGCCAAAGCCAAGTTCGACGCGGCCATGGTGGCAACGGGCCTCAAAGGTTCGACCGTCGATGATTTCGGCCTGTACGGCCTGCAAAGTGCGGGCGATGGCAAATATCTCCTCAAGGCCGGCACCGGCACCCTGACCGACAAACAGGGCAACCCGATCGTGATCGACGTCACCCCAGCCCCGCTGCCCTCGTCGCTGTTCGGGGACCCGCAGACGCTACCCGCAATAGTGCCGCCCACCAGTGCCGTGCAGCCCAAGACACCTAAACTCAACACGCAGCAACCGGGGACAAAATGA
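
The CDS location and the reported protein length can be cross-checked with simple arithmetic. The sketch below assumes the range is 1-based and inclusive, and that the CDS includes the terminal stop codon (the sequence above ends in TGA). Both function names are hypothetical.

```python
def cds_length(start, end):
    """Number of nucleotides in an inclusive, 1-based coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(start, end):
    """Protein length implied by a CDS range, assuming it ends in a stop codon."""
    nucleotides = cds_length(start, end)
    if nucleotides % 3 != 0:
        raise ValueError(f"CDS length {nucleotides} is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return nucleotides // 3 - 1


if __name__ == "__main__":
    print(cds_length(27133, 29826))               # 2694 nucleotides
    print(expected_protein_length(27133, 29826))  # 897 residues
```

The result of 897 residues matches the protein length listed under the physico-chemical properties.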
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative (5dfIV) rather than this protein.
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
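
The confidence bands above can be expressed as a small lookup. This is a minimal sketch: the legend does not say which band the exact boundary values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption made here, and `plddt_band` is a hypothetical name.

```python
def plddt_band(plddt):
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band.

    Assumption: exact boundary values (90, 70, 50) fall into the lower band,
    since the legend only gives strict inequalities.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 42.0):
        print(score, plddt_band(score))
```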