Protein
- Protein accession
- A0A8S5U4I5 [UniProt]
- Representative
- 6P840
- Source
- UniProt (cluster: phalp2_7713)
- Protein name
- Internal virion protein D
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MYNIGNKESGMDYTLPNSSGSGASGAYQFMQGTWDAYAEKVAPEYVGVAPMNAPAEVQDAVMYEKTSEMYDRYGGNIQLMALEHYGGMGAADEAMRTGRIPENAEWCNGDEYPSQASYAREIAESMGQAVPNLNAFAGAFAKGGKITAGNPSFSLVLDESPLEIARMNDRPFWEKLEDSFKNMWYENGTIAAMRVGLAKMNANPYYSTWKATDEDIKLLDEVLGDNKVAKDSVLLNAENPEQFKALLKMKKEDIEREKRAEQTNFGIHSVIGGALGMLLDPLNLIPFVGEEAFLAKVGARLGAKALVSLGSKRVMKIAETAAVQGALNMADRGLAEHYGIHEANYAVAGVLGVAGGAGIRFLRTMRELNVPMNGEHMQRFLYESERMQDQAVQGALDIADKRAMQTHSLLNTEAPIEKQLDDFLGGIPTLSKKESKALGKSLGKLLPEEDVEKVVGKRASKLLRDAKLSGDATVTDLLRKAGSSPSLGAKVRKAIEKYRKTPMSDRTWNDYLTSKGVNPEAGLNRVKLAEEALQDDKKARVLQAWVKKTKGQDIPVDDLRVGLKQILYRESGVGYTKNEDALIINGTVVRENSPVYDAIVNPEIYDPVSVPMSIPRSEERVVSEHYVPPKKDKQPTVSKAEQEAFTNDVEMGSKTLREVEDENQRGFKSRVMKYIGRKMEDSKYLGDTYGHFTNSVSNHLRDFGRKMLGDPRQNAERHAQGLSLDFSTRKSVMQRQLREYIGGMRQCYRDYFAQHVGAPSKVRRQFGKEFIQAYDQKVKYGRSIEGFSKEIQEAVRQAENFRKMEQEFLRRTGALTRDIPDTGFYRRADVDKVAEFLTNFDSEQDAIDWLANYASKNADRDALERMRVIEEPDMELSEYVDREARNWAYGIIDRNLSNAKVTMRDLNHMDKLEQYQRRFPMDTSALSDKTMPNGEYFTFDDCLRDYDVFSTMEQVANRSSAKATMSSLGVKDMGAFFDGYRDKIERELRKANETRRLIKHSDVSDALEEFDYVVSQLTGYRYGTKRSQDPMNGVVRLLTKMSYAMNGYNMGLNQIGENFGMMSVTGMRAIGNMIPGLDKILHGMRTTTLSHDELKKLRIAADYSQYNFLNPMDLSTPKYDRIGLRAKVMGKLNTAVDYASDITSMLNQLSAWTERAVSMGEADVMSDLIDWAVLGRKGHLFNDNAFNNVGVRDTDKFKDTINKYFGNLDHNDPDAVFKAIQKMQEEDYTAYVSMRAFTAQAVQRGIIQPNLSNANYFTKTGMFPMLLQFKNFSRMAINSHLARALERPDKEAMTQLLSSAVAGAGIWALRTQVYANWKYKDEAERKKFLDDTLTSDNFARAGITRSSLLAGLSFGNDLYEAVSGAPTVRTTVNRQGGSEQGLGSYIDQLPAVAALNTVKDGVGSAWSALNDLVVDNRVYQDDSKTIANMFPLDKFVGTQAVLSGLLDMHKGHISQDSFSKRPETKSSRNPIQMLQKVVTGTNDVEEAQKKQKDTQKSRSKSKKQEQRKYLNMNGGKW
- Physico‐chemical
properties -
protein length: 1517 AA molecular weight: 170075,5 Da isoelectric point: 7,08 hydropathy: -0,58
Representative Protein Details
- Accession
- 6P840
- Protein name
- 6P840
- Sequence length
- 157 AA
- Molecular weight
- 17799,98910 Da
- Isoelectric point
- 9,56777
- Sequence
-
MRRFLLFLAIGIAAVLFIGIKAGPSFGDDHFNYKNPWKPTKNVDEIPDSLYRGWHYEKKWEPFRKCLLRRESGSNFKADSSGGSGGFQFVQKTWDHYVAKADPGYVGVRPNKAPPYLQEEVFWIAVNPTPRKPGLAGRHHWSASHAHGAGYTHVKDC
Other Proteins in cluster: phalp2_7713
| Total (incl. this protein): 47 | Avg length: 164,2 | Avg pI: 8,74 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 6P840 | 157 | 9,56777 |
| 1A5jo | 143 | 9,01927 |
| 1C6T7 | 153 | 9,44373 |
| 1F0Q0 | 129 | 9,07620 |
| 1hb4t | 129 | 9,76446 |
| 1ukgg | 143 | 8,80085 |
| 28moX | 155 | 8,89311 |
| 2PDEb | 128 | 9,14763 |
| 2PQkL | 143 | 9,34348 |
| 2XikP | 127 | 8,92844 |
| 2iVQo | 143 | 8,80047 |
| 4A7Eg | 130 | 7,80868 |
| 4Ag5R | 124 | 9,27437 |
| 4Yho | 143 | 9,05621 |
| 4afW | 134 | 7,83325 |
| 4boo | 137 | 9,84505 |
| 4lQPy | 141 | 6,95638 |
| 4oU1w | 121 | 9,11958 |
| 4ogUo | 124 | 9,54656 |
| 4tBG6 | 129 | 7,79540 |
| 4tmqe | 128 | 9,28043 |
| 4zA0D | 121 | 9,27334 |
| 4zOjR | 130 | 6,49536 |
| 4zRWP | 130 | 7,80868 |
| 4zWvB | 137 | 9,29900 |
| 52Apg | 141 | 6,50530 |
| 55lWv | 124 | 9,09689 |
| 5Bt0a | 131 | 6,57078 |
| 5gxAj | 131 | 9,20533 |
| 5u4ID | 124 | 8,90626 |
| 5xSBo | 133 | 8,95654 |
| 5ydno | 129 | 9,09670 |
| 6A0E7 | 171 | 8,27724 |
| 6IBN0 | 109 | 9,04074 |
| 6JnA6 | 153 | 9,64423 |
| 6JnHa | 141 | 9,11946 |
| 6Jooh | 147 | 10,29091 |
| 6zJml | 121 | 9,27334 |
| 7X69Z | 124 | 8,90594 |
| 82IP4 | 143 | 9,46198 |
| 8DdES | 142 | 9,64629 |
| 8nnSc | 128 | 6,50229 |
| 8p5mP | 128 | 6,36696 |
| Bggp | 124 | 9,48822 |
| fSr3 | 150 | 9,90623 |
| tDuc | 129 | 9,09670 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_21967
7HbLZ
|
2 | 42,8% | 126 | 2.452E-40 |
| 2 |
phalp2_18500
6OOeg
|
2 | 34,2% | 105 | 1.919E-31 |
| 3 |
phalp2_32811
2GfCO
|
95 | 30,7% | 114 | 2.616E-16 |
| 4 |
phalp2_18333
5moh5
|
2 | 26,6% | 120 | 8.142E-08 |
Domains
Domains [InterPro]
1
157 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend:
EAD
CBD
Linker
Disordered
Unannotated
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Caudovirales sp. ct2KA10 [NCBI] |
2825757 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
BK016009
[NCBI]
CDS location
range 2523 -> 7076
strand -
strand -
CDS
ATGTATAACATAGGCAACAAAGAGTCTGGCATGGACTACACACTGCCTAATTCAAGTGGCTCAGGTGCTTCAGGCGCATATCAGTTCATGCAAGGTACATGGGATGCCTATGCTGAAAAAGTAGCCCCTGAGTATGTTGGGGTAGCTCCGATGAACGCTCCTGCTGAAGTTCAGGATGCCGTCATGTATGAAAAGACCTCTGAGATGTATGATCGCTATGGAGGTAATATTCAGTTGATGGCTCTGGAACACTATGGAGGCATGGGAGCCGCTGATGAGGCTATGAGAACAGGGCGTATACCAGAAAATGCTGAGTGGTGTAATGGGGATGAATACCCTTCCCAAGCTTCCTATGCAAGAGAAATTGCTGAGAGTATGGGTCAGGCAGTTCCTAATTTAAATGCTTTTGCAGGGGCTTTTGCTAAAGGAGGTAAAATTACCGCAGGTAACCCCTCCTTTTCCTTAGTTCTTGACGAAAGCCCTCTGGAAATAGCTAGGATGAATGACAGACCCTTCTGGGAAAAACTGGAAGACTCATTCAAGAACATGTGGTATGAAAATGGTACGATTGCGGCTATGCGTGTGGGGTTAGCCAAAATGAACGCCAATCCCTACTACAGCACATGGAAGGCTACAGACGAAGACATAAAGCTACTTGATGAGGTCTTAGGGGACAACAAGGTAGCAAAGGATTCCGTACTGTTAAATGCTGAAAATCCTGAGCAGTTCAAGGCTCTTCTGAAGATGAAGAAAGAAGACATAGAGCGAGAGAAGAGAGCAGAACAAACTAACTTTGGTATTCACTCGGTTATTGGTGGAGCCTTGGGGATGCTATTAGACCCCCTGAATCTCATCCCGTTTGTTGGTGAGGAAGCCTTTTTGGCAAAGGTTGGTGCAAGGTTAGGGGCTAAAGCTCTCGTGTCTTTAGGTTCTAAGCGTGTCATGAAAATTGCAGAAACCGCCGCTGTACAAGGGGCCTTAAATATGGCTGACAGAGGTCTTGCGGAACACTATGGGATTCACGAAGCCAACTATGCAGTGGCAGGTGTCTTAGGTGTTGCAGGCGGTGCTGGTATCCGCTTCCTCCGTACTATGAGGGAACTGAATGTTCCTATGAATGGGGAGCATATGCAACGGTTCCTTTATGAATCCGAAAGGATGCAGGATCAGGCTGTGCAAGGAGCCTTGGATATTGCAGACAAAAGAGCGATGCAAACACATTCTCTCTTGAACACGGAAGCGCCAATAGAGAAACAACTTGACGACTTCTTAGGTGGTATTCCTACTTTGTCTAAAAAAGAAAGCAAGGCTTTAGGTAAATCCCTTGGCAAATTGCTCCCTGAAGAAGATGTAGAGAAAGTAGTGGGGAAGAGAGCCTCGAAGCTCCTGAGAGATGCAAAGTTATCTGGAGATGCTACCGTCACTGACTTATTGCGGAAAGCAGGTAGTAGTCCCTCTTTAGGTGCTAAGGTACGCAAGGCTATTGAAAAGTATCGCAAGACACCTATGTCTGATAGAACATGGAATGATTACCTTACTAGCAAGGGGGTAAATCCTGAAGCTGGGTTAAACCGTGTGAAGCTGGCAGAGGAAGCCTTGCAGGATGATAAGAAAGCCCGTGTTTTGCAGGCTTGGGTTAAGAAGACAAAAGGTCAGGACATTCCTGTTGATGACTTACGCGTTGGCTTGAAACAAATCTTGTATCGTGAGTCTGGCGTAGGGTACACGAAAAACGAAGATGCCCTTATCATTAATGGTACAGTTGTCAGGGAGAATAGCCCTGTATACGACGCTATTGTAAACCCTGAAATATATGACCCTGTAAGCGTTCCTATGTCTATACCGCGAAGTGAAGAGCGTGTTGTCTCGGAACACTACGTACCGCCTAAGAAAGACAAACAGCCTACAGTGTCTAAGGCCGAACAGGAAGCATTCACTAACGATGTCGAGATGGGTTCAAAGACACTGAGAGAAGTAGAAGACGAAAATCAACGAGGATTCAAGAGTCGTGTCATGAAATATATTGGCCGTAAAATGGAAGACTCTAAGTACCTTGGTGATACCTACGGCCACTTTACCAACTCTGTGTCTAACCACCTGAGAGACTTTGGACGTAAGATGCTTGGCGACCCTAGACAAAATGCTGAACGTCATGCTCAAGGTCTTTCTTTGGACTTCTCTACTCGCAAGAGCGTTATGCAGAGACAGCTCAGAGAGTATATTGGTGGTATGAGACAATGCTACAGAGACTACTTTGCTCAACATGTAGGAGCTCCATCGAAAGTACGCAGACAGTTTGGTAAGGAGTTCATTCAGGCTTACGACCAGAAAGTAAAGTATGGTAGAAGCATTGAGGGATTCTCGAAGGAAATTCAGGAAGCTGTTAGACAGGCTGAGAATTTCCGTAAGATGGAACAGGAGTTCTTGCGTAGAACAGGGGCTTTGACACGGGATATTCCTGACACTGGCTTCTATCGTAGAGCAGATGTAGACAAAGTAGCTGAGTTCCTTACGAACTTTGATTCTGAACAGGACGCTATTGATTGGCTTGCTAATTATGCCAGTAAGAACGCTGATAGAGACGCTCTGGAACGGATGCGTGTTATTGAAGAACCTGACATGGAGCTGAGTGAGTATGTAGACAGAGAAGCTCGTAATTGGGCCTACGGTATTATTGACCGCAATTTGTCTAATGCAAAGGTTACTATGCGTGACCTGAATCATATGGACAAATTAGAGCAGTACCAGAGACGTTTCCCTATGGACACTTCTGCCCTGTCTGACAAAACAATGCCAAACGGTGAATACTTTACCTTTGATGATTGCTTGAGAGACTATGATGTCTTCTCAACGATGGAACAGGTAGCAAATCGTAGTTCTGCTAAGGCTACCATGTCTTCCCTTGGTGTCAAGGATATGGGAGCGTTCTTTGATGGCTACAGAGACAAAATAGAACGAGAACTACGGAAGGCAAATGAGACACGTAGGCTGATTAAGCACAGTGATGTATCAGATGCCTTGGAAGAGTTCGACTACGTTGTCTCTCAGCTTACTGGGTATCGCTACGGCACTAAGCGTTCTCAAGACCCTATGAATGGTGTCGTTCGCTTATTGACTAAGATGTCCTATGCAATGAACGGTTACAACATGGGGCTGAATCAGATTGGGGAAAACTTCGGGATGATGTCGGTTACTGGTATGAGAGCTATTGGCAACATGATTCCTGGGTTAGACAAAATTCTCCATGGGATGCGTACTACTACGTTGTCTCATGATGAGCTGAAGAAACTCAGGATTGCCGCTGATTACTCACAGTATAACTTCTTGAATCCTATGGATTTGTCAACACCTAAGTATGACCGTATTGGCCTCCGTGCAAAGGTCATGGGTAAGCTGAATACGGCTGTAGATTATGCCTCTGACATTACTTCTATGCTAAACCAGCTGAGCGCATGGACAGAAAGAGCAGTTAGTATGGGGGAAGCGGATGTTATGTCTGACCTTATCGACTGGGCTGTATTGGGACGAAAAGGACACCTGTTCAATGACAATGCTTTTAATAATGTAGGAGTACGAGACACAGATAAGTTCAAGGATACCATTAACAAATACTTTGGGAATCTTGACCATAACGACCCTGACGCTGTGTTCAAAGCTATTCAGAAGATGCAGGAGGAAGATTATACCGCCTATGTCTCCATGAGGGCCTTTACAGCACAGGCGGTACAGCGAGGTATCATTCAGCCTAATTTGTCTAATGCTAACTACTTCACGAAGACTGGGATGTTCCCGATGCTCTTACAGTTTAAGAACTTCTCTCGTATGGCTATTAACAGTCACCTCGCAAGAGCCTTGGAACGTCCAGACAAAGAAGCAATGACACAGCTGTTAAGTTCTGCTGTTGCTGGGGCTGGCATTTGGGCCTTGCGTACTCAGGTGTATGCAAATTGGAAATATAAAGATGAAGCAGAGCGCAAGAAGTTCCTTGACGACACACTGACTTCTGACAACTTTGCTCGTGCTGGTATTACTCGGTCTTCTCTGTTGGCTGGTTTATCTTTTGGCAATGACTTGTACGAAGCTGTGTCTGGTGCGCCTACAGTACGTACTACGGTAAATAGACAAGGAGGTTCTGAACAGGGCCTTGGAAGTTACATAGACCAGCTTCCTGCTGTGGCCGCTCTGAATACTGTGAAAGATGGTGTCGGTAGTGCATGGAGTGCCTTAAATGACCTTGTAGTAGATAATCGAGTATATCAGGATGACAGCAAGACGATTGCCAATATGTTCCCTCTGGATAAGTTTGTAGGGACACAGGCTGTTCTGTCTGGGTTACTTGATATGCACAAGGGGCATATCAGTCAGGATAGCTTCTCGAAACGCCCTGAGACAAAATCTTCCCGCAATCCGATTCAGATGCTTCAGAAGGTAGTGACGGGGACGAATGATGTAGAGGAGGCGCAGAAGAAACAGAAAGATACACAGAAATCTCGTAGTAAGAGCAAGAAACAAGAACAACGAAAGTATTTGAATATGAATGGAGGTAAATGGTGA
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(6P840)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50