Protein
- Protein accession
- A0AAE7VLS4 [UniProt]
- Representative
- 4dTKa
- Source
- UniProt (cluster: phalp2_18122)
- Protein name
- Tail protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MAIATSPTNFHDSIRNMFTSSSGANSTGSIAYQIESLVGGGRKSQEQQQRQNTLVLKMMTEQTRLLKKIADSSGGGLASGISTIDDLLDIFAHGKGKFRGLARRLKSVVKLPVLAMGMLVGGVTDAFGVIAAAGKNILKPLVGLKNLINASGMFKVFGKLTPLLKVFPKLLGKLFIPITIIMGIVDAFSGWSNASEYLGKLNVSFGDKAASAIGTLVNGLLLGIPNWLLEQFGGTNLQKLMMHAKDGIWRGITALGSGLGTITEGAFNWISKQVSDIRQYLPTPGQVADMAWAAFDSVKTAVWDGLTYIGTSIRDGFYDFFKNMGDAIKEWWKNPFGPVPNMFNGISSGSTSQTPGAQIGRMFRPQQQSAPVLSGGGASPTNISFRAPNSPYDIPATQSEYDSGGAGGGWRPGMGASAPDSYYGGGGVGAPSGGPTVAATGKPTAPRTVPQDASDFKPGMNYADISTVLVNKLKQDFGVTEEQAYGVIGNFGHETDGFRANQEYNPLGGGRGGFGLAMWTGPRRKAFEAWAAQNGLDPADRNIQYQWFKKEVTTGEYAGVMDKVKQTGNRYESADVFQRSYEKAGVVNQPSRNRYADMAANGYTGPMQSPVMVNGDVPGVTMSNQNSKRSDDIQPQLKANVRDTVTQALGADYKVDVYSGGQSATSNGRTGSNRHNDGHAGDVRITRPDGSPLSDEEAAKVSQLYLAKGYGSVGLRMSGGGIHFDDFTKDKLGPGQSNIWNYNKDGGYFPADLQSQVARGLTGERPAGLRYTDAEMAAMKNKGSVGAAGLPSGPGLGALVKEHQQPPSTLPAASAGSGGGALPTYDSLMPKNAPAQAELAPVQNPQTGGVSADSNGNKNGPFASGDLRVKDVPSTDEWKMMFVNGSAVT
- Physico‐chemical
properties -
protein length: 889 AA molecular weight: 93774,4 Da isoelectric point: 9,18 hydropathy: -0,34
Representative Protein Details
- Accession
- 4dTKa
- Protein name
- 4dTKa
- Sequence length
- 1071 AA
- Molecular weight
- 108972,60370 Da
- Isoelectric point
- 9,17026
- Sequence
-
VATEYESLRLNVSLVDNVTSQLEKIRGSLASLGGGPAGLGMERLKARTAELTEQMKGLATGFEGGSAAALNVAKSLGLATAGVVALGVAVVKGVAGLNEYARGMQQLGNLARQTGIGAAQIREMSEALQRSGVAADRAQSNIAGLAHAMADISRVNSELRQNLLRGLQGDDRQAMEMLLGDLGRVANNPAAFATRVREALDNVYANVLERTKSSTRAAEARQRFAEAFGMPDLAQLRGEIASVTPAMEAMMTARIAQSEKLVEVTTQIGQSWGVISDSVSALATPAVTFVLRGFADVLRDVASEIQAAVEALRSFEPPEWLKTIGRVVGAGAQKTAEALSYAAGTTEGGGAGGATRRALGSAWNKLRGGGAPTGLEAAAAIPVPQFQTGGMIGAGEMGLVGEAGPELFAPGQSGAILPNWLLKLMVKHMGTYGLMAGLKMAVKDAEQGHTMRTRLRGMLGLEDPGEPAPWQAGGAWKRQAGGSVTGGGSYLVGEAGAELFMGGGGGQGGEGRRLISEQNRQMQELNANSEDQNAQMRALTEELQTLNAAIAGPGGAPAGGGGGGVRMAGLRGLPGFGGGGGGAYGGGGGYGGAGGGPGGPARISGFAGGGGGARGGGASGSWGASAYGGGTTPPSTGLDAGNQTGASPAATAGPAGDPNIPGALLETARTVALTGGPQALDKFMRDNGYPRNGAWCGQFAASVVKSQGLQPPKNPEVASNWRNWGEAVTGAPQPGDVAIRTGGRTGATGSHVTFVSGYDPQTGRISTIGGNQALSRAERARGLTGESRDMASRYEYRRAPGGGVKPYGSDVGGGAFAGGGGTSGGAGATGRFAAPAGGGGRGGRGDLFGERAPMLMRELQDDLGLTRDQAAGLVGNLGYESAGFKSLQEGAPIGGAAGGYGYAQWTGPRRTAFEKWARENQLDPSSHEANAGFLKHELTGSHAGFLAQLKGTKNLAEATRLTHEVYERPADVQPRYWGTRLPGGRQIKPYESAGGRLQYAQRAHGLDRSELDQTMAREVSHRVEGTGKLTVDVNAPAGTKVAAEGGGLFKQTEVNRQTQMTPAREGPTVAV
Other Proteins in cluster: phalp2_18122
| Total (incl. this protein): 12 | Avg length: 971,2 | Avg pI: 9,18 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 4dTKa | 1071 | 9,17026 |
| 255QX | 941 | 9,21500 |
| 5CV8e | 869 | 9,21493 |
| 6XJLy | 1082 | 9,43799 |
| 6XPDS | 951 | 8,71659 |
| 6XX3y | 1059 | 9,13325 |
| 6XvVU | 979 | 9,15040 |
| 6Y28i | 1152 | 9,41304 |
| A0A7S5R2J6 | 886 | 9,17844 |
| A0A7S5RFV6 | 889 | 9,22628 |
| A0A7S5RFZ4 | 886 | 9,17844 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_1047
71YVh
|
8 | 29,2% | 1123 | 5.769E-67 |
| 2 |
phalp2_18578
7hJse
|
24 | 26,3% | 1111 | 1.390E-60 |
| 3 |
phalp2_4964
6I865
|
4 | 24,4% | 1178 | 5.474E-37 |
| 4 |
phalp2_12074
5kidj
|
23 | 25,6% | 1165 | 1.829E-34 |
| 5 |
phalp2_28717
4ENu0
|
8 | 24,7% | 1113 | 4.197E-25 |
| 6 |
phalp2_13442
4ECYe
|
8 | 23,0% | 846 | 1.366E-19 |
| 7 |
phalp2_18187
4F24v
|
1 | 22,5% | 1123 | 3.124E-19 |
| 8 |
phalp2_31646
4EQbE
|
2 | 23,1% | 1136 | 7.041E-16 |
| 9 |
phalp2_1983
4EXXZ
|
3 | 23,0% | 1210 | 1.177E-09 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Rhizobium phage RHEph12 [NCBI] |
2836128 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MW980064
[NCBI]
CDS location
range 58095 -> 60764
strand -
strand -
CDS
ATGGCTATCGCTACATCACCTACCAACTTCCATGACAGCATTCGCAACATGTTCACGTCGTCTTCGGGTGCGAATTCGACCGGCAGCATCGCCTACCAGATCGAAAGTCTGGTAGGTGGCGGACGCAAGTCACAGGAACAGCAACAGCGTCAGAACACCCTCGTTCTGAAAATGATGACCGAACAGACACGGCTGCTCAAGAAGATCGCTGACTCTTCGGGCGGCGGTTTGGCCTCTGGCATTTCTACCATTGACGATCTGCTCGATATCTTCGCGCACGGCAAGGGCAAGTTCCGTGGACTGGCGCGTCGTCTGAAATCGGTTGTCAAACTGCCTGTTCTCGCAATGGGCATGCTGGTAGGCGGTGTCACCGATGCATTCGGCGTCATTGCCGCCGCTGGTAAGAACATTTTGAAGCCGCTTGTCGGCTTGAAAAACCTGATCAATGCTTCCGGCATGTTCAAGGTATTCGGCAAGCTGACTCCGCTGTTGAAGGTTTTCCCGAAGCTGCTAGGCAAGCTGTTTATCCCTATTACCATTATCATGGGCATTGTCGATGCGTTCAGCGGTTGGTCCAATGCTTCCGAGTACCTTGGTAAGTTGAACGTATCGTTCGGCGACAAGGCGGCGTCCGCTATCGGCACGTTGGTAAACGGTCTGTTGCTCGGCATTCCGAATTGGCTGCTTGAACAGTTCGGCGGAACGAACCTGCAAAAGCTGATGATGCATGCCAAGGATGGCATCTGGCGTGGTATTACTGCGCTTGGTTCCGGCCTTGGCACGATCACCGAAGGCGCGTTCAACTGGATTTCAAAGCAGGTATCCGACATTCGTCAGTATCTTCCTACCCCCGGCCAAGTAGCAGACATGGCATGGGCCGCTTTTGATAGCGTCAAGACGGCTGTTTGGGATGGTCTTACCTACATCGGTACGTCTATTCGCGACGGCTTCTATGACTTCTTCAAAAACATGGGCGATGCGATCAAGGAATGGTGGAAAAATCCGTTCGGTCCAGTACCGAACATGTTCAACGGCATAAGCTCTGGTAGTACGTCGCAAACTCCCGGTGCGCAGATTGGCCGTATGTTCCGGCCTCAACAGCAATCGGCTCCGGTCTTGTCGGGTGGCGGTGCATCTCCTACCAACATCTCGTTCCGTGCGCCGAATTCGCCGTACGATATCCCGGCAACGCAAAGCGAATACGATTCTGGCGGTGCGGGCGGTGGCTGGCGTCCCGGTATGGGCGCGTCGGCTCCCGATAGCTACTACGGCGGTGGGGGTGTCGGTGCCCCGAGTGGTGGTCCTACCGTTGCTGCTACTGGTAAGCCAACTGCGCCGCGTACAGTGCCGCAAGACGCATCCGACTTCAAACCGGGAATGAATTACGCCGATATTTCCACGGTTCTGGTAAATAAACTGAAGCAGGATTTTGGAGTTACCGAAGAGCAGGCTTATGGCGTTATTGGTAACTTCGGCCACGAGACAGATGGATTCCGCGCCAATCAGGAATACAATCCACTAGGTGGTGGCCGTGGTGGATTTGGCTTGGCTATGTGGACAGGCCCGCGCCGCAAGGCGTTTGAAGCGTGGGCTGCACAAAACGGTCTTGATCCTGCTGATCGTAATATTCAGTACCAATGGTTCAAGAAGGAAGTTACTACCGGCGAATACGCGGGCGTCATGGACAAAGTGAAGCAGACAGGCAATCGCTATGAGTCGGCTGATGTTTTCCAGCGCAGCTACGAAAAAGCCGGTGTGGTAAATCAGCCAAGCCGCAACCGTTATGCCGATATGGCCGCTAATGGTTATACCGGCCCTATGCAGTCGCCAGTAATGGTAAACGGCGACGTTCCCGGCGTCACCATGAGTAACCAGAACAGTAAGCGTAGCGACGATATTCAGCCTCAGTTGAAGGCGAATGTCCGCGATACGGTTACCCAAGCTCTCGGCGCTGACTACAAAGTCGATGTGTATTCTGGCGGTCAGTCGGCGACCAGCAATGGTAGAACCGGCTCCAATCGACACAACGACGGACATGCTGGCGATGTTCGTATTACCCGACCGGACGGTAGTCCTTTGTCGGATGAAGAAGCGGCGAAAGTGTCGCAGCTTTATCTGGCGAAAGGTTACGGCTCGGTTGGCTTGCGCATGTCTGGTGGTGGTATCCATTTTGACGATTTTACCAAAGACAAACTCGGTCCCGGTCAGTCGAATATCTGGAACTACAACAAAGACGGCGGTTACTTCCCGGCTGACTTGCAGTCACAGGTAGCTCGCGGTCTTACTGGCGAAAGGCCAGCGGGGTTGCGCTATACCGATGCTGAAATGGCAGCGATGAAGAACAAAGGCAGTGTCGGTGCGGCTGGCTTGCCTTCTGGCCCCGGCCTTGGCGCTTTGGTAAAAGAGCATCAACAGCCGCCGTCAACTCTTCCTGCCGCTAGTGCTGGAAGCGGTGGCGGTGCGTTACCTACCTACGATAGCCTAATGCCGAAAAACGCCCCGGCGCAAGCGGAACTAGCTCCGGTGCAGAATCCGCAAACGGGTGGCGTGTCGGCTGACTCGAATGGTAACAAAAACGGGCCGTTCGCGTCTGGCGATCTCAGGGTGAAGGACGTGCCGTCGACTGACGAATGGAAGATGATGTTTGTCAACGGCTCTGCCGTTACGTAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0016020 | membrane | cellular component | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(4dTKa)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50