Protein

Protein accession
A0AAX4PP86 [UniProt]
Representative
4lMV0
Source
UniProt (cluster: phalp2_2236)
Protein name
Tail length tape measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAQDRPIGNMKFGVGFDGLDESLNTLDKLNRAIKQTESAMKTNISTMDKGNKTASDYARAEADLTRAYELQAKKIALLEKRKEEQIKAHGAESAAVARTVDQINKASTKYNQYNKEIDKNKQAHIIASSGVDKYSKAIKNNESAMKDEISALRSAGDKVGAYKAQKQGLLKQEELTTKAIKAQENVVKQMTKQFGENSTQVASAKEKLETFKRQQQITSKQIEGTNKGLKESSNAFRGLNDEMSKSEPASKKAVNGLGKVTSGLGSMLGGIGKVVGKVGLGALLTIGNKAFNAVSSNIDSAIKRIDTLANSSRAFQNMGFDANNTAKAMKNISKAIEGLPTALDDSVSNVQLLAASTGDLDLSVDVYKALNDAILGFGGDANMANNAIVQLSQSFSNGKIDAQTWNSMINSGLGPTLNALAKQMGKTTGELKDGLSEGKISVKDFQTALIKMDKEGGGGLKSLEQIAKDSTKGISTSLANAKTAMVRGTAELIKSVDAMLANMNLPTISEFITNAGTNMENFMKKVAAVLPEIAKNFVWVGDVFDGLKEIFANAMDEVDAFIFNFKNGFTQMVDFVKKVWSGDVQSNDYYLLRLAGLDYETIWKIEDFIKVFKEKAETFGEYVQGFWKLFSNDEATSLQGYSMLRTLGMTQDQITGLETAVENTKTFLSGLKDVVLDLADLALYNLNQEFNTLYKFFTEEVGPDILPLMQSWTDSLKRMSDMFKTTGDSANGFKTVIVVAWKIMEDRFQGLWIIVKTIMIAIQITIETVTASIAGILNTFGLIITGQWSRAWDEIKNTGDRIWAAIWGGIKNTFLGKILITLGEFYERNKKIFEGIWETITGILFAVPGFVYDIFTKVPQRMVEAIKGGKNAVVAAFKEVFNAALRAIGKPVNGIIKGASWVLEKLGAEPLTEWDVPQYATGTPAGGHPINGPMMVNDGRGAETVITPDGRAFIPKGRNVVLNAPKGTHVLTAEETAQLQGSKAPKYRYKKGTNFFGNMWDSVKNIAGNVGNTLKNVVGDVWDFISDPGALARKVLGGLDVLGGLTKYPLEVGKGILSKATSALTEKITDLFSSGNLDTSIGTNGVYKYLADVAKSVMKKFPGFMVTSGYRPGDPYSHGKRNAIDIALPGVTGGSPRYTEAANYAFDKFASKIGYVITNGKVRDRSGQSGTGIHNDWRPWPDGDHYDHVHLNGVKDPQNTQISGDSVGGSGVERWRNVAIRALKMTGQYSTANLNALLNQMRTESNGNPNAINNWDINAKNGTPSKGLLQVIDPTFRQYAMPGFNSNIYDPLSNILASIRYALSRYGSLTNAYRGVGYEYGGRVTKEHIARVGEGNKEEVIIPLTGSGLKRSRAMQLLAYANQKLGKKNDTPTPLSGNNSNQDLSILINLMQQQNELLMALLDKSPTIELDGRKVSKEITKYQESETRTRNRQMGLI
Physico‐chemical
properties
protein length:1437 AA
molecular weight:156590,5 Da
isoelectric point:9,17
hydropathy:-0,31
Representative Protein Details
Accession
4lMV0
Protein name
4lMV0
Sequence length
322 AA
Molecular weight
35254,01890 Da
Isoelectric point
9,51806
Sequence
SHGKRNAIDIALPGVTGGSPRYTEAANYAFDKFASKIGYVITNGKVRDRSGQSGTGIHDDWRPWPAGDHYDHVHLNGVKDPQNTQISGDSVGGSGVERWRNVAIRALKMTGQYSTANLNALLNQMRTESNGNPNAVNNWDINAKNGTPSKGLLQVIDPTFRQYAMPGFNSNIFDPLSNILASIRYALSRYGSLTNAYRGVGYENGGIITKEHIARVGEGNKEEVVIPLTGSGLKRSRAMQLLAYANEKLNKQSTSTTVTTSNSNSDSNLQLIIALMQQQNELLVQLLEKNTDVLIDGKSLNRELQKINKTEQRNTNRALGLI
Other Proteins in cluster: phalp2_2236
Total (incl. this protein): 12 Avg length: 1209,8 Avg pI: 9,12

Protein ID Length (AA) pI
4lMV0 322 9,51806
1bSos 460 9,11610
CHlm 414 9,37108
A0A060ANI0 1074 8,43712
A0A5S9MM99 1441 9,19946
A0A6V7BBX8 1439 9,11075
A0A8K1BM39 1441 9,19946
A0A8S5TN96 2177 8,51558
A0A9Y1MRG7 1438 9,32956
A0AAF0AD52 1438 9,32956
A0AAX4PN19 1437 9,16555
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_9298
7ls5w
2 73,2% 202 1.544E-83
2 phalp2_26048
76gHp
1 37,0% 321 1.807E-62
3 phalp2_10744
3mWmf
5 37,3% 238 3.985E-33
4 phalp2_33337
6BBGM
4 39,6% 207 9.986E-33
5 phalp2_72
7Qk8L
13 32,2% 326 1.154E-31
6 phalp2_11380
3UBu
3 39,2% 219 3.316E-30
7 phalp2_13621
5tXtb
6 38,9% 203 6.101E-30
8 phalp2_13834
7en41
2 30,7% 267 1.010E-19

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Enterococcus phage vB_Efm3_KEN18
[NCBI]
3135750 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
PP582173 [NCBI]
CDS location
range 32296 -> 36609
strand -
CDS
ATGGCACAAGATAGACCTATAGGAAATATGAAATTTGGGGTTGGATTTGATGGTTTAGATGAGTCTTTAAATACATTAGATAAGTTAAATAGAGCAATTAAACAGACAGAATCAGCAATGAAAACAAACATTTCTACAATGGACAAAGGTAATAAAACCGCCTCTGACTACGCACGTGCAGAAGCAGATTTAACGAGAGCATATGAATTACAAGCTAAAAAAATTGCATTACTAGAAAAAAGAAAAGAAGAACAAATCAAAGCACATGGTGCTGAATCAGCAGCCGTTGCTAGAACAGTTGACCAAATTAACAAAGCCTCAACAAAATACAACCAGTACAATAAAGAGATTGATAAAAACAAACAGGCACATATTATCGCTTCAAGTGGTGTAGATAAATATAGTAAGGCTATTAAGAACAATGAATCAGCCATGAAAGATGAAATTAGTGCCTTACGTAGTGCTGGCGATAAGGTAGGAGCATATAAAGCACAAAAACAAGGTTTACTGAAACAAGAAGAGTTAACAACAAAAGCAATTAAAGCTCAGGAAAACGTAGTAAAACAAATGACCAAGCAATTTGGTGAAAACTCAACACAAGTTGCTAGTGCTAAAGAAAAACTAGAAACCTTTAAGCGACAACAACAAATAACAAGTAAGCAGATAGAAGGAACAAACAAAGGACTGAAGGAGTCATCTAATGCTTTTCGTGGGTTAAATGATGAAATGTCCAAATCTGAACCAGCTTCTAAAAAGGCAGTAAATGGGTTAGGAAAAGTAACTTCTGGTTTGGGGAGCATGCTTGGTGGGATTGGTAAAGTTGTAGGTAAAGTTGGATTGGGTGCGTTACTAACTATTGGTAACAAAGCGTTTAATGCCGTTTCATCAAACATAGATTCAGCCATTAAACGTATTGACACCTTAGCAAACTCTTCAAGAGCCTTCCAGAATATGGGATTTGACGCTAATAACACAGCTAAAGCAATGAAAAACATCTCTAAAGCAATTGAAGGATTACCAACAGCGTTAGATGATTCTGTATCAAACGTTCAATTATTAGCTGCCTCAACAGGAGACTTAGATTTATCTGTAGATGTTTATAAGGCATTAAACGATGCAATCCTTGGTTTTGGTGGAGACGCAAACATGGCAAACAATGCCATTGTTCAGTTGTCTCAATCATTTTCTAATGGTAAAATAGACGCTCAAACTTGGAACTCCATGATTAACTCAGGACTTGGGCCTACTTTAAACGCATTAGCTAAGCAGATGGGTAAAACGACTGGTGAGTTGAAAGATGGTCTTTCAGAAGGTAAAATATCAGTAAAAGATTTTCAAACAGCCTTAATTAAGATGGATAAAGAAGGCGGCGGCGGTCTTAAGTCATTAGAACAGATTGCCAAAGACTCTACAAAAGGTATCAGCACCTCACTAGCAAATGCTAAAACAGCGATGGTGCGTGGTACAGCAGAACTAATTAAGAGTGTGGACGCAATGCTTGCAAACATGAACTTACCAACAATCAGCGAATTTATTACAAACGCTGGAACAAACATGGAAAACTTCATGAAAAAAGTTGCTGCGGTGCTACCAGAAATAGCAAAAAACTTTGTTTGGGTTGGAGATGTATTTGATGGACTCAAAGAAATTTTTGCAAATGCAATGGATGAAGTGGATGCCTTCATATTCAACTTCAAGAATGGTTTCACGCAAATGGTGGACTTTGTTAAAAAAGTGTGGAGTGGTGATGTACAAAGTAATGACTATTATCTACTAAGATTAGCAGGACTTGATTATGAAACGATTTGGAAAATAGAAGATTTCATCAAAGTCTTTAAAGAAAAAGCAGAAACATTTGGGGAGTATGTACAAGGATTCTGGAAACTATTTTCTAACGATGAAGCCACTTCATTGCAAGGTTATTCCATGTTGCGAACACTGGGAATGACTCAAGATCAGATAACTGGATTAGAAACGGCAGTAGAAAATACTAAGACATTTTTAAGTGGTTTAAAGGATGTTGTGTTAGACTTAGCAGACCTAGCTTTGTATAACCTAAATCAAGAGTTTAATACTCTTTATAAGTTTTTCACAGAAGAGGTAGGGCCAGACATTCTTCCGCTTATGCAATCATGGACAGACAGTTTAAAACGTATGAGCGATATGTTTAAAACAACAGGTGACTCTGCTAATGGATTTAAAACAGTTATTGTTGTAGCATGGAAAATAATGGAGGATAGGTTCCAAGGACTATGGATTATTGTTAAAACAATAATGATTGCCATACAAATCACTATCGAAACAGTAACAGCCTCAATTGCAGGAATACTAAACACCTTTGGACTTATCATTACAGGACAATGGAGTAGAGCATGGGATGAAATTAAGAATACTGGTGATCGCATATGGGCAGCCATTTGGGGTGGTATTAAAAACACGTTCTTAGGTAAAATACTTATCACATTAGGCGAGTTCTACGAAAGAAACAAAAAGATTTTTGAAGGCATTTGGGAGACAATCACAGGTATACTATTTGCCGTACCTGGTTTTGTGTATGATATCTTCACGAAAGTACCTCAACGAATGGTGGAAGCCATCAAGGGTGGTAAAAACGCAGTAGTTGCAGCATTCAAGGAAGTATTCAATGCAGCACTAAGAGCCATTGGTAAACCAGTAAATGGTATTATCAAAGGAGCTTCTTGGGTACTTGAGAAATTAGGCGCTGAACCTCTGACAGAGTGGGACGTACCACAATATGCGACAGGTACACCAGCAGGTGGTCACCCAATTAATGGCCCAATGATGGTTAATGATGGACGAGGAGCAGAAACAGTTATCACACCAGATGGTAGAGCATTCATACCTAAAGGGCGCAACGTGGTATTAAATGCACCAAAAGGAACACATGTCTTGACAGCAGAAGAAACAGCTCAGCTTCAAGGTTCAAAAGCACCAAAATACCGTTACAAAAAGGGTACTAACTTCTTTGGTAACATGTGGGACAGTGTGAAAAATATTGCTGGTAATGTAGGAAACACACTGAAAAATGTAGTAGGTGACGTGTGGGACTTCATTTCAGACCCTGGCGCATTAGCTAGAAAAGTACTGGGTGGCTTAGATGTATTAGGTGGATTAACTAAGTATCCATTAGAAGTAGGTAAAGGTATTCTATCTAAAGCAACAAGTGCCCTGACTGAAAAGATTACTGATTTGTTCTCGTCTGGTAACCTAGATACCTCTATAGGAACAAATGGTGTCTATAAATATTTAGCAGATGTAGCTAAATCAGTAATGAAGAAATTCCCAGGCTTTATGGTAACTAGTGGGTATAGACCAGGTGACCCCTATTCACATGGTAAACGTAATGCTATTGATATTGCACTACCTGGTGTTACAGGTGGCTCACCACGCTACACAGAAGCGGCAAACTATGCGTTTGATAAATTTGCCTCTAAGATTGGTTACGTAATCACTAATGGTAAAGTTCGTGACCGTTCAGGACAATCAGGTACAGGTATTCATAATGACTGGAGACCATGGCCAGATGGAGACCACTACGACCACGTGCATTTAAACGGTGTGAAAGACCCACAAAACACTCAAATCTCAGGTGATAGTGTGGGAGGCAGTGGAGTAGAAAGATGGCGAAATGTAGCAATTAGAGCATTGAAAATGACTGGCCAATACAGTACTGCAAACTTAAATGCATTACTAAATCAAATGCGTACAGAGTCAAATGGTAATCCTAATGCAATTAACAACTGGGATATTAATGCTAAGAATGGAACGCCATCTAAAGGATTACTCCAAGTGATTGACCCAACATTCAGACAGTATGCAATGCCTGGATTCAACAGTAATATATATGACCCACTATCTAACATCTTAGCTTCAATTAGATATGCTTTGTCAAGATATGGCTCACTAACAAATGCTTATCGTGGAGTTGGTTACGAATACGGTGGAAGGGTCACAAAAGAACACATTGCAAGAGTTGGAGAAGGCAACAAAGAAGAGGTTATAATTCCATTAACAGGCTCAGGACTTAAACGTTCAAGAGCTATGCAACTATTAGCATACGCAAATCAGAAACTTGGTAAAAAAAATGATACACCTACACCATTGAGTGGTAATAATTCTAATCAAGACTTATCCATCTTGATTAATTTGATGCAACAACAAAATGAACTCTTAATGGCGTTACTTGATAAGTCACCTACAATTGAACTTGATGGACGTAAGGTAAGTAAAGAAATTACAAAGTATCAAGAATCAGAAACACGTACTAGAAATAGACAAATGGGATTAATATAA

Gene Ontology

Description Category Evidence (source)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4lMV0) rather than this protein.
PDB ID
4lMV0
Method AlphaFoldv2
Resolution 79.85
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50