Protein

Protein accession
A0A5S9MM99 [UniProt]
Representative
4lMV0
Source
UniProt (cluster: phalp2_2236)
Protein name
Tail length tape-measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MADKPVGNMKFGIGFDGLDESLNTLDKLNRAIRQTESAMKTNISTMDKANKTAADYAQQEEDLAKAFELQAKKIQLLEKRKAAQIEQYGKESSAVAKTNNEINKASAVYNKYSRDLEKASQGYIIASSGVDKYNKALSENEKQMKQEVSALKSAGDKTGAYAAQKKGLEKQASLTTKAIAAQENVVKVLTKEYGANSKQVQDAQAKLDSYRRQQTITSKQIEGTNKSIREGANSFKGLNDEMGKSTTSGEKAVKSLGKVTSGLGSMVAGIGKVVGKVGLGALLTIGNKAIGAVSNNLDGAIKRIDTLANSTRAFENMGFSADHTAKAMKNISKAIDGLPTALNDSVSNVQLLAASTGDLDLSVDVYKALNDAILGFGGDANMANNAIVQLSQSFSNGKIDAQTWNSMINSGLGPTLNALAKTMGKTTGELKEGLSEGKISVKEFQEGLIKLDKEGGGGLKSLEQIVKDSTKGIGTSLANAKTAMVRGTAELIKSADQVLANMNLPTIAELISNSGKKVENVMKAVAAGLPELAKNFSWVGGVFSTIGDLSGKALVKVGEFVTNFKNSLGGLIDYIGKVWSGKATNGDQYILAQLGFDYETIWKLEDFTKQFKEKADLFGGYIQGFWKLFSSDEATSLDGYSILRQLGMSPEAITTLETNVETVKTTIDTFGDVLKKVAEFGLHAFYTEIEQLYNFFIEKIAPEIMPMLDSMTGSFDRLGKMFESTSGKVNGMKVAFDIAFSFIIGRLQGLWTIAKGVFIALQIALEVFAAVFVGIWQTLMYTLTGNWSKAWDSIKLMVKRVGEAIWLNIKDTFLGKIVTTLWDFFTENEKVFTGVWDTITGILFAVPNFVFGIFEEVPKKIVEAINNGKSAVVGAFKGVFNAALKAIGKPVNGIIKGASWVLEKLGGEPLKEWDVSKYAYANGTPEGGHPINGPMMVNDGRGAETVITPDGRAFIPKGRNVVLNAPKGTHVLTAEETAQLQGSKAPKYRYKKGTNFFGKLWSNVKGFAGNVGSKLKNVVGDVWDFVSNPEKLAKKVLGGLDVLGGLTKYPLDVGKGILSKATSALTSKITEMFSASSGNLDLGTGTAGVYKYLADVAKSVLSKYPGFQITSGYRPGDPHSHGKRNAIDIALPGVTGGSPRYAEAANYAFEKFAGKVGYVITNGKVRDRSGQSGTGIHNDWRPWPDGDHYDHVHINGIKDPQNTQISGENIGGSGVDRWRGVATQALKMTGQYSASNLAALLNQMRTESNGNPTAINNWDINALNGTPSKGLLQVIDPTFRQYAMPGHNTNIFDPLSNILASIRYALSRYGSLTNAYRGVGYENGGIITKQHMALVGEGNKEEVVIPLTGSGLKRSRAMQLLAYASEKLGASTASISSTAGNTSNVSDMAQMLVLMQEQNQLLMAILAKDTNVVLDGTKLNSKLESIRNTQQINNNRNMGLI
Physico-chemical properties
protein length: 1441 AA
molecular weight: 154808,6 Da
isoelectric point: 9,20
hydropathy: -0,22
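A hydropathy figure of this kind is commonly reported as a GRAVY score, the mean Kyte-Doolittle hydropathy over all residues. The page does not say which scale or tool produced its value, so the scale below is an assumption and the function name `gravy` is purely illustrative. This is a minimal sketch, not the database's own method.

```python
# Kyte-Doolittle hydropathy scale.
# Assumption: the page does not name the scale behind its hydropathy value.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean hydropathy over all residues (hypothetical helper)."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence.upper()) / len(sequence)


# First nine residues of the protein sequence listed above.
print(round(gravy("MADKPVGNM"), 3))
```

Passing the full 1441-residue sequence instead of the nine-residue prefix gives a value that can be compared with the hydropathy listed above, bearing in mind that a different scale would give a different number.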
Representative Protein Details
Accession
4lMV0
Protein name
4lMV0
Sequence length
322 AA
Molecular weight
35254,01890 Da
Isoelectric point
9,51806
Sequence
SHGKRNAIDIALPGVTGGSPRYTEAANYAFDKFASKIGYVITNGKVRDRSGQSGTGIHDDWRPWPAGDHYDHVHLNGVKDPQNTQISGDSVGGSGVERWRNVAIRALKMTGQYSTANLNALLNQMRTESNGNPNAVNNWDINAKNGTPSKGLLQVIDPTFRQYAMPGFNSNIFDPLSNILASIRYALSRYGSLTNAYRGVGYENGGIITKEHIARVGEGNKEEVVIPLTGSGLKRSRAMQLLAYANEKLNKQSTSTTVTTSNSNSDSNLQLIIALMQQQNELLVQLLEKNTDVLIDGKSLNRELQKINKTEQRNTNRALGLI
Other Proteins in cluster: phalp2_2236
Total (incl. this protein): 12; Avg length: 1209,8; Avg pI: 9,12

Protein ID Length (AA) pI
4lMV0 322 9,51806
1bSos 460 9,11610
CHlm 414 9,37108
A0A060ANI0 1074 8,43712
A0A6V7BBX8 1439 9,11075
A0A8K1BM39 1441 9,19946
A0A8S5TN96 2177 8,51558
A0A9Y1MRG7 1438 9,32956
A0AAF0AD52 1438 9,32956
A0AAX4PN19 1437 9,16555
A0AAX4PP86 1437 9,16555
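The summary figures for the cluster can be recomputed from the member table. The sketch below copies the eleven rows listed above and adds this protein (A0A5S9MM99) with the length and pI from its physico-chemical properties. The helper names are illustrative, and the values use the page's decimal commas, which are converted before parsing.

```python
# Rows copied from the member table; the last row is this protein,
# taken from its physico-chemical properties (1441 AA, pI 9,20).
ROWS = """\
4lMV0 322 9,51806
1bSos 460 9,11610
CHlm 414 9,37108
A0A060ANI0 1074 8,43712
A0A6V7BBX8 1439 9,11075
A0A8K1BM39 1441 9,19946
A0A8S5TN96 2177 8,51558
A0A9Y1MRG7 1438 9,32956
A0AAF0AD52 1438 9,32956
A0AAX4PN19 1437 9,16555
A0AAX4PP86 1437 9,16555
A0A5S9MM99 1441 9,20
"""


def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma number such as '9,51806' to a float."""
    return float(text.replace(",", "."))


def cluster_averages(rows: str) -> tuple[int, float, float]:
    """Return (member count, mean length, mean pI) for 'ID length pI' rows."""
    lengths, pis = [], []
    for line in rows.strip().splitlines():
        _protein_id, length, pi = line.split()
        lengths.append(int(length))
        pis.append(parse_decimal_comma(pi))
    return len(lengths), sum(lengths) / len(lengths), sum(pis) / len(pis)


count, avg_len, avg_pi = cluster_averages(ROWS)
print(count, round(avg_len, 1), round(avg_pi, 2))  # prints: 12 1209.8 9.12
```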
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_9298 (7ls5w) 2 73,2% 202 1.544E-83
2 phalp2_26048 (76gHp) 1 37,0% 321 1.807E-62
3 phalp2_10744 (3mWmf) 5 37,3% 238 3.985E-33
4 phalp2_33337 (6BBGM) 4 39,6% 207 9.986E-33
5 phalp2_72 (7Qk8L) 13 32,2% 326 1.154E-31
6 phalp2_11380 (3UBu) 3 39,2% 219 3.316E-30
7 phalp2_13621 (5tXtb) 6 38,9% 203 6.101E-30
8 phalp2_13834 (7en41) 2 30,7% 267 1.010E-19

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Enterococcus phage vB_EfaS-DELF1 [NCBI] 2683673 No lineage information
Host No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
LC513943 [NCBI]
CDS location
range 5645 -> 9970
strand +
CDS
ATGGCAGATAAACCAGTAGGAAATATGAAGTTTGGGATTGGTTTTGATGGATTAGACGAATCCCTAAATACTTTAGATAAACTAAATAGAGCAATTAGGCAAACAGAGTCTGCAATGAAAACAAACATATCCACTATGGATAAAGCTAATAAAACAGCAGCAGACTATGCCCAACAAGAGGAAGACTTAGCAAAAGCTTTTGAATTGCAAGCAAAGAAAATCCAACTTTTGGAAAAACGCAAAGCTGCACAGATTGAGCAATATGGTAAAGAATCCTCAGCAGTAGCAAAAACAAACAACGAAATAAACAAAGCATCTGCAGTATACAATAAATATTCACGTGATTTAGAAAAAGCAAGTCAAGGCTATATCATTGCATCCAGCGGAGTAGATAAGTATAATAAAGCATTGTCTGAAAATGAGAAACAAATGAAGCAAGAGGTTTCAGCATTAAAGAGTGCTGGAGATAAAACTGGTGCATACGCTGCTCAGAAAAAAGGATTAGAAAAACAAGCGTCATTAACTACGAAAGCAATTGCAGCTCAAGAAAACGTGGTTAAAGTACTAACTAAAGAATATGGTGCTAACTCTAAACAGGTTCAAGATGCACAAGCAAAATTAGATAGTTATAGACGTCAACAAACAATCACATCTAAGCAAATTGAAGGTACGAATAAAAGTATCAGAGAAGGCGCTAACTCATTCAAAGGGCTAAACGATGAAATGGGTAAATCAACTACTAGTGGAGAGAAAGCTGTAAAATCGTTAGGTAAAGTCACCTCTGGCTTAGGAAGTATGGTTGCTGGAATCGGTAAAGTGGTTGGTAAAGTTGGACTTGGAGCCCTGCTTACTATTGGTAATAAAGCAATTGGAGCTGTTAGCAACAACCTAGATGGCGCAATTAAGCGTATTGACACATTGGCAAACTCAACAAGAGCTTTTGAAAATATGGGATTTTCAGCAGACCATACAGCTAAAGCAATGAAAAACATTTCTAAAGCCATTGATGGATTACCAACAGCATTGAATGACTCAGTAAGTAACGTTCAGTTATTAGCTGCTTCAACTGGCGACCTTGACCTATCAGTTGATGTGTACAAAGCATTGAACGATGCCATCCTTGGTTTTGGCGGAGATGCCAACATGGCAAACAATGCTATTGTTCAGTTGTCACAATCATTTTCTAATGGTAAAATTGATGCGCAAACATGGAACTCAATGATTAACTCAGGTCTAGGACCTACTCTGAACGCATTAGCCAAAACAATGGGTAAGACTACAGGTGAACTTAAAGAAGGTTTATCAGAGGGTAAAATTTCTGTAAAAGAGTTTCAAGAAGGATTAATTAAACTAGATAAAGAAGGCGGCGGAGGTCTTAAGTCATTAGAGCAAATAGTTAAGGACTCAACCAAAGGTATTGGAACATCTTTAGCCAACGCAAAAACAGCAATGGTTAGGGGTACTGCTGAACTAATAAAATCTGCAGACCAAGTGTTAGCCAATATGAACCTACCTACAATTGCTGAGTTAATATCAAATTCAGGAAAAAAAGTAGAAAATGTTATGAAAGCTGTAGCTGCTGGACTTCCTGAACTTGCTAAAAATTTCTCATGGGTTGGTGGAGTGTTCTCAACAATTGGAGACCTTTCAGGTAAAGCACTAGTTAAAGTGGGAGAGTTTGTTACAAACTTTAAAAACTCACTTGGTGGATTAATTGACTATATTGGTAAAGTATGGAGCGGTAAAGCGACTAACGGTGACCAGTATATCTTAGCGCAACTTGGTTTCGATTATGAAACAATTTGGAAACTAGAGGACTTTACTAAACAGTTTAAGGAGAAAGCAGACCTATTTGGTGGATATATCCAAGGTTTTTGGAAATTATTTAGTTCAGATGAAGCCACCTCATTAGACGGTTATTCAATCTTAAGACAACTAGGAATGTCACCTGAGGCTATAACTACACTTGAAACAAATGTAGAAACAGTTAAAACAACCAT
TGACACATTTGGTGACGTACTTAAGAAAGTGGCAGAATTTGGATTACACGCATTCTATACAGAAATAGAGCAATTGTATAATTTCTTTATTGAAAAGATTGCTCCAGAAATTATGCCTATGCTTGATTCAATGACAGGTTCATTTGACCGTTTAGGAAAAATGTTTGAGTCAACTAGTGGAAAAGTTAACGGTATGAAGGTTGCTTTTGACATAGCCTTTAGTTTCATTATTGGACGTTTACAAGGCTTGTGGACAATTGCTAAAGGTGTGTTCATTGCTTTACAAATTGCTCTTGAAGTATTTGCCGCAGTATTCGTTGGTATTTGGCAAACACTAATGTACACTCTTACTGGTAACTGGAGCAAAGCTTGGGACTCAATTAAGTTAATGGTCAAACGTGTTGGTGAGGCAATATGGTTAAACATTAAAGACACATTCCTAGGTAAGATTGTAACAACATTATGGGACTTCTTCACAGAAAACGAGAAGGTTTTCACAGGTGTATGGGACACAATCACTGGTATACTATTCGCAGTACCTAACTTTGTATTTGGTATTTTTGAGGAAGTACCTAAGAAAATAGTTGAAGCTATTAATAATGGTAAATCAGCAGTTGTGGGAGCATTCAAAGGTGTTTTCAATGCCGCACTTAAAGCCATTGGTAAACCAGTCAATGGAATTATCAAAGGTGCTTCATGGGTGCTTGAAAAATTAGGTGGAGAACCACTTAAAGAATGGGATGTATCTAAATACGCTTACGCTAACGGTACGCCAGAGGGTGGACACCCAATTAACGGTCCAATGATGGTCAATGATGGACGTGGAGCAGAAACAGTTATCACTCCAGATGGTAGAGCGTTTATTCCTAAAGGACGTAACGTAGTACTAAATGCACCAAAAGGAACACATGTCTTAACAGCAGAAGAAACAGCACAGCTTCAAGGTTCTAAAGCACCTAAGTATCGTTACAAAAAAGGTACTAACTTCTTTGGGAAATTGTGGAGTAATGTTAAAGGTTTCGCGGGTAATGTAGGTAGCAAACTCAAAAATGTTGTAGGAGATGTGTGGGATTTTGTATCTAATCCAGAAAAGCTAGCTAAAAAAGTTTTAGGTGGATTAGATGTTTTAGGTGGGTTAACTAAATATCCATTAGATGTTGGTAAAGGTATTCTAAGTAAAGCAACTAGTGCATTAACAAGCAAGATTACAGAAATGTTCTCAGCAAGTTCAGGTAACTTAGACCTAGGCACAGGAACCGCAGGTGTTTATAAGTATTTAGCAGATGTTGCTAAATCAGTATTAAGTAAATACCCTGGTTTCCAAATAACCTCAGGTTACAGACCAGGAGACCCTCATTCACATGGTAAACGTAACGCAATTGATATTGCTCTACCTGGGGTAACAGGTGGCTCACCTAGATATGCAGAAGCCGCAAACTATGCGTTTGAGAAATTTGCTGGTAAGGTTGGATATGTTATCACTAATGGTAAAGTACGTGACCGTTCAGGACAATCTGGAACAGGTATTCACAACGACTGGAGACCTTGGCCTGATGGAGACCATTACGACCACGTACACATTAACGGTATCAAAGACCCTCAGAACACACAAATTTCAGGAGAAAACATTGGAGGTAGTGGCGTAGATAGATGGCGAGGAGTGGCAACGCAAGCCCTCAAAATGACTGGTCAATACAGCGCTAGTAACTTAGCAGCACTACTTAACCAAATGCGTACAGAGTCAAATGGTAACCCTACCGCAATCAACAACTGGGATATCAACGCATTGAATGGTACACCGTCTAAAGGATTGCTTCAAGTGATTGACCCAACATTTAGACAATACGCAATGCCTGGTCACAATACTAACATCTTTGACCCATTATCTAACATCTTAGCTTCAATTAGATATGCGCTATCAAGATATGGTTCACTAACTAATGCGTATCGTGGAGTTGGTTACGAAAATGGTGGAATCATCACTAAGCAACACATGGCAC
TAGTTGGAGAAGGCAACAAAGAAGAGGTTGTAATTCCATTAACAGGTTCAGGACTTAAACGTTCAAGAGCTATGCAACTACTGGCATATGCTAGTGAGAAATTAGGAGCTAGCACTGCATCAATTAGTAGTACAGCTGGCAACACGTCAAACGTTTCAGATATGGCACAAATGTTGGTACTTATGCAAGAACAGAATCAATTGTTGATGGCTATTCTAGCAAAAGACACTAATGTGGTACTAGACGGAACTAAGTTGAATAGTAAGTTAGAAAGCATTAGAAATACACAACAAATTAACAACAATCGTAACATGGGGTTAATTTAA
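The CDS coordinates and the protein length can be cross-checked with a little arithmetic and a codon table. The sketch below assumes the range is 1-based and inclusive and that the standard codon assignments apply; neither is stated on the page. The function names are illustrative. Only the first and last few codons of the CDS are translated, to keep the example short.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: the page does not
# state which translation table was used for this phage CDS).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(start: int, end: int) -> int:
    """Residue count for a 1-based inclusive range that ends in a stop codon."""
    n_nt = end - start + 1
    if n_nt % 3:
        raise ValueError("range length is not a multiple of 3")
    return n_nt // 3 - 1  # subtract the stop codon


print(expected_protein_length(5645, 9970))       # prints: 1441
print(translate("ATGGCAGATAAACCAGTAGGAAATATG"))  # prints: MADKPVGNM
print(translate("AACATGGGGTTAATTTAA"))           # prints: NMGLI*
```

The range 5645 to 9970 spans 4326 nucleotides, or 1442 codons including the stop, which agrees with the 1441 AA protein length. The translated fragments match the first nine and last five residues of the protein sequence.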

Gene Ontology

GO ID Description Category Evidence (source)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4lMV0) rather than this protein.
PDB ID
4lMV0
Method AlphaFold v2
Resolution 79.85
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
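The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows. The function name `plddt_band` is illustrative. Because the legend uses strict inequalities, it does not say where scores of exactly 90, 70, or 50 belong; assigning them to the lower band is an assumption made here.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band from the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower
    band, since the legend only gives strict inequalities.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```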