Protein

Protein accession
A0A0M5M3L4 [UniProt]
Representative
6nwAd
Source
UniProt (cluster: phalp2_11213)
Protein name
Tail length tape-measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAQDRPIGNMKFGVGFDGLDESINTLDKLNRAIKQTESAMKTNISAMEKGSNSMSDLAQKQRDLTTTTELQAKKVQLLEQRREEYIATYGKESRQVSNVTKQINDASAKYNKYSKELTATKQAYILAESGVDKYTQALKDNEKAMNDEIKAFNSVGDKVGAFNAKQRGLKTQSELTEKAIVAQKNAVDAMTKEFGQASPQVEKAENKLQEFGRQSKIVDAQLDSLSSGVDKAERSMKGFSVVGHAIGTTLGNLASNAISRVTGFIGDMTREAITATDSVSKFKKSMDFAGFGGEEIEKATEQMKGYADTTVYGLEDILNTSAQLASNGIPNYMELTEAAGNLNAVAGGSEETFKSVAMMLTQTAGAGKLTTENWNQLVDAIPGASGLLQEAMLKNGAYTGNFRDAMEKGEISSNEFNQAITQLGMNPGAIEAAKSTDTLGGSWDRLKSTVVNAIQSIIEKIGVENITGFINTLSTKIEEAMPSIANFMGKLGEFAKWIADNRESLTWLVGIIGGITLAIKGLAVASAIFGAISVVAGGLVVALGALVGALVVAYTKSETFRDIVNKAFQTVWNFVKPIIDKLIQGFKDWWTIMSWLWVNISSWASWIGNKFTEMKNSVVNTVINLWNSVKNLFSNGLGDTWNKVAGWAVNVWNKFGELKTNATRAVTDTWTGIKNSFSGGISTVVNWMKDLPHKIASAVSNGKNAVTNAFKGVFNAALRAIGKPVNGIIKGASWVLEKLGAEPLKEWEVPQYATGTPAGGHPINGPMMVNDGRGAETVITPDGRAFIPKGRNVVLNAPKGTHVLTAEETAQLQGSKAPKYRYKKGTNFFGNMWDSVKNVAGNVGNTLKNVVGDVWDFISDPGALARKVLGGLDVLGGLTKYPLEVGKGILSKATSALTEKISGLFSSGNLDTSIGTNGVYKYLADVAKSVMKKFPGFVATSGYRPGDPYSHGKRNAIDIALPGVTGGSPRYTEAANYAFDKFASKIGYVITNGKVRDRSGQSGTGIHNDWRPWPDGDHYDHVHLNGVKDPQNTQISGDSVGGSGVERWRNVAIRALKMTGQYSTANLNALLNQMRTESNGNPNAINNWDINAKNGTPSKGLLQVIDPTFRQYAMPGFNTNIYDPLSNILASIRYALSRYGSLSAAYRGVGYENGGIITKEHIARVGEGNKEEVVIPLTGSGLKRSRAMQLLAYANEKLNRSQSAPVSGTTSSTNSDMAQMLLLLQQQNELLMAILAKDTNVVLDGEKLNSKLQKIQSRNQLNANRDLGLV
Physico‐chemical
properties
protein length:1270 AA
molecular weight:136899,5 Da
isoelectric point:9,21
hydropathy:-0,29
Representative Protein Details
Accession
6nwAd
Protein name
6nwAd
Sequence length
585 AA
Molecular weight
62860,53820 Da
Isoelectric point
5,40581
Sequence
MAQDRPIGNMKFGVGFDGLDESLNTLDKLNRAIKQTESAMKTNISAMEKGSNTMSDLAQKQRDLTTTTELQAKKVQLLEQRREDYIATYGKESRQVSNVTKQINDASAKYNKYSKELSSTKQAYILAETGVDKYTQALKDNEKAMNDEIKAFNSVGDKVGAFNAKQRGLENQADLTEKAIIAQKKAVDLMTKEFGQASPQVEKAENKLQEFGRQSKIVDAQLDSLSSGVDKAEKSMKGFSVVGHAIGTTLGNLASNAISRVTGFIGDMTREAITATDSVSKFKKSMDFAGFGGEEIEKATKQMKGYADVTVYGLEDILNTSAQLASNGIPNYIELTEAAGNLNAVAGGSEETFKSVAMMLTQTAGAGKLTTENWNQLADAIPGASGLLQEAMLKNGAYTGNFRDAMEKGEITSDEFNQAITQLGMNPGAIEAAKSTDTLGGSWDRLKSTVVNAIQDIIEKIGVENITGFINTLSKKIEEAIPKMADFADKLGDFGNWITDNKDNLSWLIGIILGITAAIKGLGLAAAIFGSISLLTGGLVVALGALVGALVVAYTKSETFRNIVNKAFQTVWSIVKPIIDRLVQG
Other Proteins in cluster: phalp2_11213
Total (incl. this protein): 21 Avg length: 848,8 Avg pI: 7,15

Protein ID Length (AA) pI
6nwAd 585 5,40581
1kIHz 607 4,42522
3jJQe 472 5,61827
6YER4 613 4,45745
6YbBI 423 5,36409
70QID 637 4,81394
70VGV 422 5,74713
70rUB 437 4,89437
7MxWs 582 6,21133
7OnsM 440 4,89409
8H7nZ 342 9,62695
CGjW 639 9,89198
NaaS 593 4,36952
A0A4D6A937 1502 9,45534
A0A4D6AAW9 1496 9,50324
A0A4D6AAJ4 1496 9,50917
A0A4D6B1D9 1496 9,49672
A0A976SG79 1274 9,15723
A0AAX4PNL4 1226 9,05969
A0AAX4PPP2 1272 9,13093
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_25006
NPqa
22 20,6% 392 8.834E-10
2 phalp2_8692
7Yp09
17 19,2% 583 2.033E-09

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Enterococcus phage IME-EFm5
[NCBI]
1718158 No lineage information
Host Enterococcus faecium
[NCBI]
1352 Firmicutes > Bacilli > Lactobacillales > Enterococcaceae > Enterococcus >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KT588072 [NCBI]
CDS location
range 22521 -> 26333
strand -
CDS
ATGGCACAAGATAGACCTATAGGAAATATGAAATTTGGCGTGGGATTTGACGGACTAGACGAATCTATAAACACATTGGATAAGCTGAACAGAGCAATTAAGCAAACAGAATCAGCAATGAAAACCAACATTTCAGCAATGGAAAAAGGCTCTAACAGCATGAGTGATCTTGCACAGAAACAAAGAGACTTGACAACTACTACAGAACTTCAAGCAAAAAAAGTGCAATTACTTGAACAACGTAGAGAAGAGTACATTGCCACTTATGGAAAAGAATCTAGGCAAGTATCAAATGTAACAAAACAAATTAATGACGCTAGTGCAAAGTATAATAAGTACTCAAAAGAGTTAACCGCTACCAAGCAAGCATACATTTTAGCAGAATCTGGTGTAGATAAATACACACAAGCTTTGAAAGATAATGAAAAAGCTATGAATGACGAAATTAAAGCATTCAATAGTGTTGGAGATAAAGTAGGCGCATTTAATGCTAAACAGCGTGGATTAAAAACACAATCTGAACTAACTGAAAAGGCGATTGTAGCTCAGAAAAACGCCGTAGACGCAATGACTAAAGAGTTTGGACAAGCTTCACCTCAGGTAGAAAAGGCAGAAAATAAGCTTCAAGAATTTGGTAGACAGTCAAAAATAGTAGACGCGCAGTTGGACTCATTATCATCTGGCGTAGATAAAGCTGAAAGATCAATGAAAGGTTTCTCAGTAGTTGGACATGCTATTGGGACAACACTTGGTAATTTAGCAAGTAACGCCATATCCAGAGTAACTGGTTTCATAGGTGACATGACACGTGAGGCAATTACTGCTACAGACTCTGTTTCTAAGTTTAAAAAGAGTATGGATTTTGCAGGATTTGGCGGAGAAGAAATAGAGAAAGCTACAGAACAAATGAAGGGGTATGCAGACACCACTGTTTATGGATTAGAAGACATTCTGAATACTTCAGCTCAGTTAGCTTCTAATGGTATTCCAAATTATATGGAGCTAACAGAAGCGGCAGGTAACTTGAATGCAGTAGCTGGTGGCTCAGAGGAAACATTTAAGTCAGTAGCTATGATGTTAACACAAACAGCAGGCGCTGGGAAACTAACAACTGAAAACTGGAATCAATTAGTAGACGCAATCCCAGGGGCTTCAGGACTACTACAAGAGGCTATGTTAAAAAATGGCGCGTATACTGGTAACTTTCGTGACGCAATGGAAAAAGGAGAAATCTCTTCAAACGAGTTTAACCAAGCTATTACACAGTTAGGTATGAATCCTGGAGCGATTGAAGCAGCCAAATCAACAGATACACTAGGTGGCTCATGGGATAGATTGAAATCAACGGTTGTCAATGCTATACAAAGTATTATAGAAAAAATAGGCGTTGAAAATATCACTGGTTTTATCAATACATTAAGTACCAAAATAGAAGAAGCAATGCCTTCTATAGCTAATTTCATGGGTAAATTAGGTGAGTTTGCCAAATGGATTGCAGATAACAGAGAGTCACTGACATGGCTCGTAGGTATCATAGGCGGGATAACTCTTGCAATTAAAGGATTAGCTGTAGCTTCAGCCATATTTGGAGCTATTTCTGTAGTAGCAGGAGGGTTAGTTGTAGCATTAGGCGCATTGGTTGGAGCGTTAGTTGTAGCTTATACCAAATCTGAAACATTTAGAGATATAGTAAATAAAGCATTTCAGACAGTCTGGAATTTTGTTAAGCCAATTATAGATAAACTGATTCAAGGTTTTAAAGATTGGTGGACAATAATGTCATGGCTTTGGGTTAACATATCTAGTTGGGCTTCATGGATTGGTAATAAATTCACTGAAATGAAGAACAGCGTTGTGAACACAGTCATAAATTTATGGAACAGCGTGAAAAACTTATTCAGCAATGGACTTGGAGACACTTGGAATAAGGTAGCTGGCTGGGCGGTGAATGTGTGGAACAAATTTGGTGAGTTAAAAACCAATGCGACACGTGCAGTCACAGACACATGGACAGGAATCAAAAACTCATTTAGCGGCGGCATTAGTACTGTTGTTAACTGGATGAAAGATTTACCTCACAAAATAGCTAGCGCAGTATCAAATGGTAAAAATGCAGTCACAAATGCCTTTAAAGGTGTTTTCAATGCAGCACTAAGAGCCATTGGTAAACCAGTGAATGGTATTATCAAAGGTGCGTCATGGGTGCTTGAAAAATTAGGTGCAGAACCTTTGAAAGAATGGGAAGTACCACAATACGCTACAGGTACACCAGCAGGCGGACACCCAATTAATGGGCCAATGATGGTCAATGATGGACGTGGAGCAGAAACAGTGATTACGCCAGACGGTAGAGCATTTATTCCTAAAGGACGTAACGTAGTATTAAATGCACCAAAAGGAACACATGTCTTGACCGCAGAAGAAACAGCCCAGCTTCAAGGTTCTAAGGCTCCTAAATATCGTTACAAAAAAGGGACTAACTTCTTTGGTAACATGTGGGACAGTGTGAAGAACGTTGCTGGTAATGTAGGAAACACACTTAAAAATGTAGTAGGTGACGTGTGGGACTTTATTTCAGACCCTGGAGCATTAGCTAGAAAAGTACTTGGAGGCCTAGATGTATTAGGTGGCTTAACAAAATATCCCTTAGAAGTAGGTAAAGGTATCCTATCTAAAGCAACAAGCGCACTGACTGAAAAGATTAGTGGTTTGTTCTCATCTGGCAACTTAGATACCTCTATAGGAACAAATGGTGTCTATAAATATTTAGCAGATGTAGCTAAATCTGTAATGAAGAAATTCCCAGGATTTGTGGCAACTAGTGGATATAGGCCAGGTGACCCTTATTCACATGGTAAACGCAATGCTATTGATATTGCACTACCTGGTGTTACAGGTGGCTCACCACGCTACACAGAAGCGGCAAACTATGCTTTTGACAAATTCGCGTCCAAGATTGGTTACGTTATCACTAATGGGAAAGTTCGTGACCGTTCAGGACAATCAGGTACAGGTATTCATAACGATTGGAGACCATGGCCAGATGGAGACCATTACGATCACGTGCATTTAAACGGTGTGAAAGACCCACAAAACACTCAAATTTCAGGTGATAGCGTGGGAGGCAGTGGCGTAGAGAGATGGCGTAATGTAGCAATCAGAGCATTGAAAATGACTGGTCAATACAGTACTGCAAACTTAAATGCATTACTAAATCAAATGCGTACAGAGTCAAATGGTAATCCTAATGCAATTAACAACTGGGATATTAACGCTAAAAATGGAACACCTTCTAAAGGACTACTCCAAGTGATTGACCCAACATTCAGACAGTACGCAATGCCAGGATTCAACACTAATATCTATGACCCACTATCTAACATTCTTGCTTCCATCAGATATGCGTTGTCAAGATATGGGTCGCTAAGTGCAGCGTATCGTGGAGTTGGTTATGAAAACGGTGGAATCATCACGAAAGAACACATTGCAAGAGTTGGAGAAGGTAACAAGGAAGAAGTAGTTATACCACTGACAGGCTCAGGACTTAAACGTTCAAGAGCTATGCAACTATTAGCTTATGCCAATGAGAAACTTAACCGCAGCCAGTCAGCACCAGTATCTGGAACTACTTCAAGTACGAACTCTGATATGGCACAAATGCTTCTATTGTTACAACAGCAGAATGAGTTATTAATGGCTATTCTAGCTAAGGACACCAATGTGGTGCTAGATGGTGAGAAGCTAAATAGTAAACTACAAAAGATTCAATCAAGGAATCAGTTAAACGCTAATCGTGATTTGGGATTAGTTTAG

Gene Ontology

Description Category Evidence (source)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0006ce32b3_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6nwAd) rather than this protein.
PDB ID
6nwAd
Method AlphaFoldv2
Resolution 66.22
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50