Protein

Protein accession: A0A514U222 [UniProt]
Representative: 13lZR
Source: UniProt (cluster: phalp2_36441)
Protein name: Tape measure protein
Lysin probability: 100%
PhaLP type: VAL (probability: 99%, predicted by ML model)
Protein sequence
MQANLARGMTPAALATNQAAFRNSVRSLGSMTTETMRVASASEMYTKALLKEDITLRQALRNRQTFNQVLREQYQLSRAASMQWSTNTRGGSTMDLIVPRDAPERLGRFRQTLGAVRAGTISTNVALGEMAIKMGLVSQVANSAAANMIKWGKNTQWAGRQLMVGLTLPVVAAGAAMGKLAYDVEAQMTRINKVYDFSANKDQNAAKNAQETATLRANSMRAAKTAAEQYGASMKDTLTVEANLAATGEKGNELIGKTTEVMRLATLGEMDYQKALDTTITLQSVYGQNTSQLAESFNYMNSVENATSLSIEDFSKAIPKVAAPMKTLNGSLQDVGTLLVAMRARGINAVEGANAIKASFTRVLKPTKQARDMFELLTKQSLTELTDKTNGEVIPTFQALQKATAGLTGKKRQALFAALFGTQQTTRLQAIVEEMANLEDETTQVGKAYKIAQEESSTWADSASREMEQLQQSASGRLKIAIESLKVQLAEAGKPFLEIAGYIVRGVTAIVKAFNDLPGPVKSFATIAIIAGAIAGPIIMLVGLFANLAGQAVKMGAGLLGLVFRFRALLPEQVASRLASMQQTAAMQTQQGQTQALIATVNELTAALGRATAAQQGLTQAQRTGQATSGIRPGVVSGAPLPPVVPTTPTVTGPITMTGAGYRDATGRALTQAEVRNWQAAQAASASINQNATNTRRTWQQMGSSFQNVALAAGFMGTMVTDSGTMANNISQMLIPLALFGPMLIGPITKFAGAIKGLVMPAISAIGSAATSAFGTVATRGRAAMGAVRSSIGGAVAAMGGLSAVMGTVLAVALAIGAAWYIINKNVAASRKEQENIEKSSEAWAKTLGFTYTEQQKIVAQGTHNVSSLNDKMNEFKKNNKDAYNDIQKFYDSKEAEKWGRAIEEGVKVRLHGGTKSAAEEAVKTSLAIMGQRYSNAEFQYKLKAQIDFDDVSQVIEKRLKDAATDMRDATNLKFDQSKSESFGRFFANPGTIQQKAGEAMKQNAKDLWDIYDNTQDAEKKKVFDKIATSVNSESVKLFETYKKKYSKEFKKMGIETFKDWTDYLNKDSNIESGVADISLGQELGLSDGEIHKVQRAADAVKGFSKEFADMQGIPKGKAGVNFDDLSKEIPELQKTKQELWSVKQAEDGYYTALRERSRVGVETSNAEKLNQLNIYRRLAGLEEATSLEQGFQKELDKSTDSLLENMDAWEANSDNIGDFTDAYKSVMSNTRDEALAQAESLLNDQMQGEIDGINARAEARSKALDDAQERADKKFDERQDKTEKRFEAKQEALDKSYEKKQKAFDKRWDNIMENHDKKWEGRTDAINKAYDAKVKKIDAAIKAEEDAENKRQEIFEREKTRIQRAAQMANQNIDFNMAINSGNLDEAAKIGNNMQADLDSWAAEDAAGASQSASDKKIEGLNKQKESVESERDRRLKVIQQMEEAEKKQLQARKEREQEALNAQREAANKALQIARETALKKIQIEREAYNKGIQAQREALQKETNDKIKATQRKYEAAKRAIELELATLKAFVPRNKKELDEQIKKIEQAYSKYGVNLKGKGNDWSKYIKDSLNKNVKVAAEDLKNKIAWDKIAKSVANEISEGAFGLTIGQFSDWVSTGKLPKSGLNEKSGKNKSLDAHHEGGLIGGRSGGSGRTGYSGGRAQSEIDIRAKKGEFMMKDKAVDKYGLDFMENINSGKFGTGGIGGAEGMGLPGLLGAGMAGMMQALIQKGIQQGSDMAMMMGIDGMGIPGAAGMYGGVGLNAEQLQNAATIIATGKGMGATNTDLIVSIMTAMQESTLRNLNYGDRDSLGLFQQRPSQGWGTPEQIRTPSYAARKFFEHLLAMKGRAKLPLWEQAQRVQRSGFPMAYAKWEQMARAVVAGTGFKPFGSGAKRRPVNGPVSRDYAHHSNLPRATDFGVGVGTPVYAAMNGNVTTSTDLRGNGNGGYRSYGRYVVVQNGSEKTLYAHLSRRNVGVGSSVRAGQLLGYSGNTGNSTGPHL
HFETWRGGRTVPPGTFGIPGLAVGGKIKYDNTIANLHKNEAVLTAPLTAKLESGIDKIDSGGGNTYNFNINAEAINTEIDFEKVVTKALDRIESKKGRSRVVK
Physico-chemical properties
Protein length: 2103 AA
Molecular weight: 228210,9 Da
Isoelectric point: 9,35
Hydropathy: -0,47
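The hydropathy value above is most likely a grand average of hydropathy (GRAVY), although the record does not name the scale. A minimal sketch, assuming the Kyte-Doolittle scale, shows how such a value could be recomputed from a sequence and how the decimal-comma numbers in this record could be read. The names `KD`, `gravy` and `parse_decimal_comma` are hypothetical helpers, not part of any PhaLP tooling.

```python
# Kyte-Doolittle hydropathy values per residue.
# Assumption: the record does not state which scale it uses.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues of seq."""
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue letter.
    return sum(KD[aa] for aa in seq) / len(seq)


def parse_decimal_comma(text: str) -> float:
    """Read a number written with a decimal comma, e.g. '-0,47'."""
    return float(text.replace(",", "."))
```

Calling `gravy` on the full protein sequence and comparing the result with `parse_decimal_comma("-0,47")` would be one way to check whether the assumed scale matches the one used for this record.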
Representative Protein Details
Accession: 13lZR
Protein name: 13lZR
Sequence length: 565 AA
Molecular weight: 59375,74870 Da
Isoelectric point: 9,44824
Sequence
MNVALGEMAIKMGLVSQVANSAAANMIKWGKNTQWAGRQLMVGLTVPVLAAGAAMGKLAYDVESQMTRINKVYDFSATKDQNAAKNMQETATLRANSMRAAKTAAEQFGASMKDTLSVEANLAATGEKGNELIGKTTEVMRLATLGEMDYQKALDTTITLQSVYGDNTSKLAEDFNYMNAVENATSLSIEDFSKAIPKVAAPMKTLGGSLQDVGTLLVAMRARGINAVEGANAIKASFTRVLKPTKQAKDMFEILTKQSLTELTDKTNGEVIPTFVALNKATENLRGKNRQALFAALFGTQQTTRLQAIVEEMGNLNDETTQVGKAYKIAGQEQSAWADSASREMEALQNSASGRLKIAIESLKVQLAEAGKPFLEVAGYIVGAVTKIVKAFNELPGPVKTFASIAVITGAIAGPIIMLAGLFGNLLGQALKLGAGMLGLVFRFRALLPEQVAARLASMQAGVAAQTQTGQIAALTAVINELTIALGRATGAQVGLNNAQARGGLGVRPGTVSGAPIPVNQPVPTTPVQGPIVTTGAGYRDATGRTLTAAEVRNWQAAQGILCKH
Other Proteins in cluster: phalp2_36441
Total (incl. this protein): 46; Avg length: 2067,9 AA; Avg pI: 9,36

Protein ID Length (AA) pI
13lZR 565 9,44824
A0A482JH07 2105 9,37404
A0A385UH89 2093 9,41595
A0A345M9T4 2084 9,31099
A0A222Z1V8 2095 9,31724
A0A223FZV2 2107 9,39486
A0A0A0RQH0 2103 9,42400
A0A222YU16 2104 9,12719
A0A222YWR8 2107 9,37230
A0A345M8I8 2102 9,23363
A0A345MH93 2099 9,39725
A0A345MGA5 2102 9,23601
A0A514DJZ4 2095 9,42155
A0A5Q2WG41 2097 9,44509
A0A6M3SXT0 2110 9,26651
A0A221SAV7 2103 9,42155
A0A222YZS8 2095 9,31731
A0A222Z137 2107 9,38249
A0A345M7U7 2099 9,40969
A0A411B617 2105 9,36985
A0A411C4F3 2093 9,42671
A0A411CFR0 2093 9,42948
A0A4Y6EQB3 2099 9,41472
A0A5P8DEF3 2099 9,39725
A0A5Q2WD19 2099 9,41221
A0A5Q2WJF7 2099 9,41221
A0A5Q2WL06 2107 9,37230
A0A5Q2WQA8 2099 9,41221
A0A7G4AW01 2133 9,09393
A0A7G9UWA1 2105 9,21203
A0A7G9W1G7 2105 9,17741
A0A7T0Q346 2099 9,40969
A0A890UXX3 2099 9,41221
A0A890V3G8 2107 9,37230
A0A9E7IIV2 2099 9,40969
A0A9E7IQ10 2099 9,40969
A0A9E7TRZ6 2099 9,39725
A0A9E7TT77 2099 9,39970
A0AA49BT45 2103 9,33123
A0AA49BT64 2094 9,48228
A0AA49EHV1 2107 9,40705
A0AA96H5P4 2103 9,33368
A0AA96H7H5 2103 9,35593
A0AAE8YE60 2103 9,44251
A0AAX4LVN5 2097 9,30313
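The rows above use a decimal comma for pI, and the representative (565 AA) is far shorter than the other members (roughly 2100 AA), which pulls the cluster average length down. A minimal sketch of parsing such rows and separating full-length members from short ones follows. The function name `parse_row` and the 1000 AA cutoff are illustrative assumptions, and only three rows from the table are used.

```python
from statistics import mean


def parse_row(line: str) -> tuple[str, int, float]:
    """Split 'ID length pI' and convert the decimal-comma pI to float."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


# Three rows copied from the cluster table, for illustration only.
sample = [
    "13lZR 565 9,44824",
    "A0A482JH07 2105 9,37404",
    "A0A385UH89 2093 9,41595",
]
rows = [parse_row(line) for line in sample]

# Hypothetical cutoff separating full-length members from fragments.
MIN_FULL_LENGTH = 1000
full_length = [r for r in rows if r[1] >= MIN_FULL_LENGTH]

mean_length_all = mean(r[1] for r in rows)
mean_length_full = mean(r[1] for r in full_length)
```

Comparing `mean_length_all` with `mean_length_full` makes the effect of the short representative on the average visible.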
Similar Clusters (pHMM search)
# Cluster (representative) Members Identity (%) Alignment length E-value
1 phalp2_30200 (3W9f2) 97 39,0 371 2.486E-76
2 phalp2_23033 (47kPP) 16 35,5 411 7.074E-71

Domains [InterPro]

No domain annotations available.

Taxonomy

Role Name Taxonomy ID Lineage
Phage Streptomyces phage Braelyn [NCBI] 2593356 Stanwilliamsviridae > Samistivirus > Samistivirus braelyn
Host No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MN096371 [NCBI]
CDS location: range 33863 -> 40174, strand +
CDS
ATGCAGGCTAATTTGGCACGCGGAATGACGCCAGCAGCTCTGGCAACAAATCAGGCGGCATTCAGAAACAGCGTTCGCAGTCTCGGTAGCATGACTACTGAGACTATGCGTGTTGCATCTGCGTCTGAGATGTACACCAAGGCTCTGCTAAAGGAGGATATTACTCTTCGTCAGGCTTTGAGAAACCGACAGACATTTAATCAGGTTCTCAGAGAACAGTACCAGTTGAGTAGAGCCGCTTCAATGCAGTGGAGCACAAACACTCGTGGCGGCTCTACCATGGACCTTATCGTTCCTCGTGACGCTCCAGAGCGTCTAGGACGATTCCGTCAAACACTTGGCGCAGTACGTGCTGGAACCATTTCTACAAACGTTGCACTGGGCGAGATGGCCATCAAGATGGGTCTTGTCAGCCAGGTTGCGAACTCCGCAGCAGCCAACATGATTAAGTGGGGTAAGAACACTCAGTGGGCTGGTCGTCAGCTCATGGTTGGTCTAACCCTTCCAGTTGTTGCGGCCGGTGCTGCAATGGGTAAGCTTGCATATGACGTAGAAGCACAGATGACTCGTATCAATAAGGTATACGACTTCTCTGCCAACAAGGACCAGAACGCAGCAAAGAATGCACAGGAAACTGCAACCCTCAGAGCTAATTCTATGAGGGCAGCTAAGACTGCCGCAGAGCAGTATGGTGCATCAATGAAGGACACTCTAACCGTAGAAGCTAATCTGGCAGCGACCGGTGAAAAGGGTAATGAGCTTATTGGTAAGACAACCGAGGTTATGCGTCTAGCCACCCTTGGTGAAATGGATTATCAGAAGGCTCTGGACACGACTATTACCTTGCAGTCCGTTTATGGACAGAACACAAGTCAGCTAGCAGAGTCATTCAACTACATGAACTCTGTTGAGAACGCAACCTCTCTATCTATCGAGGACTTCTCAAAGGCTATTCCAAAGGTCGCTGCTCCAATGAAGACTCTGAACGGTTCTTTGCAGGACGTTGGTACATTGCTTGTTGCTATGCGTGCGCGTGGTATCAATGCAGTCGAAGGTGCTAACGCAATTAAGGCATCATTCACCCGAGTCCTCAAGCCTACAAAGCAGGCTAGAGACATGTTCGAACTTCTCACTAAGCAGTCATTGACAGAGCTTACTGACAAGACGAATGGTGAGGTTATTCCTACCTTCCAGGCATTGCAGAAGGCAACGGCCGGTCTTACAGGTAAGAAGCGACAGGCTCTATTTGCAGCTCTATTCGGTACACAGCAGACCACCCGACTACAGGCAATCGTTGAAGAAATGGCTAACCTTGAGGACGAAACTACTCAGGTTGGTAAGGCTTACAAGATTGCTCAGGAAGAGTCTTCAACTTGGGCAGATAGTGCCAGCCGAGAAATGGAGCAGCTACAGCAGTCTGCATCAGGTCGTCTAAAGATTGCTATTGAGTCCCTAAAGGTTCAGCTTGCCGAAGCGGGTAAGCCATTCCTTGAGATTGCAGGATACATCGTTCGTGGTGTGACTGCGATTGTCAAGGCATTTAACGACCTTCCAGGACCAGTGAAGTCATTCGCTACCATTGCAATTATTGCCGGTGCTATTGCCGGTCCTATCATCATGCTCGTAGGTCTGTTCGCCAACTTGGCAGGACAGGCAGTTAAGATGGGTGCAGGTCTTCTTGGACTAGTATTCCGATTCCGTGCACTTCTACCAGAGCAGGTAGCGTCCCGTCTTGCATCAATGCAGCAGACGGCAGCAATGCAGACTCAGCAGGGCCAGACTCAGGCTCTCATCGCAACGGTAAATGAACTTACCGCAGCCTTGGGACGAGCAACAGCGGCACAGCAGGGATTGACTCAGGCGCAGAGAACTGGCCAGGCAACATCAGGAATTCGACCTGGTGTAGTTTCTGGTGCGCCTCTTCCACCTGTAGTTCCAACGACACCAACTGTAACTGGACCAATCACAATGACTGGTGCGGGATATCGTGATGCTACGGG
ACGTGCTCTTACTCAGGCAGAGGTAAGAAATTGGCAGGCAGCTCAGGCAGCCTCTGCAAGCATTAATCAGAATGCTACGAACACTCGTCGCACCTGGCAGCAAATGGGAAGTAGCTTCCAGAATGTTGCTCTGGCAGCAGGATTCATGGGAACAATGGTTACCGACTCCGGAACCATGGCTAACAACATTTCTCAGATGCTTATCCCTCTAGCTCTATTCGGTCCAATGCTTATCGGTCCTATCACCAAGTTCGCTGGTGCTATCAAGGGACTCGTCATGCCAGCAATCAGTGCAATTGGTTCTGCTGCGACATCAGCATTCGGAACGGTAGCTACCAGAGGTCGAGCAGCCATGGGAGCAGTTCGTAGCTCAATTGGTGGGGCAGTTGCAGCCATGGGTGGACTAAGCGCGGTAATGGGAACGGTCCTTGCAGTAGCGCTTGCGATTGGTGCAGCCTGGTACATCATTAACAAGAATGTAGCAGCCTCTCGTAAGGAGCAGGAGAACATTGAAAAGTCCTCCGAGGCATGGGCAAAGACCCTAGGATTTACTTACACTGAGCAGCAGAAGATTGTTGCACAGGGCACCCACAATGTTTCTTCTCTGAATGACAAGATGAACGAATTCAAGAAGAACAACAAGGACGCCTACAACGACATCCAGAAGTTCTACGACTCAAAGGAAGCAGAGAAGTGGGGTCGTGCTATCGAGGAAGGTGTAAAGGTTCGTCTTCACGGTGGTACAAAGAGCGCGGCGGAAGAAGCCGTAAAGACCTCTCTTGCTATCATGGGTCAGCGTTACTCCAATGCCGAGTTCCAGTACAAGCTAAAGGCACAGATTGACTTTGACGACGTTTCTCAGGTTATCGAGAAGCGTCTAAAGGATGCTGCCACGGATATGCGTGACGCTACTAACTTGAAGTTTGACCAGTCTAAGTCTGAGTCCTTCGGACGATTCTTCGCAAACCCAGGTACCATTCAGCAGAAGGCCGGTGAGGCAATGAAGCAGAATGCAAAGGATTTGTGGGACATCTATGACAACACTCAGGATGCAGAGAAGAAGAAGGTATTCGATAAGATTGCCACCTCTGTAAACTCTGAGTCTGTCAAGCTCTTTGAGACCTACAAGAAGAAGTATTCCAAGGAATTCAAGAAGATGGGAATCGAGACCTTCAAGGACTGGACTGACTATCTCAACAAGGACAGCAACATCGAATCTGGTGTTGCAGACATCTCCCTTGGACAAGAGCTTGGCCTTTCAGATGGCGAGATTCACAAGGTTCAGCGTGCAGCAGACGCAGTAAAGGGATTCTCCAAGGAATTCGCTGATATGCAGGGTATTCCAAAGGGTAAGGCTGGAGTTAACTTTGATGACCTTTCAAAGGAAATCCCTGAGCTACAGAAGACCAAGCAGGAGCTGTGGTCTGTAAAGCAAGCTGAGGATGGATACTACACCGCACTTCGTGAAAGAAGCCGTGTTGGTGTAGAGACCAGCAATGCCGAGAAGCTTAACCAGCTTAATATTTACAGAAGATTGGCCGGTCTCGAAGAGGCTACCTCTCTTGAGCAGGGATTCCAGAAGGAGCTAGACAAGTCAACCGATTCCCTTCTTGAGAATATGGACGCATGGGAAGCGAACTCCGATAACATCGGAGACTTCACTGACGCATATAAGAGTGTAATGTCAAACACTCGTGACGAAGCATTGGCACAGGCAGAGTCTCTGCTCAATGACCAGATGCAGGGAGAAATCGATGGCATCAACGCTCGTGCAGAGGCACGTTCAAAGGCCCTGGATGACGCTCAGGAAAGAGCAGATAAGAAGTTCGATGAGCGACAGGACAAAACTGAAAAGCGTTTTGAGGCAAAGCAGGAAGCCCTTGACAAGAGCTACGAGAAGAAGCAGAAGGCGTTTGACAAGCGCTGGGATAACATCATGGAGAACCACGACAAGAAGTGGGAGGGCCGTACTGATGCTATCAACAAGGCTTATGACGCAA
AGGTCAAGAAGATTGATGCCGCAATCAAGGCTGAGGAAGACGCAGAAAACAAGCGTCAGGAAATCTTCGAGCGTGAGAAGACTCGTATTCAGCGCGCTGCCCAGATGGCCAACCAGAACATCGACTTCAACATGGCTATCAATTCTGGTAACCTCGATGAGGCTGCCAAGATTGGTAACAACATGCAGGCAGACCTGGATTCATGGGCTGCCGAAGATGCCGCTGGTGCAAGCCAGTCTGCTTCTGACAAGAAGATTGAGGGTCTAAACAAGCAGAAGGAATCTGTCGAATCCGAAAGAGACCGTCGTCTCAAGGTCATTCAGCAGATGGAAGAGGCTGAAAAGAAGCAGCTACAGGCACGCAAGGAGCGTGAGCAGGAAGCTCTAAACGCTCAGCGTGAGGCTGCCAACAAGGCTTTGCAGATTGCTCGTGAGACTGCGCTCAAGAAGATTCAGATTGAGCGTGAGGCTTACAACAAGGGAATTCAAGCTCAGCGTGAGGCATTGCAGAAGGAGACCAATGACAAGATTAAGGCAACGCAGCGCAAGTACGAAGCTGCAAAGCGTGCCATTGAACTTGAACTTGCAACCCTTAAGGCATTCGTTCCGCGCAACAAGAAGGAACTCGATGAGCAAATCAAGAAGATTGAGCAGGCTTACAGCAAGTACGGTGTAAACCTAAAGGGTAAGGGTAATGACTGGTCCAAGTACATCAAGGACAGCCTTAACAAGAATGTCAAGGTAGCAGCCGAAGACCTAAAGAACAAGATTGCATGGGACAAGATTGCAAAGTCTGTCGCCAATGAAATCTCTGAGGGTGCATTCGGTCTAACCATTGGTCAGTTCTCCGATTGGGTTTCAACAGGTAAGCTTCCTAAGTCTGGTCTGAATGAGAAGTCTGGAAAGAACAAGTCTCTCGATGCTCACCACGAAGGTGGACTTATTGGCGGAAGGTCTGGAGGTTCTGGACGTACCGGATATTCCGGTGGTCGAGCACAGTCCGAAATAGACATTCGAGCCAAGAAGGGTGAGTTCATGATGAAGGACAAGGCTGTGGATAAGTACGGTCTTGACTTCATGGAGAATATTAACTCTGGAAAGTTCGGAACTGGTGGAATTGGTGGAGCCGAAGGAATGGGTCTACCAGGACTTCTTGGTGCAGGCATGGCAGGAATGATGCAGGCGCTAATTCAGAAGGGTATCCAGCAGGGTTCCGACATGGCCATGATGATGGGTATTGATGGAATGGGTATTCCTGGAGCGGCCGGAATGTATGGTGGCGTTGGTCTAAATGCAGAGCAGCTACAGAATGCAGCTACCATTATTGCGACTGGTAAGGGAATGGGTGCAACAAACACCGACCTTATCGTATCCATCATGACTGCTATGCAGGAGTCAACCCTAAGAAACCTCAACTACGGTGACCGTGACTCTCTTGGTCTATTCCAGCAGCGTCCTTCACAGGGTTGGGGTACTCCGGAGCAGATTAGAACTCCTTCTTATGCAGCACGTAAGTTCTTTGAGCACCTACTTGCAATGAAGGGTCGAGCAAAGCTGCCTCTATGGGAGCAGGCTCAGAGAGTTCAGCGTTCTGGATTCCCAATGGCTTATGCAAAGTGGGAGCAGATGGCTCGTGCGGTTGTTGCTGGAACTGGATTCAAGCCATTTGGTAGCGGTGCAAAGCGTCGTCCGGTAAATGGCCCGGTATCCAGAGACTATGCTCACCACTCCAACCTACCTAGAGCTACTGACTTTGGTGTAGGAGTAGGTACACCAGTCTATGCAGCAATGAACGGTAATGTCACCACCTCTACGGACCTACGTGGAAACGGCAATGGCGGATACCGCTCATACGGTCGATACGTAGTCGTTCAGAACGGTTCTGAGAAGACTCTGTATGCTCACCTTAGCCGCAGAAATGTAGGTGTCGGTTCAAGCGTTCGTGCAGGACAGCTCCTTGGTTACTCTGGTAACACAGGTAATTCAACTGGTCCTCACCTT
CACTTTGAAACATGGCGCGGTGGCCGAACAGTTCCTCCAGGCACATTTGGAATCCCTGGTCTTGCAGTAGGTGGAAAGATTAAGTATGACAACACCATTGCAAACCTTCACAAGAATGAGGCAGTCCTAACGGCTCCACTCACAGCAAAGCTTGAGAGTGGAATTGATAAGATTGACTCAGGAGGCGGTAACACGTACAATTTCAATATCAATGCAGAGGCTATCAACACTGAGATTGATTTCGAAAAGGTTGTTACCAAGGCTCTCGACAGAATCGAAAGCAAGAAGGGAAGGAGCAGGGTCGTCAAGTAA
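
The CDS range and the protein length can be cross-checked with simple arithmetic. A minimal sketch follows, assuming the coordinates are 1-based and inclusive (the usual GenBank convention, not stated in this record) and that the final codon is a stop codon, which matches the trailing TAA above. The function names are hypothetical.

```python
def cds_length_nt(start: int, end: int) -> int:
    """Nucleotide length of a CDS given 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(start: int, end: int, has_stop: bool = True) -> int:
    """Number of encoded residues, optionally excluding the stop codon."""
    n = cds_length_nt(start, end)
    if n % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = n // 3
    return codons - 1 if has_stop else codons


# Coordinates from this record: range 33863 -> 40174, strand +
nt = cds_length_nt(33863, 40174)      # 6312 nucleotides
aa = protein_length_aa(33863, 40174)  # 2104 codons minus the stop codon
```

Under these assumptions the range gives 6312 nt, that is 2104 codons, or 2103 residues plus a stop codon, which agrees with the protein length listed under the physico-chemical properties.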

Gene Ontology

GO ID Description Category Evidence (source)
GO:0004222 metalloendopeptidase activity molecular function None (UniProt)
GO:0016020 membrane cellular component None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (13lZR) rather than this protein.
PDB ID: 13lZR
Method: AlphaFold v2
Resolution: 74.36
Chain position: -
Model confidence (pLDDT):
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
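
The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping is shown here. The function name `plddt_band` is hypothetical, and the handling of scores that fall exactly on 50, 70 or 90 is an assumption, since the legend uses strict inequalities on both sides.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the band names used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    # Assumption: boundary values (90, 70, 50) fall into the lower band.
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"
```

If the value 74.36 listed under Resolution is in fact a mean pLDDT for the AlphaFold model (an assumption, the record does not say so), `plddt_band(74.36)` would place the representative structure in the High band.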