Protein
- Protein accession
- A0A2Z2E9W3 [UniProt]
- Representative
- 28Azc
- Source
- UniProt (cluster: phalp2_26493)
- Protein name
- Tape measure protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MKQAAYGSLTYNVNINDTKAQSSLRTLKGAIRSTGHEWRSNASAMQAAGDSASALEAKITGLNKEIELQSDYNKRLADALKHANAQTDKEKLAVMRWTNELTRSNAALKRRQSELASARAAEIRFSTGIDKAKTSHKAYTSAIEANEKALLAEGKEEQASAKHKQLLAAKTSALKDELGREEKAFKALKSSASSSNVDINKQSAVVSKAREAYARAREEQRKYSTGLHKMEQYSKSTAEISQSLASRLRAEGKSYSAMAVELKSLIGSRKGLMAQYKTEASELALVKKRSGETSAAYSTQAKKVNELGAKIGETESKIRSLNKRVGLSGTAINSFSDKIGGMQKKYSGVATAMSAVSRGTGYATLGLAAVTKQGVSMATSLQSSFVKTKNLIVKSNTEGTGEINRNLAKMKSNAQSYSKEYGLTQQKIADGYQDLIKRGYSSAQALGSMKTLVKGAIATGDDFNDVTAVSTQTLESFGLRAKSTAQMAKNTSKVVNEMAYAADMTATDFQSLGKGMEYVGNTAHQAGFSISETASAMGILSNNGLESDKAGTGLRKTINSLVSPTDNATGALKKLGLSTSSFTDKNGKMKSMSEIFGILNRHMKGLSGHDRTDVFHAIFGTTGQQAGGILTDNYKSLGKLNDEVARSAKNDYIGGLSAKNMETAQNQFNKFKVTFQALEIEFANVLLPYLTKGAKALTGLMDKFDKLSPSAKKAAGATLLLVPAISAVTGVIAAFIRNSGTIATVISKLFSTSKVKTASSSKSAIAAINEQTAAVKALSAAWGEAGRAAGAEAEEAGAGVGGGSKGKAGKTASEVESVGTREEYKAAKKSGSKTSILKRMAKGQTTTEDAEAISSRVGKIGTTGSKLGKIARGTGSVLKRVPWLQTGLAAMNLIGINKKNAGQKVGSTAGQLGGTIGGGAIGTGLGGLIGGGLGTFFGPAGTVAGAKIGSTVGGLAGSLFGGSKGQDWGGQLGKKIQKSLKNFKMPKMSTVTKSIGDWFGGIGKWISKLKLPSINFGKMFKGFKLPNIGKMFKNFKLPKISGIGKWFSGLFKGLKMPSLSSLGKSIGSAFSKMFSGLGKTGIGKSISKMLAGVGKTMSGWAKGIGKFFAPAVKAIQTPFKKIGKWFKTSPVGKSIVTIGKDIAGVVKGIGKFIAALGAIAGKLAAIGLVKLFKGIGKALGGMGKIFSKVGKTIGRWAKGVRKTIDNMVKPIQKTMGKIGKGISKAWSGALKTVTKFVKNMYHTATKWIGKLVSPLAKTWHSISKTAEKWWKNISGTVGRWVHNLYKTATKWIGRLLSPVAKAWKSISKTAGRWWKSISNAIGSWAHKIYKNVTKWFRNLLSPVARAWKNVSKTIGGWIKGIWKNISKFGNNMASFFKKLPGRISGALKGAWHGIWNAMAGIVNNGVIHPVVKGWNAVAGAINGVEKKIGVGKSFRLSTASYGSAKLSTYANGTPGGPALVNDAKSRYWREAYKLPDGRMGMFPNKRNIIVNLPKGTEIAKGEDARVIQPYLEHNGGKIPAFASATGWLDGIGSAIGGAVHGVGNWFSGVKTKASKLIDNLGKMIKSPAKYLSAMIASPLNALAKGGGIAAKAVGMTGDIVIHSLTSWFKKMLKAGQDEQLVGNVKLGGSVASRARALAKAFKKAYPASNNGGIAGILGNWIQESNLNPSAVNASDHGTGLGQWTFTRETGLRNWLRRHGYAWNSAAGQIGYALNEPGANGMLKAVLRMTNPTAAAQKFFATWESGGNMDASGGARLRNASAVYRYIKGMEHGGLVDKAQMINIAEHNKPEMVVSLTNKDAAIRQLKQSISYLENGNVATTVDTKNVKSADSEKLDQIAAAIQQTNALLQAILSSNDTPNVAYVASQSVVDAVEAQRLAKARYNNLIN
- Physico‐chemical
properties -
protein length: 1887 AA molecular weight: 199374,2 Da isoelectric point: 10,22 hydropathy: -0,24
Representative Protein Details
- Accession
- 28Azc
- Protein name
- 28Azc
- Sequence length
- 823 AA
- Molecular weight
- 86804,65470 Da
- Isoelectric point
- 10,14644
- Sequence
-
MATIGADTIAYKINVQNMQRLSDLIDKVGTLGKGMTGLNRKLDDFGKKANSIHTKGINDAKESLNDVEKSADKATDKTNEFSRAAGKAGKSGNFNHQINGLNSVSSRADRASKAFNRFNSAGNRMAEVGRRTAISSLAIGAGLIKSANDAVKVQHTLRETFNLAKYGGEDAAEAQKNVNKMQDDGNKYSVRYGVSQRKIADGYQELIKRGYSTNQALAAQKSLLQASMASGDDYNDVVHNSTAALESFGMRSNSTSKMLNNTKDVVNKMAYASDLTATDFQSMGVAMEYVGARAHQSGYSLSETASAIGILSNNGLEAQKAGTGLRKVMQSIQSPTKGGAEALKSMGLSAKSFVDQRGNMKSVTETMALLNKQTQGMSKAKKGVIFHALFGATGENAGAILANSSKQLDELNKKVEKSTKNDYVGKLSKSNTMTAQSQIKIFQQSLNSLGIAFATTVLPNLNSALRLFDKLLFKINEMPKSQKKIVTWGIVAVGAIAPVSFALSGLLKTMGALKAAWAFIVPAKAAAIKAPSTLGGGIAPAGVAKAGGSIMGKVAAGASVAGAGIDIGASLYSAITTKNNTTRYKSYGKAVGTAIGTGVGMFFGGPAGAAIGATVGHVVGGWAGKAVHSFSRTKMGHKIGKVLSRELKPANKALNSLGKSASRNLNKNMPAIRRSMRSLGRALAPAGKFVKKYFVLEMKHGIRAFGHILNGAILITVDVVKALAGTFSGLFKMFKGELKLFNDFFTGRWGSLWGDAKTVFSGFGKAVGSIAEGIWNTFSHWFGMIFNLGKDVGNFISDLMGKSGSKAPSLSKGLVKNNKRITN
Other Proteins in cluster: phalp2_26493
| Total (incl. this protein): 13 | Avg length: 1644,1 | Avg pI: 10,15 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 28Azc | 823 | 10,14644 |
| 28wbp | 1007 | 10,20865 |
| 6AmR1 | 890 | 10,04187 |
| A0A1I9KK50 | 1886 | 10,19788 |
| K4I4C2 | 1887 | 10,07159 |
| A0A2K9V595 | 1887 | 10,19788 |
| A0A1S5RCP6 | 1887 | 10,21742 |
| A0A2P0ZKY9 | 1887 | 10,21091 |
| A0A2H4PBB0 | 1886 | 10,12381 |
| A0A2K9VCB0 | 1886 | 10,13264 |
| A0A4Y5FE72 | 1673 | 10,11343 |
| A0AAE9H533 | 1887 | 10,07765 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_20820
6edap
|
22 | 41,9% | 601 | 4.598E-135 |
| 2 |
phalp2_20949
7lC8j
|
5 | 37,4% | 516 | 1.182E-112 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Lactobacillus phage P2 [NCBI] |
1928330 | Maenadvirus > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
KY381600
[NCBI]
CDS location
range 15998 -> 21661
strand +
strand +
CDS
ATGAAACAGGCCGCATATGGTAGCTTAACATATAATGTTAATATTAACGATACCAAGGCTCAATCAAGCCTGCGTACGTTAAAAGGAGCTATCCGCTCCACAGGCCACGAATGGCGCTCTAACGCTTCTGCAATGCAAGCCGCTGGTGACAGTGCAAGTGCTCTGGAAGCCAAGATAACAGGACTTAACAAAGAAATTGAACTACAATCTGATTATAACAAGCGCTTAGCTGACGCCTTAAAACATGCCAATGCTCAAACTGATAAGGAAAAACTTGCGGTTATGCGCTGGACGAACGAATTAACCCGAAGTAATGCGGCGTTGAAACGGCGTCAAAGCGAATTGGCAAGTGCACGTGCCGCTGAAATCAGATTCTCAACTGGTATCGACAAGGCCAAAACCTCACATAAAGCTTATACCAGTGCGATTGAGGCAAACGAAAAGGCTTTACTAGCAGAGGGCAAAGAGGAACAAGCAAGCGCCAAACATAAACAACTCTTGGCGGCCAAGACATCTGCTTTAAAAGATGAACTCGGTCGTGAGGAAAAGGCATTCAAGGCGTTAAAATCTAGCGCTAGTTCATCTAACGTTGATATTAACAAACAGTCTGCCGTTGTATCAAAAGCTCGTGAAGCTTATGCTAGAGCCCGTGAGGAACAGCGTAAGTATTCTACCGGCTTGCACAAGATGGAACAGTATTCTAAGTCAACTGCGGAAATATCGCAGTCGCTTGCTTCAAGATTGCGAGCGGAGGGTAAGAGCTACTCTGCAATGGCGGTAGAATTGAAGTCATTAATTGGTTCTCGCAAGGGTCTAATGGCTCAATACAAGACCGAAGCGTCTGAGTTGGCGTTAGTCAAAAAACGCTCCGGCGAAACAAGTGCGGCTTATTCAACGCAAGCTAAAAAGGTCAACGAACTAGGTGCGAAAATTGGGGAAACGGAATCAAAAATCCGTTCTTTAAACAAGCGTGTTGGCCTATCTGGTACAGCGATAAACTCATTTAGCGACAAAATTGGTGGTATGCAGAAAAAATATTCTGGCGTTGCCACTGCGATGTCTGCTGTTTCTCGTGGTACTGGTTATGCAACCTTGGGGCTTGCCGCTGTGACCAAGCAAGGCGTTTCGATGGCGACCTCTCTACAATCTTCTTTTGTGAAGACTAAAAACTTAATTGTTAAGTCTAATACAGAAGGTACTGGCGAGATTAATCGTAACCTTGCAAAGATGAAATCAAATGCTCAATCATATTCTAAGGAGTATGGATTGACACAACAAAAGATTGCCGATGGCTATCAAGACTTAATCAAGCGTGGTTATAGTTCAGCACAAGCGTTAGGTTCAATGAAGACATTGGTTAAAGGTGCTATCGCCACTGGTGATGACTTTAATGATGTCACAGCCGTTTCTACACAGACTTTGGAATCATTCGGTCTAAGAGCTAAATCTACCGCTCAAATGGCTAAGAATACGTCTAAGGTTGTTAACGAGATGGCCTACGCCGCCGATATGACGGCTACCGATTTCCAAAGCTTGGGTAAAGGTATGGAATATGTCGGTAACACGGCCCATCAAGCAGGGTTTAGTATCTCCGAAACTGCTAGTGCCATGGGTATTTTATCGAACAACGGTTTGGAATCTGACAAAGCTGGTACTGGACTACGTAAGACGATTAACTCATTGGTATCTCCTACTGATAACGCTACTGGCGCTTTGAAGAAATTAGGGTTATCAACAAGTAGTTTTACAGATAAGAATGGTAAGATGAAGTCGATGTCGGAAATCTTCGGCATTTTGAATCGTCACATGAAGGGCTTATCTGGTCACGACCGTACTGATGTATTCCACGCAATCTTTGGCACAACTGGTCAGCAAGCCGGTGGTATCTTAACCGATAATTATAAATCACTTGGAAAACTTAATGACGAGGTTGCTAGGTCTGCGAAGAACGATTATATTGGTGGTTTATCTGCAAAGAACATGGAAACTGCGCAGAATCAGTTTAACAAGTTTAAGGTTACTTTTCAAGCTTTAGAGATTGAGTTTGCAAACGTATTACTTCCATATTTAACCAAAGGCGCTAAGGCGCTGACTGGATTAATGGATAAGTTTGATAAGCTAAGTCCTTCCGCTAAAAAAGCCGCCGGAGCAACGCTTCTATTAGTTCCAGCAATTAGTGCTGTTACTGGTGTAATCGCCGCTTTCATTCGGAACTCAGGGACAATCGCGACTGTTATTTCTAAGTTGTTTAGCACGTCTAAGGTTAAAACTGCCAGTTCAAGTAAATCCGCCATAGCCGCAATTAACGAACAAACGGCCGCTGTTAAAGCTCTTTCTGCGGCATGGGGAGAAGCTGGACGAGCCGCCGGAGCAGAAGCAGAGGAAGCCGGTGCCGGAGTTGGCGGCGGTTCTAAAGGAAAAGCTGGAAAGACAGCTTCCGAAGTGGAGTCAGTTGGCACTCGCGAAGAATATAAGGCGGCTAAGAAGTCTGGTTCCAAGACATCTATTCTGAAACGTATGGCCAAGGGACAGACAACCACAGAAGACGCCGAAGCCATCTCATCAAGAGTTGGCAAAATTGGCACTACTGGTTCTAAGCTAGGGAAAATAGCTAGAGGCACTGGTAGTGTTCTTAAACGTGTTCCATGGTTACAAACTGGGTTAGCCGCCATGAATCTTATCGGTATCAACAAGAAGAACGCCGGCCAGAAGGTTGGTTCCACTGCTGGTCAATTAGGTGGTACAATTGGTGGCGGTGCTATTGGTACTGGCCTTGGGGGCCTAATCGGTGGTGGACTCGGAACTTTCTTTGGCCCAGCAGGAACAGTCGCTGGTGCTAAGATTGGTTCTACTGTTGGTGGACTTGCTGGTAGCTTATTTGGTGGTTCTAAGGGCCAAGATTGGGGTGGACAATTAGGTAAGAAGATTCAGAAGTCTTTAAAGAACTTCAAAATGCCTAAGATGTCAACCGTCACTAAGTCAATCGGTGATTGGTTTGGTGGTATTGGTAAGTGGATTAGTAAGCTAAAGCTACCAAGTATTAATTTTGGTAAAATGTTTAAAGGATTTAAACTTCCTAACATTGGGAAGATGTTTAAAAACTTCAAACTGCCTAAAATTAGTGGAATTGGTAAGTGGTTTAGCGGATTGTTCAAAGGGCTTAAAATGCCTAGTTTAAGTAGTCTTGGCAAATCAATCGGTTCAGCATTTAGTAAGATGTTCTCTGGTCTTGGCAAAACAGGTATCGGAAAATCAATCTCTAAAATGCTGGCCGGTGTTGGTAAAACCATGTCAGGCTGGGCTAAGGGGATTGGTAAGTTCTTTGCCCCGGCTGTTAAGGCAATTCAAACACCTTTTAAGAAAATTGGTAAGTGGTTTAAGACCAGCCCAGTTGGTAAGTCAATTGTAACTATTGGTAAAGATATTGCCGGCGTAGTCAAGGGTATTGGTAAGTTTATTGCCGCCCTAGGAGCAATTGCCGGAAAGTTAGCCGCAATCGGTTTAGTCAAGTTATTTAAAGGAATTGGCAAGGCTCTTGGAGGCATGGGCAAAATCTTCTCTAAGGTCGGCAAGACAATTGGAAGATGGGCTAAGGGTGTTCGTAAGACCATTGACAATATGGTTAAACCAATTCAAAAAACGATGGGTAAAATCGGTAAAGGCATTTCTAAGGCATGGAGTGGCGCTTTAAAGACTGTCACAAAATTTGTTAAGAATATGTACCACACCGCCACTAAATGGATTGGAAAATTAGTAAGTCCACTGGCTAAAACTTGGCATTCCATCTCTAAAACTGCTGAAAAATGGTGGAAGAACATATCAGGAACTGTTGGTCGCTGGGTTCATAATCTCTACAAAACTGCCACTAAATGGATTGGAAGGCTGTTAAGTCCCGTAGCTAAGGCTTGGAAGTCAATTTCCAAAACTGCTGGTAGATGGTGGAAATCTATCTCCAATGCGATTGGTAGTTGGGCTCACAAAATCTACAAAAATGTGACTAAGTGGTTCAGAAACTTGCTGTCACCAGTTGCCAGAGCATGGAAAAATGTATCTAAAACCATCGGCGGCTGGATTAAGGGTATTTGGAAAAATATCTCTAAATTCGGTAACAATATGGCAAGCTTCTTTAAAAAGCTTCCGGGGCGGATTTCCGGCGCACTAAAAGGTGCATGGCATGGTATCTGGAACGCAATGGCCGGAATTGTTAACAACGGTGTTATTCACCCTGTTGTTAAAGGTTGGAATGCCGTTGCCGGAGCCATTAACGGTGTCGAAAAGAAAATTGGTGTTGGCAAGAGCTTTAGACTTAGCACAGCTAGTTACGGTTCGGCTAAGCTAAGCACCTATGCCAATGGTACTCCGGGAGGGCCAGCGTTAGTAAATGACGCTAAGTCACGTTATTGGCGTGAAGCCTACAAACTCCCAGATGGCCGCATGGGTATGTTCCCTAACAAGCGCAATATCATTGTAAACTTGCCAAAAGGTACGGAAATCGCCAAAGGTGAAGACGCTAGAGTAATCCAGCCTTACCTTGAACACAATGGTGGTAAAATCCCAGCATTCGCTTCTGCAACTGGTTGGTTAGATGGTATTGGTAGCGCTATTGGCGGCGCCGTTCATGGCGTTGGTAACTGGTTCTCTGGAGTTAAAACTAAAGCTTCTAAACTTATTGATAACTTAGGTAAGATGATTAAATCACCCGCTAAGTATTTATCAGCTATGATTGCCTCACCATTAAACGCTCTGGCAAAAGGTGGTGGTATCGCCGCTAAAGCCGTTGGTATGACTGGTGATATTGTTATCCACTCCCTTACTAGCTGGTTCAAAAAGATGTTAAAGGCTGGCCAAGATGAACAACTTGTCGGTAACGTCAAACTTGGCGGTAGCGTTGCTTCACGTGCACGGGCGTTAGCTAAAGCGTTCAAAAAGGCTTATCCGGCTTCAAATAACGGTGGTATTGCCGGTATCTTAGGTAACTGGATTCAAGAATCCAACCTGAACCCTTCCGCCGTCAACGCCAGTGACCATGGTACTGGTTTAGGTCAGTGGACGTTTACTCGTGAAACTGGACTGCGTAACTGGTTAAGAAGACATGGCTACGCATGGAACTCTGCCGCTGGTCAGATTGGCTATGCCTTAAACGAACCCGGCGCAAACGGCATGTTAAAGGCCGTATTAAGAATGACAAATCCTACTGCCGCCGCTCAAAAGTTCTTTGCAACTTGGGAATCCGGTGGTAACATGGACGCCTCTGGTGGTGCCCGTCTACGGAACGCCTCTGCTGTATACCGCTACATTAAGGGTATGGAACATGGCGGACTTGTTGATAAAGCTCAAATGATTAACATTGCGGAACACAACAAACCAGAAATGGTTGTGTCTTTGACCAACAAAGACGCCGCAATTCGTCAATTGAAGCAATCAATCAGTTATCTTGAAAACGGGAACGTTGCCACTACCGTAGATACTAAAAATGTAAAGTCTGCTGATAGCGAAAAGCTCGACCAGATTGCGGCCGCTATCCAGCAAACGAACGCATTATTGCAAGCTATCTTGTCATCAAACGATACGCCAAACGTTGCTTACGTTGCTTCTCAAAGTGTCGTAGACGCTGTTGAAGCACAACGCCTAGCTAAAGCAAGATACAACAACTTGATTAACTAG
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0098003 | viral tail assembly | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(28Azc)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50