Protein
- Protein accession
- A0A8A1VCL1 [UniProt]
- Representative
- 1gM3W
- Source
- UniProt (cluster: phalp2_25084)
- Protein name
- Tape measure protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MPIYDAGDAAINIRPGGLGEFRKELDAFLRSVDARLAVAIHPNIAQAKADIERWRHEEQAKHVDMRVDAKLAEAQAEVARWRAAQEADPVRIQIKADQDQANREISRAVGGAKSKALEGLKKDLQLNLKIAGVAGIPSAITGVAGLTSGLVELSQAALLVPGALAGIGAAVASLVTGLSGVKEAFSAYSKAQDDSVESTRQASENQRAAERAARDYSRAQQDVTIAVRDAQNEIRDLNLELKGSALDEADAILNLQQAQEDYAKGGFRTQLEQQRAQLRILQSEQRVEEVRNRNSELAQKTADANARGVAGAPGVVSANDRLASANDAVQAAQQRTVTSTLGVEKALAHLSPRAKEFVTDMSAMRGQLREQFKFPAQDALFDGLSAKVKSFVAADLPLITKGFAGVDTGINHTVGNLLDSLKSPGGQGILTRILGNTSAAQEQFSHAIDPLVKGLGTLAAAGSDSLPRLAKAFTDASTRFANFIDAADKDGRLAHWIDNGINAARELGHVFGNIGSTLASVSRAVGGDFMTRLDHVTKKMADFFQSSKGQQDLKGIFAQGHAELDKWLPVLKELGPIIGAVFKSAADLTGGWLTVLTPIAGILRTQPGLLRAVVDAVLLWKTITPITRAAGDGFRAMGDFLARVRNETENIATSAAKAESAAAPKYANLKEYTAAIRREAEAMGTAMPGAAAVLQGYYRQAEQGAKNSADAMESGMGRSARAAEEAAGNKAKGVGKFSSAVGALSAGSGPLMALTIVGIPLVEEFVTKFMRGMDDARNRVEQLRQTTQDLIGTLDSVTNMTGLQSRIEVAKKLQAGNTGPGNGIEGSALDAARDLGIGGPGGQDLVTAALPGGDAKYDEIMKPLRDKVRPAVDDFIKNQGLDLGRLGIDEPTLVDAFLGVPAALDKVDSIRDLGGTHSPIDLGTLQSQVQARDDDSAKAAIVGQNLNKQTKGAQGAVSQAQQAQAAAVPQPHLKGEFAPLFPDAVVNTDGSSTTIVSTAPPNPGMHLSEGTESKQGVPPNQDKWTYRLSPDDARRLTFDQGGWTPSGKGPGPTGGYIAEVHQDEFVINRDAARRVPSSFLHALNSGAVDPSSLPGYEPGGQVGPDGLGIVRHLTDGALVKPGPLPGADNDPGSLIPQRLFGGNAIVGLGGTGAPDPTNIDSWLNAPGMANPKNVAYKAGSILLSGVLGFFGLDQSILSPSNPWNQAIQQTVGGLSKGIGPYAQANQTAGINMQNYIDGNIPAILANAQSGAYTGVPSGVQGMSGSQESGTASAMSGSPSGAGVERWRPMVRETLQKYGPQFGITNYKAWEDAIMRQIATESSGNPSAINLDDSNAKAGHPSQGLLQFIPSTFAANNISGGQFLDPAAQMAAIIPYVIRTYGMASDGSPNRIGRGINYADGGTVIGPTGVDVINANLTAGETVINRSASNRYGTAALASVNAGTARIDPTPTPGGMQAAGAASAIAGAAQPIPIKPLQPNAPIVAPPAGAGPAVAPPTAAAAPPPQSPAAQQPQQATGPQETPQQGASAQPSIAPAPTSEDHELPALKKGITEGAAQLGTLLQAAAGAGGSMGGGMGAMAGPMIQGATQMGGHLANDVANIFSSALVGNLGDNTTTGAYGAPVVSAPPQPSTVNRNTYFGDVSASDPTDFLNKQRLYEQQREQSLVSYVP
- Physico‐chemical
properties -
protein length: 1669 AA molecular weight: 172872,9 Da isoelectric point: 5,70 hydropathy: -0,23
Representative Protein Details
- Accession
- 1gM3W
- Protein name
- 1gM3W
- Sequence length
- 1509 AA
- Molecular weight
- 158034,73000 Da
- Isoelectric point
- 5,95777
- Sequence
-
MPDYTAGKAKILLEPIARNFYNNARAAIRASVANKPLHADVRLRPVGDGFATEADKKIKATHDLVKRVELKPILAPTFRTSVRDQVNAATQNLHLSVQLSATVDFTRANAQILAWRRRQEAVPLRIHVSPQQGGITGVTTGAGSRNWRARNIPGSAGAGPQGGNFAATPQINSRQFQAQMIAVMKSAVQQANGPSAPKINPQINGNIGSNAQLKRQMANLQRDFQENFRLRIQPTFAGVQLRLPLGSLGTQVIPAAVAGIASLIASLHQLSQAGLAVPGGLAVAGASIGTFALGLAGIGDAWKALTAEQDRSAEQATTDAARMAQAYDSVRNASVDVTRAQKDLNDARKDARRDLEDLHREERSNNLSVAEAALNLREARNELAKGGFKGTLDRDRAVLRELQAQESLSVALERQKRNKEDVTEADKKGINGSDAVVSAQENLIRSTQALTAATTQLRIESSGAANAADKVKAAMENLGPETAKVVETVWGMKPAFQDLRRTVAGNMFKDFSQNIEVLANTSMPLLQRRLGGMGTVWNETLKQMTSTLGTSSTQSIFDRIFGNTEETQRRLNRSIDPIIRGMGTLAAAGSDAMPRLALGIEDVTNKFANWVQASDDNGRLDKMINSGIDGFSTLGRIISNVGTSVSGIGKAFGGGFLNMLENLTKKLSNFLNSARGQEDLRIFINNVKTDFEAWRPILQDLAPIFGTAMEAARQAIQAVLPIINIFTEVLEHSPALVTAVATAFLGWSAIRPIISGVEVGVGLMSTALVGLGTGFDGAKKKGADAAKEIDTAFGKVGKAGSGVSNAAQKLAMVGSVLGPYGAIVGGVAVTITALLSLMDTHDDAADAADRQRRSTDRLADSLERVSKAAGAATIAAVVDEMADFQPDASAGLKGNVLDATSKLGIDANAFAASVLPGGSAARGGFLEQFKAKVQPTVEAELKRMGLDLDPARVTSAFLGNQDDVKWFTSEISRLSSISEAEGNGAITYDLGNVAEAIQRAGGTTAAAGLAGQFLNSRAPQVTSAQDLETQKQQATTGKLPTLSPNGAAFFKGVQVTKIGFDGDVVGAQVVGLSDPAAKNLEDRNNQVSNPYRSGFGENTRDITFSEADSRELFSYAAGGPTPSGKGRGPSGGHIVEVHDDEWVLPPHARHAIGDERLWQLTGNRNLPRRRGFDLGGPGNLPVPLAPPIIPPVDPVAPAIPDIVAPAIAPPTPAVPQAGLPDPNAHGTSTPGVGPLPGPPELQNAAPGVTAAPMDPSDPAYGSQYAPGADGQPTDLGGQFLNAWFPWLGTVQSAMKGGEGAAQLGLGDSSYDPLSYVAEWGGNFISNFANTLFGGVLGFFGMSPDNAYFNAIRQVTGFYGDKFGPMIAGGGADQYVDGATQDTFNQGVDRYANYPQQVQLSDGSIVSIAPPAGVAGSQLLASTATVGDPFEGSGVIAAGSPGAQAKIDASRAPNAQVIPDANIIAYIRQQAEAFGLTMGDEGKAGGDPYRTSESASLHTIGMAGDIFGAP
Other Proteins in cluster: phalp2_25084
| Total (incl. this protein): 2 | Avg length: 1589,0 | Avg pI: 5,83 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 1gM3W | 1509 | 5,95777 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_27883
8IrIz
|
99 | 29,5% | 1252 | 1.737E-124 |
| 2 |
phalp2_34982
4HM46
|
2 | 27,7% | 1193 | 3.010E-93 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Mycobacterium phage prophi91-3 [NCBI] |
2813226 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MW584187
[NCBI]
CDS location
range 24947 -> 29956
strand +
strand +
CDS
ATGCCGATCTATGACGCTGGCGATGCCGCGATCAACATCCGGCCGGGGGGCCTCGGCGAGTTCCGCAAGGAGCTGGACGCGTTCCTGCGATCAGTCGATGCACGGCTCGCGGTCGCCATCCACCCGAACATCGCGCAAGCCAAGGCTGATATTGAGCGCTGGCGCCATGAGGAGCAGGCCAAGCACGTCGACATGCGTGTGGATGCCAAGCTCGCCGAGGCGCAGGCCGAGGTCGCCCGCTGGCGTGCAGCCCAAGAAGCCGACCCGGTTCGGATTCAGATCAAGGCCGATCAGGATCAGGCCAACCGGGAGATCTCGCGCGCGGTCGGGGGCGCCAAGTCCAAGGCCCTGGAGGGTCTGAAAAAGGATCTCCAGCTCAACCTGAAAATTGCTGGCGTGGCAGGTATTCCGTCGGCGATCACCGGAGTAGCCGGGCTCACCAGCGGCCTGGTGGAGCTGTCGCAGGCGGCGCTGCTGGTACCGGGCGCCCTGGCTGGCATCGGGGCTGCCGTCGCCTCCCTGGTCACCGGACTATCGGGCGTCAAAGAGGCATTCTCGGCATACAGCAAGGCGCAAGACGATTCGGTCGAGTCGACGCGGCAGGCATCGGAGAATCAGCGCGCAGCCGAGCGCGCGGCTCGCGACTACTCGCGCGCGCAGCAGGACGTGACGATCGCCGTCCGCGACGCGCAAAACGAGATCCGCGACCTGAATCTTGAACTCAAGGGCTCCGCGCTCGACGAAGCTGACGCCATTCTCAATCTTCAGCAGGCACAAGAGGATTACGCCAAGGGGGGATTTAGGACTCAGCTCGAACAGCAGCGTGCCCAATTACGCATCCTGCAGAGTGAGCAGCGCGTCGAGGAAGTGCGCAACCGCAACTCAGAGTTGGCGCAGAAGACCGCCGACGCGAATGCGCGCGGTGTTGCGGGCGCTCCGGGCGTAGTGTCCGCCAACGATCGACTGGCCTCGGCGAACGATGCTGTCCAGGCCGCCCAGCAGCGCACTGTGACAAGCACCTTGGGAGTTGAGAAAGCGCTCGCGCACCTGTCCCCCAGGGCCAAGGAATTCGTCACGGACATGTCGGCGATGCGCGGCCAGCTGCGCGAACAGTTCAAGTTCCCTGCGCAGGATGCCCTGTTCGACGGCCTGAGCGCCAAGGTCAAATCGTTCGTTGCCGCCGATCTGCCGCTGATCACCAAGGGGTTCGCCGGGGTCGACACCGGCATCAACCACACGGTCGGCAACCTACTCGACTCGCTCAAATCACCTGGTGGCCAGGGCATCCTGACGCGCATCCTGGGCAACACGAGTGCCGCGCAAGAGCAGTTCTCGCACGCGATCGACCCGCTCGTCAAGGGGCTGGGAACACTGGCCGCTGCCGGTTCAGACTCGCTGCCGCGCCTGGCGAAGGCATTCACCGACGCATCCACCCGGTTCGCGAACTTCATCGACGCCGCGGACAAGGACGGGCGTCTGGCGCACTGGATTGACAACGGCATCAACGCCGCTCGCGAACTCGGCCACGTGTTCGGCAACATCGGCTCCACGCTCGCGTCGGTATCACGCGCGGTCGGTGGCGATTTCATGACGCGCCTCGATCACGTGACTAAGAAGATGGCCGATTTCTTCCAATCGTCCAAGGGTCAGCAAGATCTCAAGGGCATCTTTGCCCAAGGGCACGCCGAGCTAGACAAGTGGCTACCAGTACTCAAGGAGCTAGGCCCGATCATCGGCGCGGTGTTCAAGTCGGCAGCCGACCTCACTGGCGGATGGCTCACCGTGCTCACGCCAATCGCCGGCATCCTGCGCACCCAGCCTGGACTACTGCGCGCTGTCGTCGATGCCGTTCTGCTATGGAAGACGATCACCCCGATCACTCGCGCAGCTGGCGATGGATTCCGCGCCATGGGCGACTTCCTGGCGAGGGTGCGCAACGAAACCGAGAACATCGCTACGTCGGCCGCCAAGGCCGAAAGCGCAGCCGCGCCGAAGTACGCCAACCTCAAGGAGTACACCGCGGCCATCCGCCGGGAAGCCGAAGCGATGGGTACCGCCATGCCCGGTGCCGCAGCGGTTCTACAGGGCTACTACCGGCAAGCCGAGCAGGGCGCCAAGAACTCTGCCGACGCCATGGAAAGCGGCATGGGCCGATCCGCCAGAGCGGCCGAAGAGGCCGCGGGCAACAAGGCCAAGGGTGTCGGAAAGTTCAGCAGCGCAGTGGGTGCGCTATCGGCAGGGAGCGGTCCACTGATGGCCCTCACCATTGTCGGAATCCCACTGGTGGAGGAGTTCGTCACCAAATTCATGCGCGGTATGGACGATGCCAGGAACCGAGTCGAGCAACTGCGCCAGACAACGCAGGATCTCATCGGCACGCTCGATTCGGTCACCAATATGACTGGTCTGCAGTCACGTATTGAGGTCGCCAAGAAGCTCCAGGCCGGTAACACAGGGCCCGGCAATGGCATTGAGGGCAGCGCATTGGACGCCGCCAGGGATCTTGGCATCGGGGGACCCGGTGGTCAGGATCTCGTCACTGCAGCACTGCCCGGCGGCGACGCGAAGTACGACGAGATCATGAAACCGCTGCGCGACAAGGTGCGTCCGGCCGTCGATGACTTCATCAAGAATCAGGGCCTTGATCTCGGCAGGCTCGGCATTGACGAGCCAACCCTGGTTGACGCCTTCCTCGGCGTCCCGGCGGCACTCGACAAAGTAGACAGCATCCGCGATCTTGGCGGCACGCATAGCCCTATCGACTTGGGCACGCTCCAAAGTCAGGTCCAAGCACGCGACGACGACAGCGCCAAGGCGGCAATCGTTGGACAGAACCTCAACAAGCAGACCAAGGGCGCGCAGGGCGCCGTGTCGCAGGCTCAGCAGGCCCAGGCGGCGGCGGTTCCGCAACCACACCTCAAGGGTGAATTCGCACCGTTGTTCCCGGACGCCGTGGTCAATACCGACGGTAGCTCCACCACGATTGTGTCGACAGCGCCGCCGAACCCCGGCATGCATCTGTCGGAGGGCACCGAGAGCAAGCAGGGTGTTCCGCCAAATCAAGACAAGTGGACCTATCGCCTGTCCCCCGACGATGCACGGCGCCTGACTTTCGACCAAGGGGGCTGGACTCCCTCAGGCAAGGGGCCCGGACCCACTGGCGGCTACATCGCCGAGGTCCATCAGGACGAGTTTGTCATCAATCGCGATGCGGCACGGCGTGTTCCGTCGTCGTTCCTGCACGCGCTCAACAGTGGCGCGGTAGACCCGTCATCACTGCCGGGGTATGAACCGGGCGGTCAGGTCGGCCCCGACGGTTTGGGCATCGTGCGTCATCTGACCGACGGCGCTCTTGTGAAGCCGGGCCCCCTACCGGGCGCCGACAACGACCCGGGCTCGCTTATTCCGCAAAGGCTATTCGGTGGCAACGCGATTGTCGGATTGGGCGGTACGGGCGCACCGGACCCGACCAATATCGATTCCTGGCTCAACGCACCCGGAATGGCCAACCCTAAGAACGTCGCCTACAAGGCCGGAAGCATTCTGCTCAGCGGCGTTCTGGGGTTCTTCGGTCTCGATCAGTCGATTCTGTCGCCATCGAACCCGTGGAACCAGGCGATCCAGCAGACCGTCGGCGGTCTGAGCAAGGGCATCGGACCGTACGCGCAGGCCAACCAGACCGCCGGTATCAACATGCAGAACTACATCGACGGCAACATTCCGGCGATCCTGGCGAACGCGCAGTCCGGTGCGTACACAGGTGTGCCGTCCGGCGTGCAGGGTATGTCCGGCAGCCAGGAGTCTGGCACAGCCTCGGCGATGTCCGGAAGTCCTTCCGGCGCGGGTGTTGAGCGCTGGCGCCCCATGGTTCGCGAGACGTTGCAAAAGTACGGGCCGCAGTTTGGCATCACGAACTACAAGGCATGGGAAGACGCGATCATGCGTCAAATCGCCACCGAGAGTAGCGGCAACCCGTCGGCTATCAATCTCGACGACTCCAACGCTAAGGCTGGCCACCCCTCACAGGGGCTGCTGCAGTTCATTCCAAGTACGTTTGCCGCCAACAACATCAGCGGAGGGCAGTTCTTGGATCCGGCGGCACAGATGGCCGCGATCATCCCGTATGTAATCCGCACCTATGGCATGGCCTCGGACGGATCTCCCAACCGGATCGGTCGCGGCATCAACTACGCCGACGGCGGAACGGTCATCGGGCCCACCGGTGTCGACGTGATCAACGCCAACCTGACCGCCGGTGAAACCGTCATCAATCGCTCGGCGTCCAACCGCTACGGCACCGCGGCACTGGCATCGGTCAACGCCGGTACCGCGCGTATCGACCCGACGCCAACACCGGGTGGCATGCAGGCGGCCGGCGCGGCGAGCGCCATCGCGGGGGCAGCACAGCCGATCCCCATCAAGCCACTACAGCCGAATGCACCGATCGTGGCTCCTCCCGCTGGCGCGGGCCCGGCCGTGGCTCCCCCGACCGCAGCCGCAGCACCGCCCCCCCAGTCCCCGGCGGCGCAGCAGCCGCAGCAGGCCACCGGCCCGCAGGAGACCCCGCAACAGGGCGCCTCGGCTCAGCCCAGCATTGCCCCGGCGCCGACCTCGGAGGACCACGAACTGCCCGCGCTGAAGAAGGGCATCACCGAAGGGGCCGCGCAGCTCGGCACGCTGCTACAGGCCGCGGCCGGTGCCGGCGGATCCATGGGTGGGGGCATGGGTGCGATGGCGGGCCCGATGATCCAGGGCGCTACGCAGATGGGCGGTCACCTGGCCAACGACGTTGCGAACATCTTTTCCTCGGCGCTTGTGGGCAACCTGGGCGACAACACGACCACGGGCGCCTATGGAGCTCCGGTTGTTTCGGCACCGCCGCAGCCCTCGACCGTCAACCGCAACACCTACTTCGGTGACGTGTCCGCTTCGGATCCGACCGACTTCCTCAATAAGCAGCGGCTCTACGAGCAGCAGCGCGAACAGTCGTTGGTCAGCTATGTGCCATAG
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(1gM3W)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50