Protein
- Protein accession
- A0A3G2KCL9 [UniProt]
- Representative
- 6deMH
- Source
- UniProt (cluster: phalp2_35192)
- Protein name
- Tail tape-measure protein
- Lysin probability
- 100%
- PhaLP type
- VAL (probability: 99%, predicted by ML model)
- Protein sequence
-
MDKLKVENEMATRISVDTIAATKSLSAFRSSISAATNAWKANETALKNSGQYAEAAKARISGLNEVIELQKAKISELKSRQEGLNLSNKDQLETWLKLDKDISQASKQLASYEVQVNRANSTLKYQTSGLAELQTSFRRAQESSRAYANNLRANGKELKANDVEIKGLSIGLKTLSKQYDLQKKELDQLAKTQGKNSEAYRKQKVRLDETSASIGKTKSQISELKTRTDDLHASLIRKDTLGSGFFASARNKILGIKNAEELTNRETKSLRETLKTSFAGVFVSNLASNAIMAMTSNMHGLIEAGREYNKEQDTMRTVWKSLTTEAPQDGRQLINFINDLSQHSIYSAETINKMAQSFYHVDSNVKHAKQWTNDFVRLGSTMHMTNAQIAEAGEQYAKIVAGGKASQEDMNVMINRFPMFGEAIQKATGKSMKQLQELSQQGKLTADDFVKAMDYLGKKYKTGQSEAMTSYMGMSMYLKSRFSKLSGDVQKSSFKMSKSAKDALVQVTSDKSMQRYANSISKALAGVLSLLSKTIAFMSKHQTAVKVFSKTMIASFAFTKTARLVTAFYMTLGKGIAVYKGLSSAAKIAALNQKMLNLAMKSNVIILVISAIAALIIELKHLYDTNKQFRKFINGIAKFAKSGLKKVGSFFKNTFKQISKSQEQSNREQAKANKQAEKNWHNFTNSLAKNWKSYWRNRQREQRQDEKQNQQYWNNVRKSASRGWKNIETSARSGVNNINRWYNNLNNSTSRIVRNMYRQHPKTFQSMYRVIQDHTRSWHDLVSGHWDRLGQDTSQTAKDMRKNNHQIFKDMYDRLNDLTGGSLGRMLKSWQDHMSQIGDAIATGKKNAMRAMADLANGVLKPFNTLMNDIKNGLNWILDKIGASKISGDWSISVPSYATGTAGNPDGTKRSSLALVNDGPGEHFREMYRLPNGQIGMFPNKRNFLAFLPKGISILNGEASHQLAKAFNLPRYANGVGDFFSGLTDKLDNAGEFIDKVIEHPIEALNEVFKRFVHISTPIKYASDLVVNVPIYIAKQAGKWIKKQFEELADPGGSGVERWRPYVVKALAMLHLSGSLVGKVLRQIQTESGGNPKAMGGTDGLADGHAMGLMQVKPGTFAANKLPGHGNIWNGFDNLLAGLNYARKRYGDSLSFLGQGHGYANGGRIDTEQFIRIAEQNKPEYVIPTDINKRSRAYQLLGEVIARFRGEEPSVQTTRDDQSIDKDNFRSLESKLDQLHKDIQSLINLGTQQVAAIHSQGKFDPKRQDILQAQRLSMKLNSF
- Physico-chemical properties
- Protein length: 1279 AA; molecular weight: 142992.1 Da; isoelectric point: 9.85; hydropathy: -0.57
Representative Protein Details
- Accession
- 6deMH
- Protein name
- 6deMH
- Sequence length
- 446 AA
- Molecular weight
- 48937.11110 Da
- Isoelectric point
- 9.35631
- Sequence
-
MSQIGDAIATGKKNAMRAMADLANGVLKPFNTLMNDIKNGLNWILDKIGASKISGDWSISVPSYATGTAGNPDGTKRSSLALVNDGPGEHFREMYRLPNGQIGMFPNKRNFLAFLPKGISILNGEASHQLAKAFNLPRYANGVGDFFSGLTDKLDNAGEFIDKVIEHPIEALNEVFKRFVHISTPIKYASDLVVNVPIYIAKQAGKWIKKQFEELADPGGSGVERWRPYVVKALAMLHLSGSLVGKVLRQIQTESGGNPKAMGGTDGLADGHAMGLMQVKPGTFAANKLPGHGNIWNGFDNLLAGLNYARKRYGDSLSFLGQGHGYANGGRIDTEQFIRIAEQNKPEYVIPTDINKRSRAYQLLGEVIARFRGEEPSVQTTRDDQSIDKDNFRSLESKLDQLHKDIQSLINLGTQQVAAIHSQGKFDPKRQDILQAQRLSMKLNSF
Other Proteins in cluster: phalp2_35192
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_28873 (5i1Sw) | 11 | 42.7 | 379 | 3.093E-77 |
| 2 | phalp2_427 (7zt3x) | 4 | 30.7 | 458 | 7.252E-49 |
| 3 | phalp2_39661 (6T6Py) | 9 | 34.5 | 402 | 5.564E-44 |
| 4 | phalp2_32810 (2G5Ou) | 3 | 30.3 | 385 | 2.582E-34 |
| 5 | phalp2_10032 (7dnml) | 5 | 36.5 | 312 | 1.120E-29 |
| 6 | phalp2_16583 (7C5Au) | 1 | 26.9 | 382 | 1.073E-20 |
| 7 | phalp2_26052 (79GVR) | 1 | 26.8 | 376 | 2.480E-18 |
| 8 | phalp2_7599 (5FUza) | 2 | 24.6 | 495 | 1.246E-14 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Lactobacillus phage LR2 [NCBI] | 2419582 | No lineage information |
| Host | No host information | | |
Coding sequence (CDS)
CDS Source ID
CDS Source
MH837543 [NCBI]
CDS location
range 14205 -> 18044
strand -
CDS
GTGGATAAATTGAAAGTTGAAAATGAAATGGCGACGCGGATATCAGTTGATACAATTGCTGCGACTAAAAGTTTATCTGCGTTCCGGAGTTCAATTAGTGCGGCCACGAATGCTTGGAAAGCTAATGAAACTGCGTTAAAAAATTCTGGCCAATATGCTGAAGCTGCTAAAGCGCGTATTTCTGGATTAAATGAAGTAATTGAACTTCAAAAGGCTAAGATTTCTGAATTAAAAAGTCGTCAGGAAGGCTTGAACTTATCTAATAAGGATCAATTAGAAACATGGCTGAAATTAGATAAAGATATCTCTCAAGCTAGTAAACAATTAGCTTCGTATGAGGTACAAGTTAATCGGGCTAATAGTACATTAAAGTATCAAACTTCGGGTTTAGCTGAATTACAAACTAGTTTTCGAAGAGCCCAGGAGTCTTCACGAGCATATGCAAATAATTTGCGTGCGAATGGTAAAGAGCTTAAAGCAAATGACGTTGAAATTAAGGGCTTAAGTATTGGATTAAAAACTCTTTCTAAGCAGTACGACCTTCAAAAGAAAGAGTTAGATCAGCTTGCTAAAACACAAGGAAAGAATAGTGAAGCATACAGAAAACAAAAAGTGAGGTTAGATGAAACTAGTGCGTCAATTGGAAAAACAAAATCGCAAATTTCTGAATTGAAAACTAGAACTGACGACTTGCATGCTTCACTAATTCGTAAAGACACTTTAGGATCTGGTTTCTTTGCATCTGCTAGAAACAAAATTCTTGGCATAAAAAATGCAGAAGAATTAACAAATCGTGAAACTAAATCGTTAAGGGAAACATTAAAAACATCATTCGCAGGTGTTTTTGTATCTAATCTTGCATCTAATGCGATTATGGCTATGACAAGCAATATGCATGGCTTAATTGAAGCGGGTCGCGAATATAATAAAGAACAAGATACAATGCGTACTGTATGGAAGTCTTTGACAACCGAGGCACCCCAGGATGGCAGACAACTAATTAATTTTATTAATGATCTTTCTCAGCACTCAATATACTCTGCAGAAACAATCAATAAAATGGCTCAAAGTTTTTACCATGTGGATAGCAATGTTAAACATGCTAAACAGTGGACTAATGACTTTGTGCGTTTAGGTTCAACCATGCATATGACGAATGCTCAGATTGCAGAAGCAGGTGAGCAGTATGCAAAGATTGTTGCTGGTGGTAAAGCGAGTCAGGAAGATATGAATGTTATGATCAATCGTTTTCCGATGTTTGGAGAAGCAATTCAAAAAGCAACTGGAAAATCGATGAAACAGTTACAAGAACTTTCCCAGCAAGGCAAATTAACTGCTGATGATTTTGTCAAGGCCATGGACTATTTGGGAAAGAAATATAAGACTGGACAATCTGAAGCGATGACAAGTTACATGGGAATGTCCATGTATCTTAAATCGCGCTTTTCAAAACTATCAGGGGATGTTCAAAAGTCGTCCTTTAAGATGAGTAAATCTGCTAAAGATGCTTTAGTTCAAGTTACGTCTGATAAATCTATGCAACGTTATGCTAATAGTATAAGTAAAGCATTAGCCGGAGTGTTAAGTTTATTATCTAAAACTATTGCTTTTATGAGTAAGCATCAAACAGCAGTTAAAGTTTTTTCTAAAACAATGATTGCCTCTTTTGCATTTACTAAGACTGCAAGATTAGTAACAGCTTTTTATATGACTTTGGGAAAAGGAATTGCTGTTTACAAAGGATTGTCTAGTGCTGCAAAGATTGCCGCCCTAAATCAAAAAATGCTTAATCTTGCTATGAAAAGTAATGTAATTATTTTAGTTATTTCTGCTATTGCTGCATTAATAATTGAATTAAAGCATTTATATGACACTAATAAACAATTCAGAAAATTTATTAATGGGATTGCTAAGTTTGCAAAGAGTGGATTGAAAAAAGTAGGAAGTTTTTTCAAAAATACATTCAAACAGATCAGCAAGAGCCAGGAACAATCCAATCG
TGAGCAAGCTAAGGCTAATAAGCAGGCAGAAAAGAACTGGCATAACTTTACTAATAGTTTGGCCAAGAATTGGAAGTCTTATTGGCGTAATCGACAACGTGAACAACGCCAAGATGAAAAACAAAATCAACAGTATTGGAATAATGTTCGAAAGTCGGCATCACGTGGTTGGAAGAACATAGAAACAAGCGCTCGTTCAGGTGTGAATAATATTAATCGTTGGTATAACAATTTAAATAATTCAACTTCTAGAATTGTTCGAAATATGTATCGGCAGCACCCTAAAACATTCCAATCTATGTATCGGGTTATTCAAGATCATACAAGATCTTGGCATGACTTAGTGAGTGGCCACTGGGATCGGTTAGGACAAGATACCAGTCAAACAGCTAAAGATATGCGAAAAAATAATCATCAAATTTTTAAGGATATGTACGATCGTTTGAATGACCTTACTGGTGGTAGCCTAGGACGGATGCTTAAATCGTGGCAAGATCATATGTCTCAGATTGGGGATGCAATTGCAACTGGTAAGAAGAACGCCATGCGAGCAATGGCTGATTTAGCTAATGGTGTTCTAAAACCATTTAATACCTTAATGAATGATATTAAGAATGGTTTGAACTGGATCCTTGATAAAATTGGTGCTAGCAAAATTAGTGGTGACTGGTCAATCTCAGTTCCGAGCTATGCGACTGGGACTGCTGGAAATCCTGATGGCACTAAAAGGTCTTCACTTGCACTAGTAAATGATGGTCCGGGTGAACACTTTCGTGAAATGTATCGTCTTCCTAATGGTCAAATTGGAATGTTTCCTAATAAGCGTAATTTCTTAGCATTCTTACCAAAAGGAATTTCTATTCTTAACGGAGAAGCTTCTCACCAGTTGGCAAAAGCTTTTAACTTGCCTCGTTATGCTAATGGAGTAGGTGACTTTTTTAGCGGACTTACTGATAAGCTTGATAATGCCGGAGAGTTTATTGATAAGGTTATTGAACACCCAATAGAAGCGCTAAATGAGGTATTTAAGCGCTTCGTGCACATTTCAACACCAATTAAGTATGCTAGTGATTTGGTTGTTAATGTACCCATTTACATTGCTAAACAAGCAGGAAAATGGATTAAGAAACAATTTGAAGAATTAGCTGATCCTGGTGGTTCTGGTGTGGAACGTTGGCGGCCATACGTAGTAAAGGCATTAGCAATGTTGCATTTATCAGGCAGTCTTGTAGGTAAGGTACTTCGTCAAATTCAGACGGAATCTGGCGGTAACCCTAAAGCAATGGGTGGAACTGATGGTTTAGCTGACGGTCATGCGATGGGATTAATGCAAGTTAAGCCAGGAACATTCGCTGCTAATAAACTTCCGGGCCATGGAAATATTTGGAATGGATTTGATAACTTGCTTGCGGGACTGAACTATGCTCGTAAACGTTATGGAGATAGTCTTTCTTTTCTTGGCCAAGGTCATGGATATGCGAATGGTGGTCGAATTGATACTGAACAATTCATTCGGATTGCGGAACAGAATAAGCCAGAATATGTTATTCCGACAGATATTAATAAAAGGTCTCGTGCTTATCAATTACTTGGTGAGGTTATTGCACGTTTTAGAGGTGAAGAACCTAGCGTTCAGACAACACGAGACGACCAATCTATCGATAAAGATAATTTCAGGTCGTTAGAATCGAAATTAGATCAGTTACATAAGGACATACAATCATTAATCAATTTAGGAACTCAACAAGTTGCTGCAATTCATAGTCAGGGTAAATTCGACCCCAAGCGTCAAGATATTTTGCAAGCACAAAGACTGTCAATGAAATTAAATTCGTTTTAA
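The CDS range and the protein length reported on this page can be cross-checked with simple arithmetic. The sketch below assumes the range 14205 -> 18044 is 1-based and inclusive (the usual GenBank convention) and that the CDS includes its terminal stop codon, as the trailing TAA suggests. The function names are illustrative and not part of PhaLP.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_nt: int) -> int:
    """Residue count for a CDS that includes one terminal stop codon."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_nt // 3 - 1  # subtract the stop codon


nt = cds_length(14205, 18044)
print(nt, expected_protein_length(nt))  # prints "3840 1279"
```

Under these assumptions the result agrees with the 1279 AA protein length listed under the physico-chemical properties.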
Gene Ontology
| GO term | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0098003 | viral tail assembly | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi000172d02e_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
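The confidence bands listed above can be expressed as a small lookup when post-processing per-residue pLDDT values from a model file. This is a minimal sketch: the function name `plddt_band` is hypothetical, and because the legend does not say which band the exact boundary values (90, 70, 50) belong to, the code assumes they fall into the higher band.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Assumption: scores exactly on a boundary go to the higher band.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score >= 90:
        return "Very high"
    if score >= 70:
        return "High"
    if score >= 50:
        return "Low"
    return "Very low"


for value in (95.2, 81.0, 63.4, 12.5):
    print(value, plddt_band(value))
```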
The structures below correspond to the cluster representative (6deMH) rather than this protein.