Protein
- Protein accession
- A0AAU7PG56 [UniProt]
- Representative
- 3xPSf
- Source
- UniProt (cluster: phalp2_2140)
- Protein name
- Tail tape measure protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MGAIKSIMATEVKIGTEGSTKSLNQLENAIKATTYAWKSQAASAKNAGQLLKSAEARYDGITKSISQTQAKIKILEERQKECDRTTESGANAYAKYESKLIKARQSLEKLTNQQQQAAKALEYQKSGLGKLQQAYSSSKALSESYVARLEAEGKKYEAAKERAKQYAFAVKNLDAQLQIQRNELDKIADTSGKTSQAYRSQQIRINETATKLAQANNRMKELNKSIQRSNPSVFDHFKQKLSSVNKEASETHRTLKDVFMGSFLGNALSNGLSNLSGKISNIYSEGMSLNAAAGSINAKFKAMGMSNKVIARLNSQIGMLKRNTAMSGTQIAQLQTQMTNWSAIGSKGAQKIVTTLGAIGDSSRLSGDQIAQMGSSLMRVGASGKVSLSSLNRLAKSAPSFYTTLAKGAGMSVDKMKELLASGKVTQKQFQTWLADASKYSDTAFQGWGKTQTGALKSMHDAWHTLEQTMTAPLFNAKSSGLQDIKTLLSSKELTNGAKAIGDAIQHGISFLVKHKKDISGIVKDIFSIGIEVGKDVWKTFSAIFESVAKSFGLINSHGKKSGSALHDFKTVLDKVASNKKAIQEIANALIVIAGIKTLGKVGGGLMTVVGAVNKIRDKTSLANAGTDKENFLTGSMQSIRSARRTGGLSTAGKLMTGAAAAGVAADSLSTLYTAFKNDKKGSTKQFQDVGSGIGSAIGGGLGLYFGGPLGAAVGSQLGKALGGVAGKGAKSFQKGWKKNKPPKKFWSLENFGYSTHNFFKGIKSGLNSFNKWWDKKWKSIKKGFSNTWKAIREAPGKAWKKINKSWNSFWSSFNKSFKGSWNKIKKFFGGLWDDIIKMPSKAWKKISKGFESFSKDFKKTWRSLGKGIQDIWDTAWGKIKKLAHDGVAGIVKVINTGISGIDTVIAAFGGSKTAIKKIKFATGTGAFSGPRRAITKPTLAMLNDGNDSPETGNKELIWRPATGQAGIVQGRNSTAMLMPGDEVLNASETKSLMAFAGIEHFAKGTGVLSGIASWASGIGSWIGQKASALMKWFKKVTVIIANPAKALADVFKTSTKGLKGVMVDLGQGMMKSAKNAATSWWSTLWSMASDKLGGSGSSSALLNAAEKYGAGKPYVWGATGPESFDCSGLVMYALKQAFGINYPHYSGSQYAASTHISKSQAKPGDLVFWGSGGSEHVGVYAGGNNYYSAQSPSQGIGMNTLSSVVGKGSPLFARVPGLKDDTSSEKKSSKGSSKLQSMIKDQVGSGFWKFMSKLGSLFGGGDSGSGGMFSPSMIRDAAKQMHVSVSDRFVQKLQTVIQNESGGRNIVQQIHDVNSGGNEARGILQYTPTTFAAYAVAGHNNIMNPYDQLLAFFNNSDWKNSIGWTTIWDTRKLDWLHSGPQGHRRYANGGLVATEQLAHVAEGNKPEMIIPLDIAKRSRANQLLTQVQDKFAAEAPQTTTSGQTTDSKELLTRMDKMLALLGMLLKGQGDMEVTLDGQKLGKVLKKQQTIEDMRTALLYG
- Physico‐chemical
properties -
protein length: 1501 AA molecular weight: 161097,1 Da isoelectric point: 9,95 hydropathy: -0,36
Representative Protein Details
- Accession
- 3xPSf
- Protein name
- 3xPSf
- Sequence length
- 885 AA
- Molecular weight
- 95506,45140 Da
- Isoelectric point
- 10,30729
- Sequence
-
MAGTHETYSLGINVNYGSVNKAEHALGAVYTSLGKVGERADRLHMPSALPREINHIDTVTASYIQRLESEGRTYQANLQKVKAYQSAVAELSNKQRALQNDLTKIAESSGKASDAYRLQKVRINETATELSHFKSGIQATQTELRSSNPTFFDKVKSKLAGVNHEANDTHTTFKDMFMGSALGGAVSNALSSVTSQIGGAVKQGMNLNAAVYKINSRFQAMGMSARQVRSLDSQLGELKANTAMTGDNVATLQARMLSWSTIGTKGAMEMAKTMAGIGDSSKLNGDQVEQLSGQLMRLGSTGKVTGASLSRITRNAPTFYATLAKGAGMSQSRLKSLLSSGKVTEQQFQTWMAQASKYSDTAFKSFGETQAGAMNYIKVKWQGLSQEMTKPLFDAKSSGLQSLKSILTSKELTNGAKAIGNGIAASVEYLDKHKKDISDSLKDIINISVILGKDIWKNVASIIGNIGKAFGLVNKNAKGDGLHQFAQGLGNLSKNKAALKTISNLITAALALKGLKLATVFADPFISIGKKGVKSVQAINGFAKGVKGVTSAEDLKKMSDVGQNFYGWGDSIKTSAGKLKDFFAGLKPLVPKATALGKKIGDALTKGVKASVKLGMDKRFATGVLAGAAVATPEAMSAIKDRHSANKRSQDIGGAVGAMAGGTLTSMIPVVGPLLAPVGAIIGKYAGRWGGHAVNQFTKGWQKNKPPKNFWSMQNLGWSTHNMFKQIGSGWNHFWAGMGDWRKKQAKGIGQWANSVGRGWNRGVKGVKTWFKNIPSNLGKTGNHIKTWAGRVGNNIHKGWNKGVKVSHSFFKNLPRNMSKAGRSLQRGWNKTWKGVSKNRYVKAFKKGRVFQTAFKDMHSRFNSFKKWFGKGWNKTWKGVNKNRY
Other Proteins in cluster: phalp2_2140
| Total (incl. this protein): 2 | Avg length: 1193,0 | Avg pI: 10,13 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 3xPSf | 885 | 10,30729 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_1109
7vQjn
|
20 | 25,4% | 830 | 6.080E-74 |
| 2 |
phalp2_23533
7euEk
|
5 | 25,8% | 823 | 7.130E-50 |
| 3 |
phalp2_12070
5hYmz
|
8 | 24,5% | 652 | 4.489E-37 |
| 4 |
phalp2_7660
6v5eq
|
4 | 23,0% | 824 | 9.062E-25 |
| 5 |
phalp2_26493
28Azc
|
13 | 23,0% | 582 | 8.371E-09 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Lactobacillus phage G2-Guo [NCBI] |
3155564 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
PP779550
[NCBI]
CDS location
range 25185 -> 29690
strand +
strand +
CDS
ATGGGAGCGATTAAAAGCATCATGGCAACTGAAGTCAAGATTGGGACAGAGGGTTCTACCAAATCTCTGAATCAGCTTGAAAATGCGATAAAAGCAACAACTTACGCCTGGAAAAGCCAGGCAGCCAGTGCAAAAAATGCTGGACAATTGCTGAAGTCGGCAGAGGCTAGATATGATGGTATAACGAAGAGCATCAGCCAGACGCAAGCGAAGATTAAAATCTTAGAAGAACGGCAAAAAGAGTGCGATAGAACGACAGAGAGCGGTGCTAACGCCTATGCCAAATATGAAAGCAAGCTGATTAAAGCCCGGCAAAGTCTGGAAAAGTTGACCAACCAACAGCAGCAGGCTGCCAAGGCGCTTGAATACCAGAAGTCCGGGCTTGGCAAACTGCAACAGGCCTATTCTAGTTCCAAAGCTTTATCAGAATCATATGTTGCCCGCCTAGAAGCTGAGGGCAAAAAGTATGAGGCTGCAAAGGAACGTGCTAAACAGTATGCTTTTGCGGTTAAGAATCTTGACGCGCAATTGCAGATTCAGCGTAATGAGCTGGACAAAATAGCAGATACATCAGGCAAAACAAGCCAGGCGTATCGATCTCAGCAAATCAGAATTAATGAGACGGCTACCAAGCTGGCGCAGGCTAATAACCGGATGAAGGAGCTCAACAAGTCTATACAGAGATCTAATCCATCGGTATTCGACCACTTCAAGCAGAAGCTCTCAAGCGTCAATAAGGAGGCTTCTGAGACTCATCGGACGCTTAAGGACGTCTTCATGGGGTCGTTCCTCGGAAACGCTCTGAGCAATGGCTTGTCAAATCTGAGTGGCAAGATATCCAATATCTACTCCGAAGGAATGTCGCTTAACGCAGCCGCCGGATCGATCAATGCAAAGTTCAAGGCAATGGGCATGTCTAACAAAGTAATTGCCCGTCTTAACTCACAAATTGGGATGCTTAAGCGCAACACCGCTATGTCTGGAACTCAGATTGCTCAGCTACAGACTCAGATGACCAACTGGTCAGCGATTGGCAGTAAGGGAGCACAGAAAATCGTCACGACTCTGGGCGCTATCGGTGACTCAAGTCGACTGAGCGGTGATCAGATCGCTCAGATGGGATCCAGCCTGATGAGAGTCGGTGCTTCTGGCAAGGTCAGCCTTTCCTCCCTAAACCGGCTGGCCAAGTCAGCGCCATCCTTCTACACCACGCTCGCCAAGGGCGCTGGAATGTCGGTCGACAAGATGAAGGAACTGCTGGCCAGTGGTAAGGTGACTCAGAAACAGTTCCAGACCTGGCTGGCAGATGCATCGAAGTATTCTGATACAGCATTCCAGGGCTGGGGCAAGACTCAGACGGGTGCTCTTAAGTCAATGCATGATGCCTGGCACACTCTGGAGCAGACGATGACTGCTCCGCTTTTCAATGCTAAGTCTAGCGGACTGCAGGACATTAAGACGCTGCTGTCATCTAAAGAACTGACCAACGGAGCTAAGGCAATCGGTGACGCTATCCAGCACGGGATCAGTTTTCTTGTCAAGCATAAGAAAGATATCAGCGGAATCGTCAAGGATATCTTCAGCATCGGCATAGAGGTCGGCAAGGATGTCTGGAAGACATTCAGCGCCATCTTCGAGAGCGTAGCCAAGAGCTTCGGACTGATCAATAGCCATGGCAAGAAGTCAGGATCAGCACTTCATGATTTTAAGACGGTGCTTGATAAGGTGGCCAGCAACAAGAAGGCTATCCAGGAGATCGCTAATGCTCTGATTGTTATTGCCGGAATCAAGACTCTTGGCAAGGTTGGCGGCGGACTGATGACGGTCGTTGGTGCCGTCAACAAGATTAGGGACAAGACTTCGCTGGCCAATGCCGGCACTGACAAGGAGAACTTCCTGACCGGTTCGATGCAATCGATCAGATCAGCCAGAAGGACTGGCGGGTTGAGTACTGCCGGGAAGTTAATGACTGGTGCTGCAGCCGCTGGCGTTGCAGCCGATTCATTGAGTACTCTTTACACGGCTTTTAAGAATGATAAAAAGGGGTCAACAAAACAATTCCAAGACGTTGGCTCGGGCATTGGCTCAGCCATTGGTGGCGGCCTTGGCCTTTACTTTGGTGGCCCACTAGGCGCTGCTGTTGGGTCACAACTGGGGAAGGCCTTAGGTGGTGTAGCCGGCAAGGGCGCTAAGTCATTCCAAAAGGGCTGGAAGAAGAACAAGCCACCGAAGAAGTTCTGGAGTTTGGAAAACTTCGGCTATTCAACACACAATTTCTTTAAAGGCATCAAATCTGGCTTAAACAGCTTCAATAAATGGTGGGACAAGAAGTGGAAGTCGATCAAGAAGGGCTTCAGCAATACTTGGAAAGCTATTCGTGAAGCACCAGGCAAAGCCTGGAAGAAAATTAATAAGAGCTGGAATTCATTCTGGTCATCGTTTAACAAGTCATTCAAGGGATCCTGGAACAAAATCAAGAAGTTCTTTGGCGGCTTATGGGATGACATTATCAAGATGCCGTCCAAGGCCTGGAAAAAGATCAGCAAGGGCTTTGAGAGTTTTAGCAAGGACTTCAAGAAGACCTGGAGAAGCCTGGGCAAGGGGATCCAAGACATTTGGGATACTGCTTGGGGCAAGATCAAGAAGCTCGCACATGATGGCGTAGCCGGGATTGTTAAGGTAATCAACACTGGTATCAGCGGAATTGATACTGTCATTGCGGCTTTTGGCGGTTCAAAGACAGCAATCAAGAAGATCAAGTTTGCGACCGGTACTGGTGCCTTTAGTGGCCCTAGAAGAGCTATTACCAAGCCGACGCTGGCAATGCTCAACGACGGCAACGACAGCCCAGAGACTGGCAATAAGGAACTGATCTGGCGGCCAGCAACTGGCCAGGCAGGTATCGTCCAGGGTCGCAATTCAACAGCCATGCTGATGCCTGGTGATGAAGTTTTGAACGCTTCTGAGACCAAGTCATTAATGGCATTTGCCGGAATTGAGCATTTCGCAAAAGGAACTGGCGTTCTGAGTGGCATTGCAAGCTGGGCCTCAGGAATTGGCAGTTGGATTGGCCAAAAAGCATCAGCATTGATGAAGTGGTTCAAGAAAGTAACTGTAATCATTGCTAACCCAGCCAAAGCATTGGCTGATGTCTTCAAGACCAGCACCAAGGGCTTGAAGGGCGTTATGGTTGATCTTGGCCAAGGCATGATGAAGTCGGCCAAGAATGCGGCAACCAGTTGGTGGTCAACACTTTGGAGCATGGCTAGTGACAAGCTGGGTGGCTCTGGCTCATCTAGTGCTTTACTGAACGCGGCAGAGAAATATGGTGCAGGCAAGCCTTATGTTTGGGGTGCAACAGGTCCAGAGAGCTTTGACTGTTCTGGTTTGGTCATGTACGCGTTAAAGCAGGCATTTGGGATTAACTATCCACACTATTCCGGCTCACAATACGCTGCATCGACCCACATTTCCAAGTCACAAGCTAAACCGGGTGACCTGGTCTTTTGGGGATCTGGCGGTAGTGAGCACGTCGGCGTTTATGCTGGTGGAAATAACTACTACTCAGCACAAAGTCCAAGTCAGGGCATTGGGATGAATACCTTATCCTCTGTTGTAGGCAAGGGATCACCATTGTTTGCCCGGGTTCCAGGACTGAAAGATGATACTTCAAGCGAAAAGAAATCGTCAAAGGGCTCATCCAAGCTTCAAAGCATGATTAAAGACCAGGTCGGCTCCGGCTTCTGGAAGTTCATGTCTAAGCTGGGAAGCTTGTTTGGCGGTGGTGACAGTGGATCTGGCGGCATGTTCAGTCCAAGCATGATCAGGGATGCCGCTAAGCAAATGCATGTTTCAGTGTCAGATAGATTTGTTCAGAAGCTTCAAACAGTTATCCAGAACGAATCTGGCGGCAGAAACATTGTTCAACAGATTCATGATGTCAACAGTGGTGGTAACGAAGCCAGAGGTATTCTGCAGTACACGCCAACGACCTTTGCAGCCTATGCTGTCGCTGGTCACAACAACATCATGAATCCATACGACCAACTGCTTGCCTTCTTTAACAACAGCGATTGGAAGAACTCCATCGGCTGGACAACCATTTGGGACACTCGCAAGCTTGATTGGCTGCACTCTGGTCCGCAAGGCCATAGAAGATATGCCAATGGCGGTCTGGTGGCAACTGAGCAGCTTGCTCACGTGGCCGAAGGAAACAAGCCAGAAATGATAATTCCTCTGGATATTGCCAAGCGGTCCAGAGCAAATCAATTGCTGACGCAAGTGCAAGACAAATTTGCGGCTGAAGCACCGCAAACGACGACTAGTGGTCAGACAACTGACTCTAAAGAGCTGCTCACACGGATGGACAAGATGCTTGCCCTGCTAGGGATGCTGCTTAAGGGCCAGGGGGACATGGAAGTTACTTTGGATGGCCAAAAGCTTGGCAAGGTACTGAAGAAGCAGCAGACGATTGAAGACATGCGTACTGCTCTGCTTTATGGCTAG
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0008234 | cysteine-type peptidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(3xPSf)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50