Protein
- UniProt accession
- A7KUS1 [UniProt]
- Protein name
- Baseplate hub protein
- PhaLP type
-
endolysin
evidence: GO annotation
probability: 89 % (predicted by ML model)
- Protein sequence
-
MALATSANYHGPESNSYQTGFRAKLAGTDKYNALIEKVAAETGCNPLLMKVLMAMESGGQMLGTNSLGYFGLMQTNYKRFSKELGAAYRTDPYLQIKSGAIELIEKKDYIMGVIKRGNGKGPKEKWTEEIFQTAWAYNGYTINASSKMMDGYRYAQTFQAVYEGMGGSMADKVSSGQMGGGTGTGGGGGTGGEVAKPKYDGGAPDWKGGGERGIGGTGPGTSTKMDDYEKNDPNSAMFIGMSPEVEEDDYPKHRYAGNPFVLRIGDSQFFIPPTSIKTDKTSSIAEQHILRAKTPIMTKSGYTLHTLSIQMYFFGEEQINGYPIEGPGGKTYYMDGLRPLLAQFYKNPFLPIRNELINNQHNIYNVALQDISYTVDPDFSDAIVVNLTLIECCVEPYINYPEFVYDRIFVYPLFRWWYQQQMLGKETERYSGTWLKPVNKHLNGKAFFKVLDRDLIEQMKDDVMKKNDGKTTKDGQTINPEYALARMKIGDIRKLMVEWETGDAVLTNVTVSVGKNLTPMYMDDYEKPLFQDLGGMVRGFTLEYYCTNRGELESFQALASHLEELSRDYRFRFVSGYLAIDNELINLAGIQNAMITNIQTSTVEGMSDNYIVRITCREFNANQKNEERLNGINYTMKESLNKYGFVDAIPTKTNRKSEIAYEAEVMKILNDLELYPDLELPTFEVANKAVAEINAYREKRGQSKLPYDKLKQPDNATWCDPDFYMAYPSTKQAYSHMQLGDMGDKIINQLRNGSYDEIKNMSNDEGFWKTADDIQQLFTKGVVDNSWDADADGKRKLLKSSSVTFENATKDFLVDAGQSWGDDNSIPEDKQLVEMMTHDMLKYSHRGRMTRAFPSYMLLFVDEGQWVDGKRLWNNYYTYHAIQQMDIISDKENPVDLAFVTLSNVYGTFDFQAKMSDPRKYAKEPIGLPGRIGALIDDFSFTVTAKVMAERSQLMEQVKLREGARVHMRMGYSATAGNMPIVFNGKIASVNEGEVIQMLCQGDGAELINQYYSTDPKGDTPNEPHNTFQEMLTKRTSNYWFTVSEDWEFGDNYLSYYGIEHFGFVESEAGGFWKGLGKNFEILKSFLTDSVPFSAYDIMKNIYKGSNSPVTGQETGWSPFDGEKNINIAAYNKTPWDIGQILSMFVPEFICAPHHHGFRSTLFFGMPHWPVKYEYILKDGKEGNSYTDYQEKVKPFQQFHMITSGNDIIANKIQASSEKLKHIAVGMYKMGSGQDAAESYTVWADRTIIKEHQKMMLVETGVWQDLLGPDFLYTALGKYGLKPVLEAPGSLLSTIGDLLESAPLGDTMDDVGKKFQEWGEGWIEWLDADSIFTPGQVQARTVAIGALQRKFMEMYQGELVILGDPSIKPWDIFYLDDTHMLMNGTAQAGKVSHSLSLQTGFTTVIKPDLITSRTDGKGMRTGVMNGLILMGSALAVITSRKILMSRFVNSVARLGLKGAMKTGSGAGKVVSKFTPKAIKQARVAKWGAKQLKAGGTFLKEGKYLTKGLNLLKGNIVTLVLFGAVSEWVGTWFEKNTKYNNMIYIYPLWKMGEPFAAGITGGAHIIPGYMDKRFSDPYSTGKFVQPKMQFIDEKGREKKKKGGSSGSGGSGGGGSSASRNAIVEGGKSYEGKLKYVFGGTNIEGGTGDCSGFTSHVFKKFGNLNIGRTTGEQVKNGTQVENGKEQPGDLIFFKNTYNSTHIYGVSHVGIVIGDKKMVHLGDSGCQISDYTTKYWKEHFLMFRSYITDMGSIEVISGGAFRHPLERRTSITSGFGMRKGGMHKGTDLAPLGYAGKGQALETKVYAVADGKIIDMGLSDSMGNYVILATANKAGAPHVITYMHFRQHAPGVSKGMQVKAGTFIGYMGTTGESTGVHLHIEVNPGTSRNREGRIDPVPWFQQQGVQL
- Physico‐chemical
properties -
protein length: 1903 AA molecular weight: 212021,00000 Da isoelectric point: 6,57664 aromaticity: 0,11035 hydropathy: -0,42317
Domains
Domains [InterPro]
Taxonomy
Name | Taxonomy ID | Lineage | |
---|---|---|---|
Phage |
Bacillus phage 0305phi8-36 [NCBI] |
458639 | No lineage information |
Host |
Bacillus thuringiensis [NCBI] |
1428 | Bacteria > Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
ABS83705.1
[NCBI]
Genbank nucleotide accession
EF583821
[NCBI]
CDS location
range 60268 -> 65979
strand +
strand +
CDS
ATGGCATTAGCAACATCGGCTAATTATCATGGCCCCGAAAGTAATAGTTACCAAACCGGTTTCCGGGCAAAGCTCGCCGGTACAGATAAATATAATGCACTAATTGAAAAAGTAGCGGCAGAGACAGGATGTAACCCACTACTGATGAAAGTTCTGATGGCTATGGAATCGGGCGGACAGATGCTTGGAACAAACAGTCTCGGATACTTCGGTCTGATGCAGACGAACTACAAACGATTCAGTAAAGAACTTGGTGCAGCCTATCGAACTGACCCGTACCTACAAATCAAATCGGGTGCGATCGAGTTAATCGAGAAGAAAGATTACATCATGGGTGTAATCAAACGTGGTAACGGTAAAGGCCCGAAAGAAAAATGGACAGAAGAAATCTTCCAAACAGCATGGGCTTACAATGGATATACAATCAACGCAAGTTCTAAGATGATGGATGGATATAGATATGCACAAACCTTCCAAGCTGTATACGAAGGTATGGGCGGTAGCATGGCTGATAAAGTTTCGTCTGGTCAGATGGGCGGAGGCACTGGAACTGGCGGTGGTGGCGGTACTGGTGGAGAAGTAGCCAAGCCAAAGTATGACGGCGGTGCGCCGGACTGGAAAGGTGGAGGCGAACGAGGTATCGGCGGAACAGGGCCGGGAACAAGTACGAAGATGGATGACTATGAGAAGAATGATCCGAACAGCGCAATGTTCATCGGTATGTCACCTGAGGTAGAAGAGGATGATTATCCGAAGCATCGTTATGCAGGGAATCCATTCGTTCTTCGTATTGGTGATAGTCAGTTCTTTATTCCGCCAACGTCCATTAAAACAGATAAAACGTCGAGTATCGCAGAGCAACATATACTGCGTGCGAAGACACCTATCATGACGAAATCCGGTTACACTTTACATACGCTTTCGATTCAGATGTACTTCTTCGGTGAAGAACAAATCAATGGATACCCAATAGAGGGGCCGGGCGGGAAGACGTATTATATGGATGGTCTTCGTCCTTTACTAGCGCAGTTCTATAAGAATCCATTCTTACCGATTCGAAATGAGTTAATCAATAACCAGCACAACATTTATAACGTTGCCTTGCAGGACATTTCTTACACAGTTGACCCTGACTTCTCGGACGCAATTGTCGTAAATCTAACACTGATTGAATGTTGCGTGGAACCATACATCAACTACCCTGAGTTTGTATACGATAGAATCTTTGTCTATCCGTTATTCCGTTGGTGGTATCAACAACAGATGCTCGGCAAAGAGACAGAGCGTTATTCGGGTACATGGCTAAAGCCTGTTAACAAACATCTAAACGGGAAAGCATTCTTTAAAGTATTAGACCGTGACCTCATTGAACAAATGAAGGACGATGTAATGAAGAAGAATGACGGGAAGACAACGAAGGATGGTCAAACGATTAATCCGGAGTACGCACTTGCTCGCATGAAGATCGGCGACATACGAAAACTGATGGTCGAATGGGAGACTGGGGATGCGGTACTTACAAACGTCACGGTATCCGTTGGTAAAAACTTAACACCAATGTATATGGACGATTATGAGAAACCATTGTTCCAAGACTTAGGCGGAATGGTACGTGGCTTTACGCTAGAGTATTATTGTACAAACCGTGGAGAGTTAGAATCCTTCCAAGCACTTGCGTCACACTTAGAAGAGTTAAGCCGTGACTATCGTTTCCGATTCGTATCGGGTTACCTTGCGATTGATAATGAACTGATTAACTTAGCCGGGATCCAAAACGCTATGATAACAAATATCCAAACAAGTACCGTTGAAGGTATGTCGGATAACTATATCGTTCGTATCACATGTCGTGAGTTTAATGCGAACCAAAAGAATGAGGAGCGCTTAAACGGTATCAACTACACAATGAAAGAATCCCTAAACAAATACGGATTCGTAGATGCGATACCAACGAAGACAAACCGTAAGAGCGAGATTGCATACGAAGCAGAGGTTATGAAAATCCTAAACGACTTAGAGCTATATCCTGACCTTGAGTTACCTACATTCGAAGTAGCGAACAAAGCGGTAGCAGAAATCAATGCGTACCGTGAGAAGAGAGGTCAATCGAAGTTACCATACGATAAGCTCAAACAACCTGACAATGCGACGTGGTGTGACCCTGACTTCTATATGGCATATCCATCTACGAAGCAAGCCTATTCTCATATGCAGTTAGGAGATATGGGTGACAAGATTATCAACCAACTTCGTAACGGATCCTATGATGAAATTAAAAACATGAGTAACGACGAAGGCTTTTGGAAAACAGCAGATGACATTCAGCAGTTATTCACAAAAGGTGTTGTCGATAATTCATGGGATGCAGATGCAGACGGAAAACGTAAGTTACTCAAGAGTTCATCTGTCACATTTGAGAATGCCACAAAAGACTTCTTAGTGGATGCAGGACAAAGCTGGGGCGATGACAATAGTATTCCGGAAGACAAACAACTTGTTGAGATGATGACACATGACATGCTTAAATACAGTCACCGTGGTCGTATGACTCGTGCCTTCCCTTCTTACATGCTACTATTCGTAGATGAAGGACAATGGGTAGACGGCAAGCGATTATGGAATAACTATTATACCTATCATGCGATTCAGCAGATGGATATTATATCGGATAAAGAGAATCCAGTTGACCTTGCGTTTGTTACTTTATCAAACGTATACGGTACATTCGACTTCCAAGCGAAGATGAGTGACCCACGTAAGTATGCAAAAGAACCAATTGGATTACCGGGTCGTATCGGTGCATTAATCGACGACTTTAGTTTCACCGTTACCGCTAAAGTAATGGCAGAACGCTCGCAGTTAATGGAGCAAGTGAAACTGCGTGAAGGCGCACGAGTACATATGCGAATGGGTTATAGTGCAACAGCAGGTAATATGCCGATTGTATTTAACGGTAAGATTGCTTCTGTTAATGAAGGGGAAGTTATTCAGATGCTATGTCAAGGCGACGGCGCAGAACTGATTAACCAATACTACAGCACAGATCCAAAAGGTGATACGCCAAATGAACCGCACAATACATTCCAAGAGATGCTAACGAAACGTACAAGTAACTACTGGTTCACTGTATCAGAGGACTGGGAATTCGGTGACAACTATCTAAGTTACTACGGCATTGAACACTTTGGATTCGTAGAGAGTGAAGCAGGTGGTTTCTGGAAAGGTCTAGGCAAGAACTTCGAAATCCTAAAGAGCTTCCTAACGGACTCTGTACCGTTCTCTGCTTACGACATTATGAAGAACATCTATAAGGGTTCGAACTCACCAGTAACAGGACAGGAGACAGGATGGAGTCCATTTGACGGCGAGAAGAATATTAACATCGCTGCTTACAATAAGACGCCGTGGGATATTGGACAGATACTATCTATGTTCGTTCCGGAATTTATCTGCGCCCCTCATCATCACGGATTCCGTAGTACCTTGTTCTTCGGTATGCCACACTGGCCTGTGAAATATGAATACATTTTAAAAGACGGCAAAGAAGGAAACAGCTACACGGATTATCAAGAGAAGGTAAAACCATTCCAGCAGTTCCATATGATTACGTCGGGTAATGATATTATCGCCAACAAAATACAGGCAAGCAGTGAGAAGTTAAAACACATTGCAGTCGGTATGTATAAGATGGGTAGCGGGCAAGATGCTGCTGAGTCCTATACAGTATGGGCGGATAGAACGATCATTAAAGAACATCAAAAGATGATGCTTGTAGAAACTGGCGTATGGCAAGATTTACTTGGCCCCGACTTCTTATATACAGCGCTCGGTAAGTACGGATTAAAACCGGTACTCGAAGCGCCGGGTTCCTTACTATCTACAATCGGTGACTTATTAGAATCAGCACCGCTAGGAGATACAATGGATGATGTCGGTAAGAAGTTCCAAGAATGGGGCGAAGGATGGATTGAATGGTTAGATGCAGATAGTATTTTCACTCCTGGTCAAGTGCAAGCACGTACCGTTGCGATTGGTGCATTGCAACGTAAGTTCATGGAAATGTATCAGGGCGAACTTGTTATCCTCGGTGATCCATCTATTAAACCGTGGGACATCTTCTACTTAGACGATACACACATGCTGATGAACGGTACAGCGCAAGCAGGGAAAGTATCACACTCACTTAGTTTGCAGACAGGATTTACTACAGTTATCAAGCCGGACTTAATCACAAGTCGTACCGATGGTAAGGGAATGCGTACTGGTGTTATGAACGGACTCATCCTTATGGGTTCTGCACTTGCTGTTATTACATCACGTAAAATCCTTATGTCACGCTTCGTTAACTCGGTAGCGAGACTCGGATTAAAAGGTGCAATGAAAACAGGTAGTGGTGCAGGTAAGGTTGTATCAAAGTTCACACCAAAAGCTATCAAGCAAGCAAGGGTAGCGAAGTGGGGTGCGAAACAACTGAAGGCTGGCGGTACGTTCCTAAAAGAAGGAAAGTATTTAACCAAAGGTCTGAACTTGTTAAAAGGTAATATCGTAACCCTCGTACTATTCGGTGCTGTATCAGAGTGGGTTGGCACTTGGTTCGAGAAGAATACGAAGTACAACAATATGATTTATATCTATCCATTATGGAAGATGGGTGAGCCATTCGCCGCAGGTATCACTGGCGGAGCGCACATCATTCCGGGCTACATGGATAAACGATTCTCCGACCCTTACTCAACTGGCAAATTCGTTCAACCGAAAATGCAGTTCATAGATGAGAAAGGTAGGGAGAAAAAGAAGAAAGGTGGATCATCGGGTAGCGGTGGTTCCGGTGGCGGAGGAAGCAGTGCTTCTCGTAATGCTATCGTTGAGGGTGGTAAATCATATGAAGGGAAACTGAAATACGTATTCGGTGGTACAAACATCGAAGGCGGTACAGGTGACTGTTCAGGATTTACAAGCCACGTATTCAAGAAGTTCGGTAACCTAAACATTGGTCGTACGACAGGTGAACAGGTAAAGAACGGTACGCAAGTTGAGAACGGAAAAGAGCAACCGGGTGACTTAATCTTCTTCAAGAATACATATAACAGTACACACATCTACGGTGTATCTCACGTTGGTATCGTTATTGGTGATAAGAAGATGGTTCACTTAGGAGATAGCGGTTGTCAAATCTCCGACTATACAACGAAATATTGGAAAGAGCATTTCCTTATGTTCCGTTCGTATATTACTGACATGGGTTCTATCGAAGTCATTTCAGGCGGAGCGTTCCGTCATCCACTTGAACGTCGTACTTCTATTACATCAGGCTTCGGAATGCGTAAAGGCGGTATGCATAAAGGAACAGACTTAGCTCCACTCGGATATGCAGGTAAAGGGCAAGCACTTGAAACCAAAGTCTATGCCGTTGCAGATGGTAAGATTATAGATATGGGACTAAGTGATTCAATGGGTAACTATGTAATCCTTGCTACAGCAAATAAAGCAGGTGCGCCGCACGTTATTACGTACATGCACTTTAGACAACATGCTCCGGGCGTTTCAAAAGGTATGCAAGTGAAAGCCGGTACGTTCATTGGATACATGGGTACGACAGGTGAATCAACTGGTGTCCATCTTCACATTGAAGTCAATCCGGGAACGTCACGCAATAGGGAAGGAAGGATTGACCCTGTTCCGTGGTTCCAACAACAAGGGGTGCAGTTATAA
Gene Ontology
Description | Category | Evidence (source) | |
---|---|---|---|
GO:0001897 | symbiont-mediated cytolysis of host cell | Biological process | Inferred from Electronic Annotation (InterPro) |
GO:0008234 | cysteine-type peptidase activity | Molecular function | Inferred from Electronic Annotation (UniProt) |
GO:0042742 | defense response to bacterium | Biological process | Inferred from Electronic Annotation (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available.