Protein
- Protein accession
- A0AA47KXM1 [UniProt]
- Representative
- 4HBgJ
- Source
- UniProt (cluster: phalp2_14460)
- Protein name
- Minor tail protein
- Lysin probability
- 99%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MARHYKGRHRKQTDSNAKRIAATAAGLGVAGAGALMAAPAANAHDWSGVAMCESSGNWAINTGNGFYGGLQFTQSTWDAFKPAGAAARADLATQADQIAAAEATLLAQGIGAWPVCGAHLGWGGTTPGSEAPAPAPEPAPPAPPVATTCAWVQPVTGTKSQDFHGGHNGTDIAAPTGTPIYAATSGTIDLAGFNNDPGGYGNYIQQTADNGATIQYGHVSEIYVSAGEYVYAGDKIGAVGNAGSSTGPHLHLRIDGVDPEAYLINQGVDINWSAPAGCNAAPAPAPAPAPEPAPAPAPVADPVVTVDVSLTPDGASIAVVVSGDTLSHIAEAHGTDWPSVWKLNESIVPNPDLIFPGQELRLN
- Physico‐chemical
properties -
protein length: 363 AA molecular weight: 36668,1 Da isoelectric point: 4,78 hydropathy: -0,12
Representative Protein Details
- Accession
- 4HBgJ
- Protein name
- 4HBgJ
- Sequence length
- 331 AA
- Molecular weight
- 34070,87930 Da
- Isoelectric point
- 5,64300
- Sequence
-
MSHGRHAKPPEHAPLARSAVVGTLALAATGTVLAVEEPAATASGLDWGKVAQCESSGNWATNTGNGYYGGLQFEQATWAAYGGLAYASRADLATEAQQVAVAERVLVGQGSGAWPVCSRGAAPVAAAAPAPVQAPAAVPVSTEATHTYVVQPGDWLSAIGPKVGEDWHQLYADNEKVIGGNPDMIFPNEVLQVHPNGAVTVHPNTVAETPKHSAPASGAALVADAQHYLGVRYVYGGANPAKGFDCSGLVQWVAAEQGISLPRTAAAQSTVGERIASLNEAQPGDLLFFFAPVSHVAMYIGGGKMITAPQPGESVMVQSVWATPTVIRRIA
Other Proteins in cluster: phalp2_14460
| Total (incl. this protein): 2 | Avg length: 347,0 | Avg pI: 5,21 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 4HBgJ | 331 | 5,64300 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_37839
4Jmb8
|
51 | 41,7% | 302 | 1.507E-48 |
| 2 |
phalp2_3417
3R9PN
|
1 | 34,1% | 354 | 3.231E-36 |
| 3 |
phalp2_13707
6wZHy
|
3 | 35,8% | 229 | 1.687E-18 |
| 4 |
phalp2_39637
6Kn72
|
20 | 32,6% | 202 | 1.332E-17 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Rhodococcus phage Jflix2 [NCBI] |
3049372 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
OQ938594
[NCBI]
CDS location
range 2095 -> 3186
strand +
strand +
CDS
ATGGCAAGGCACTACAAGGGACGGCATCGGAAGCAGACAGACAGCAACGCCAAGCGCATCGCAGCGACGGCGGCCGGTCTGGGCGTAGCGGGCGCTGGCGCGCTCATGGCGGCCCCGGCTGCCAACGCTCACGACTGGTCCGGCGTGGCCATGTGTGAGTCGTCTGGCAACTGGGCGATCAACACAGGCAACGGCTTCTACGGTGGCCTGCAGTTCACGCAGAGCACCTGGGACGCCTTCAAGCCTGCGGGCGCAGCGGCGCGAGCTGACCTGGCCACACAGGCCGATCAGATCGCAGCCGCAGAGGCAACCCTGCTCGCGCAGGGCATCGGCGCATGGCCGGTGTGCGGCGCGCATCTCGGCTGGGGTGGAACGACTCCCGGCTCCGAGGCTCCGGCCCCGGCTCCCGAGCCCGCACCCCCCGCTCCGCCGGTCGCAACGACCTGCGCGTGGGTCCAGCCGGTCACCGGCACGAAGTCGCAGGACTTCCACGGCGGCCACAACGGCACCGACATCGCGGCCCCGACCGGTACGCCGATCTACGCAGCGACGTCCGGCACCATCGATCTCGCTGGGTTCAACAACGACCCTGGTGGGTACGGCAACTACATCCAGCAGACGGCCGACAACGGCGCGACTATCCAGTACGGCCACGTCTCCGAGATCTACGTCAGCGCAGGCGAATACGTCTACGCGGGCGACAAGATCGGAGCCGTCGGCAACGCTGGCTCGTCCACCGGCCCGCACCTGCACCTGCGTATCGACGGCGTCGACCCGGAGGCGTACCTGATCAACCAGGGCGTCGACATCAACTGGTCGGCACCGGCAGGCTGCAACGCTGCACCGGCCCCCGCTCCCGCACCGGCTCCCGAGCCCGCCCCGGCCCCCGCTCCCGTAGCGGATCCGGTCGTGACGGTCGACGTGAGCCTGACCCCCGACGGCGCGTCCATCGCGGTCGTGGTGTCCGGCGACACGCTGTCCCACATCGCTGAGGCGCACGGGACGGACTGGCCGTCGGTCTGGAAGCTGAACGAGTCCATCGTTCCCAACCCCGACCTCATCTTCCCCGGACAGGAACTGAGGCTCAACTGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0004222 | metalloendopeptidase activity | molecular function | None (UniProt) |
| GO:0031640 | killing of cells of another organism | biological process | None (UniProt) |
| GO:0042742 | defense response to bacterium | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(4HBgJ)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50