Protein
- Protein accession
- A0AAE9CDJ2 [UniProt]
- Representative
- 2etws
- Source
- UniProt (cluster: phalp2_31329)
- Protein name
- Glucosaminidase domain-containing protein
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MLTITGKSVVSLENIVGFIKKNNPNFDENIAKEFLNVGEVYNIRGDVAICQSIIETGWFKYVGGTAVTPDQHNYCGLGVTSLGLKGNSFETIKDGVTAQMQHLLAYANNDDIPSGEKLIDPRFKYVTRGSANNSWEGLSRKWSSSATYGVDIIALYSKLLQYNQENKPSQPAIKESFIKADEFKTIQTLNVFLASKKTEDIFSVEVARDRDDFRYIVFHKNFK
- Physico‐chemical
properties -
protein length: 223 AA molecular weight: 25041,0 Da isoelectric point: 6,84 hydropathy: -0,33
Representative Protein Details
- Accession
- 2etws
- Protein name
- 2etws
- Sequence length
- 523 AA
- Molecular weight
- 57572,33740 Da
- Isoelectric point
- 7,68414
- Sequence
-
VNILGTSQISAERLTEYVRRNNPSFDGEIAKAFIEVGSVYGIRGDLAMCQSIVETDWFRFGNGTAVTPDQHNYCGLGVLTKGMKGHSFPTIKDGVRAQLQHLYAYASTKPLPSGEQLIDPRFTYVERGIAPTWNDLGGRWAADKKYGDIINGVYQPLVSSENIDKGVVTKMAKVVIDAGHGGKDSGCVSPDQTMMEKNIVLAIALKMRDILVAEYPGIEVKLIRDNDVFYELSQRARIANAWGADIFISIHCNGGGGFGFESYRMKGQSDAKTMKLQGCMHDALMEFYGKSNRKDRGQRDANYAVLRETNMTACLTENLFMDDPNNEIKKFQDPNYVYGVANAHAVGVARYFGIASNGNKPNVSVPSNGEEIGTLTVTGDNVRIRSGAGTNYDVAGKLGNSATRKVFAEVNGWLKINEGFVFYDSSYIRFDRKPKPTPKPEGGSGETFLRVIAGSYTDRDNANETIRQLKSYGIDAFLVPFEKDGTNYLRVVAGSYKERANAEEMVRELASHGIQGFLVAFTK
Other Proteins in cluster: phalp2_31329
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_30857
omqp
|
3 | 30,0% | 360 | 1.017E-55 |
| 2 |
phalp2_23542
7lm13
|
6 | 28,4% | 488 | 3.308E-35 |
| 3 |
phalp2_29416
1iTfP
|
5 | 24,9% | 369 | 2.540E-15 |
| 4 |
phalp2_39497
5jS1N
|
14 | 26,3% | 338 | 1.788E-06 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Bacillus phage vB_BanS_Sophrita [NCBI] |
2894790 | Sophritavirus > Sophritavirus sophrita |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
OK499991
[NCBI]
CDS location
range 145063 -> 145734
strand -
strand -
CDS
ATGTTAACTATAACTGGAAAGAGTGTTGTATCGTTAGAAAATATAGTCGGCTTTATAAAGAAAAATAATCCTAATTTCGATGAAAATATTGCGAAAGAATTTTTAAATGTTGGAGAGGTATATAATATCCGTGGAGATGTAGCAATATGTCAATCCATTATTGAAACAGGATGGTTTAAATATGTTGGAGGAACAGCAGTTACACCAGACCAACATAATTATTGTGGTTTAGGAGTTACTTCATTAGGTCTTAAAGGAAATAGTTTTGAAACTATTAAAGATGGAGTAACAGCACAAATGCAACATTTATTAGCTTATGCTAATAACGATGATATCCCTAGTGGAGAAAAACTTATTGACCCTCGTTTTAAGTATGTAACACGTGGTAGTGCTAATAATTCATGGGAAGGTTTAAGTCGAAAATGGTCATCGTCAGCTACATATGGAGTAGACATCATTGCTTTATATAGCAAACTATTACAATACAATCAAGAAAATAAACCTTCGCAACCTGCAATTAAAGAGTCATTTATTAAAGCTGATGAATTTAAAACAATCCAGACATTGAATGTATTTTTAGCATCTAAGAAAACAGAAGACATTTTCTCAGTAGAAGTTGCTCGTGACAGAGATGATTTTAGATATATAGTTTTTCATAAAAATTTTAAATAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0004040 | amidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(2etws)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50