Protein

Protein accession
A0AAE9CDJ2 [UniProt]
Representative
2etws
Source
UniProt (cluster: phalp2_31329)
Protein name
Glucosaminidase domain-containing protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MLTITGKSVVSLENIVGFIKKNNPNFDENIAKEFLNVGEVYNIRGDVAICQSIIETGWFKYVGGTAVTPDQHNYCGLGVTSLGLKGNSFETIKDGVTAQMQHLLAYANNDDIPSGEKLIDPRFKYVTRGSANNSWEGLSRKWSSSATYGVDIIALYSKLLQYNQENKPSQPAIKESFIKADEFKTIQTLNVFLASKKTEDIFSVEVARDRDDFRYIVFHKNFK
Physico‐chemical
properties
protein length:223 AA
molecular weight:25041,0 Da
isoelectric point:6,84
hydropathy:-0,33
Representative Protein Details
Accession
2etws
Protein name
2etws
Sequence length
523 AA
Molecular weight
57572,33740 Da
Isoelectric point
7,68414
Sequence
VNILGTSQISAERLTEYVRRNNPSFDGEIAKAFIEVGSVYGIRGDLAMCQSIVETDWFRFGNGTAVTPDQHNYCGLGVLTKGMKGHSFPTIKDGVRAQLQHLYAYASTKPLPSGEQLIDPRFTYVERGIAPTWNDLGGRWAADKKYGDIINGVYQPLVSSENIDKGVVTKMAKVVIDAGHGGKDSGCVSPDQTMMEKNIVLAIALKMRDILVAEYPGIEVKLIRDNDVFYELSQRARIANAWGADIFISIHCNGGGGFGFESYRMKGQSDAKTMKLQGCMHDALMEFYGKSNRKDRGQRDANYAVLRETNMTACLTENLFMDDPNNEIKKFQDPNYVYGVANAHAVGVARYFGIASNGNKPNVSVPSNGEEIGTLTVTGDNVRIRSGAGTNYDVAGKLGNSATRKVFAEVNGWLKINEGFVFYDSSYIRFDRKPKPTPKPEGGSGETFLRVIAGSYTDRDNANETIRQLKSYGIDAFLVPFEKDGTNYLRVVAGSYKERANAEEMVRELASHGIQGFLVAFTK
Other Proteins in cluster: phalp2_31329
Total (incl. this protein): 8 Avg length: 448,9 Avg pI: 8,48

Protein ID Length (AA) pI
2etws 523 7,68414
138WX 434 8,44641
13bLW 432 8,29736
2msOv 440 9,17400
7qJ0g 518 8,54272
82z4N 472 9,42981
gpQE 549 9,42265
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_30857
omqp
3 30,0% 360 1.017E-55
2 phalp2_23542
7lm13
6 28,4% 488 3.308E-35
3 phalp2_29416
1iTfP
5 24,9% 369 2.540E-15
4 phalp2_39497
5jS1N
14 26,3% 338 1.788E-06

Domains

Domains [InterPro]
Representative sequence (used for alignment): 2etws (523 AA)
Member sequence: A0AAE9CDJ2 (223 AA)
1 523 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01520, PF01832, PF05036, PF08239

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage vB_BanS_Sophrita
[NCBI]
2894790 Sophritavirus > Sophritavirus sophrita
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
OK499991 [NCBI]
CDS location
range 145063 -> 145734
strand -
CDS
ATGTTAACTATAACTGGAAAGAGTGTTGTATCGTTAGAAAATATAGTCGGCTTTATAAAGAAAAATAATCCTAATTTCGATGAAAATATTGCGAAAGAATTTTTAAATGTTGGAGAGGTATATAATATCCGTGGAGATGTAGCAATATGTCAATCCATTATTGAAACAGGATGGTTTAAATATGTTGGAGGAACAGCAGTTACACCAGACCAACATAATTATTGTGGTTTAGGAGTTACTTCATTAGGTCTTAAAGGAAATAGTTTTGAAACTATTAAAGATGGAGTAACAGCACAAATGCAACATTTATTAGCTTATGCTAATAACGATGATATCCCTAGTGGAGAAAAACTTATTGACCCTCGTTTTAAGTATGTAACACGTGGTAGTGCTAATAATTCATGGGAAGGTTTAAGTCGAAAATGGTCATCGTCAGCTACATATGGAGTAGACATCATTGCTTTATATAGCAAACTATTACAATACAATCAAGAAAATAAACCTTCGCAACCTGCAATTAAAGAGTCATTTATTAAAGCTGATGAATTTAAAACAATCCAGACATTGAATGTATTTTTAGCATCTAAGAAAACAGAAGACATTTTCTCAGTAGAAGTTGCTCGTGACAGAGATGATTTTAGATATATAGTTTTTCATAAAAATTTTAAATAA

Gene Ontology

Description Category Evidence (source)
GO:0004040 amidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2etws) rather than this protein.
PDB ID
2etws
Method AlphaFoldv2
Resolution 87.58
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50