Protein

Protein accession
A0A023NGE0 [UniProt]
Representative
4GkQr
Source
UniProt (cluster: phalp2_4675)
Protein name
Glycoside hydrolase family 19 catalytic domain-containing protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTDKTPSPDFLIPMHEHIDKIILAAAPRINIEEWRDPIITSTREWNIYDERFCMFLAQIAHESNDMNRLSENLNYSVEALLSLFGRHRISEADADKYGRKAGQPANQQMLANILYGGDWGRRNLGNIHPNDGWDYRGQGPKQITGRYNYQVCGEAIGEDLVNNPSLLSTDKYVGMKAACWYFSVRTKGTDIVQVTRQINGGTIGLEDRRIRFERALSEYNKLKALAYNS
Physico-chemical properties
protein length: 229 AA
molecular weight: 26175,2 Da
isoelectric point: 6,53
hydropathy: -0,59
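The page does not say how these values were computed. As a minimal sketch, assume the hydropathy figure is a GRAVY score (the mean Kyte-Doolittle value per residue) and that the molecular weight is the sum of average residue masses plus one water molecule. The function names `gravy` and `molecular_weight` are illustrative and not part of any PhaLP tooling. Results may differ slightly from the listed values depending on the mass table used.

```python
# Kyte-Doolittle hydropathy value for each of the 20 standard residues.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (free amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the free N- and C-termini


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


def molecular_weight(sequence: str) -> float:
    """Average molecular weight in Da of an unmodified linear peptide."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER_MASS


# Illustration on the first ten residues of the protein sequence above.
peptide = "MTDKTPSPDF"
print(len(peptide), round(molecular_weight(peptide), 2), round(gravy(peptide), 2))
```

Passing the full 229-residue sequence instead of the ten-residue prefix gives values that can be compared with the properties listed above.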
Representative Protein Details
Accession
4GkQr
Protein name
4GkQr
Sequence length
141 AA
Molecular weight
15440,26500 Da
Isoelectric point
6,27846
Sequence
MTPEIIKSAFPKASDAIIDAILEYAPRYGIDAKQMPMFLAQAGHESGEFTVFCESLNYSADALVKIFSRHRISEADAEKYGRTSGHAANQEMIANLIYGGAWGAKNLGNTQPGDGWMFRGRGIFQLTGRANYVAFVKDSPN
Other Proteins in cluster: phalp2_4675
Total (incl. this protein): 23; Avg length: 211,4; Avg pI: 8,39

Protein ID Length (AA) pI
4GkQr 141 6,27846
16aWN 140 9,64210
1KxgC 217 9,38481
1LBMi 221 9,28617
2Zueb 108 6,03098
3fZ6a 286 9,25465
4NOD1 214 6,75227
4NP8L 216 5,79419
4emA6 222 8,92599
4fS2W 181 9,56100
4g7Jf 239 9,06504
4o01f 173 10,30780
56FWf 214 6,75267
6zjn8 214 6,75267
89KzH 238 9,12623
8iQjP 241 9,06408
A0A1P8VVH0 291 6,00023
A0A2R3UA80 215 9,94014
A0A6J5M118 221 9,68884
A0A9E7MZJ3 213 9,49279
A0AAF0I9X9 215 9,49279
A0AAV2PF34 213 9,80604
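The summary line for the cluster can be cross-checked against the rows of the table. The sketch below parses the decimal-comma values, adds this protein (A0A023NGE0, 229 AA, pI 6,53, taken from the properties listed earlier) and recomputes the member count and the two averages. The helper names `parse_rows` and `summarize` are hypothetical.

```python
# Rows copied from the table above: protein ID, length (AA), pI (decimal comma).
ROWS = """\
4GkQr 141 6,27846
16aWN 140 9,64210
1KxgC 217 9,38481
1LBMi 221 9,28617
2Zueb 108 6,03098
3fZ6a 286 9,25465
4NOD1 214 6,75227
4NP8L 216 5,79419
4emA6 222 8,92599
4fS2W 181 9,56100
4g7Jf 239 9,06504
4o01f 173 10,30780
56FWf 214 6,75267
6zjn8 214 6,75267
89KzH 238 9,12623
8iQjP 241 9,06408
A0A1P8VVH0 291 6,00023
A0A2R3UA80 215 9,94014
A0A6J5M118 221 9,68884
A0A9E7MZJ3 213 9,49279
A0AAF0I9X9 215 9,49279
A0AAV2PF34 213 9,80604
"""

# This protein is counted in the total but not listed in the table.
THIS_PROTEIN = ("A0A023NGE0", 229, 6.53)


def parse_rows(text):
    """Turn 'ID length pI' lines into (id, int, float) tuples."""
    records = []
    for line in text.splitlines():
        if not line.strip():
            continue
        protein_id, length, pi = line.split()
        records.append((protein_id, int(length), float(pi.replace(",", "."))))
    return records


def summarize(records):
    """Return (member count, mean length, mean pI)."""
    n = len(records)
    return n, sum(r[1] for r in records) / n, sum(r[2] for r in records) / n


members = parse_rows(ROWS) + [THIS_PROTEIN]
total, avg_length, avg_pi = summarize(members)
print(total, round(avg_length, 1), round(avg_pi, 2))
```

With this protein included, the recomputed count and averages agree with the summary line (23 members, average length 211,4, average pI 8,39).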
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26839 (3X8d3) 3 52,2% 111 2.876E-39
2 phalp2_5234 (8cPFI) 3482 43,2% 141 2.712E-35
3 phalp2_9408 (8GIg1) 520 39,4% 137 6.138E-27
4 phalp2_39815 (Hj9s) 55 37,8% 119 2.693E-25
5 phalp2_36471 (1bFy4) 16 40,2% 134 3.690E-25
6 phalp2_12287 (7e8ZJ) 95 36,8% 122 2.215E-23
7 phalp2_29588 (83X5u) 30 40,3% 109 2.007E-22
8 phalp2_14082 (8g4v0) 2 34,7% 92 4.225E-20
9 phalp2_24890 (f0po) 5 36,3% 143 5.241E-16
10 phalp2_31323 (2aR1M) 3 32,7% 116 4.225E-14

Domains

Domains [InterPro]
Unannotated
Representative sequence (used for alignment): 4GkQr (141 AA)
Member sequence: A0A023NGE0 (229 AA)
Axis: positions 1 to 141 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Nitrincola phage 1M3-16 [NCBI] 1472912 No lineage information
Host Nitrincola [NCBI] 267849 Proteobacteria > Gammaproteobacteria > Oceanospirillales > Oceanospirillaceae

Coding sequence (CDS)

CDS Source ID
CDS Source
KJ534580 [NCBI]
CDS location
range 54260 -> 54949
strand -
CDS
ATGACTGACAAAACCCCTTCCCCAGATTTCCTGATCCCAATGCACGAACATATTGATAAGATTATTCTTGCAGCAGCCCCACGTATCAACATTGAAGAATGGCGTGACCCAATCATCACGTCTACACGTGAGTGGAATATTTACGATGAACGTTTCTGTATGTTCCTTGCTCAGATAGCCCACGAATCAAATGATATGAACAGACTATCAGAGAATCTGAATTACTCTGTAGAGGCTTTACTATCATTGTTTGGGAGACATCGTATCTCGGAAGCTGATGCCGATAAATACGGAAGGAAGGCTGGGCAACCAGCCAACCAACAAATGCTTGCCAACATCCTTTACGGTGGAGATTGGGGCAGACGTAATCTAGGAAACATTCACCCCAATGATGGTTGGGATTACCGTGGACAAGGCCCAAAGCAGATCACAGGACGATACAATTATCAAGTATGCGGTGAAGCTATTGGAGAAGATTTAGTCAATAACCCATCTCTGCTATCTACAGATAAATATGTAGGAATGAAAGCCGCCTGCTGGTATTTTTCTGTAAGGACTAAGGGAACAGATATTGTTCAAGTTACACGTCAGATTAATGGTGGTACAATAGGACTGGAAGATAGACGCATACGATTTGAACGTGCGTTATCTGAATATAACAAACTGAAGGCACTTGCCTACAATAGCTGA
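The CDS coordinates and the protein length can be checked against each other: the range 54260 to 54949 spans 690 nucleotides, which is 229 codons plus one stop codon. The sketch below does this arithmetic and translates the first and last codons of the CDS with the standard genetic code. It assumes the CDS is shown in coding orientation (it begins with ATG although the strand is listed as -). The function names `translate` and `cds_length` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive coordinate range."""
    return end - start + 1


# Coordinates and protein length as listed on this page.
n_nucleotides = cds_length(54260, 54949)
print(n_nucleotides, n_nucleotides == 3 * (229 + 1))

# First ten and last seven codons of the CDS shown above.
print(translate("ATGACTGACAAAACCCCTTCCCCAGATTTC"))
print(translate("GCACTTGCCTACAATAGCTGA"))
```

The two translated fragments correspond to the start (MTDKTPSPDF) and the end (ALAYNS, followed by the stop codon) of the protein sequence listed at the top of the page.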

Gene Ontology

GO ID Description Category Evidence (source)
GO:0004568 chitinase activity molecular function None (UniProt)
GO:0006032 chitin catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4GkQr) rather than this protein.
PDB ID 4GkQr
Method AlphaFold v2
Resolution 95.21
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
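The confidence bands above can be expressed as a small lookup. The sketch below is illustrative only: the page uses strict inequalities and leaves the exact boundary values (90, 70, 50) unassigned, so placing them in the lower band is an assumption, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence label used above.

    Boundary values (exactly 90, 70 or 50) fall into the lower band;
    this is an assumption, since the legend does not define them.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.21, 80.0, 60.0, 10.0):
    print(value, plddt_band(value))
```

If the figure of 95.21 listed for the representative model is a mean pLDDT (an assumption, since the page labels it Resolution), it would fall into the Very high band.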