Protein
- Protein accession
- A4JX20 [UniProt]
- Representative
- 8jI3j
- Source
- UniProt (cluster: phalp2_12784)
- Protein name
- Endolysin
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MGVVGAAAAALLFSVVPKFEGVKLVGYLDPVGIPTKCMGDTRDVVVGRAYSEAECRSSLETQLIAHAEPVLRCTPGLRGRPYQLAAAVSFAYNVGAHAYCNSTTAKRFNAGDLRGACRAINESDSGRRQWVFANCRTVIDPKTKKPLPVCDTLPGLVKRRAEERAICERGL
- Physico‐chemical
properties -
protein length: 171 AA molecular weight: 18397,1 Da isoelectric point: 9,24 hydropathy: -0,07
Representative Protein Details
- Accession
- 8jI3j
- Protein name
- 8jI3j
- Sequence length
- 200 AA
- Molecular weight
- 22238,24170 Da
- Isoelectric point
- 9,24317
- Sequence
-
MRLKFDWGRNSKLGVDSSSNNVKGVKFKALVGAATAAILIGTFGDNATTGFTERHEGMVLRGYKDPIGIPTKCAGDTYNVVLDKVYTYGECIDSLDTQLYNHALPVLKCLPSIRDNVHFLAAAIDHNYHLGNFCSSSMMKAHRAGDDKTACVRYNENSLGKPQYIYVKDKFIDGKWTYKSLNGLIKRAAERRSMCERGLK
Other Proteins in cluster: phalp2_12784
| Total (incl. this protein): 17 | Avg length: 175,6 | Avg pI: 9,21 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 8jI3j | 200 | 9,24317 |
| 1lKRD | 189 | 9,29945 |
| 34Bm | 194 | 9,08464 |
| 4fIRR | 164 | 9,11269 |
| 4fwOn | 184 | 9,00038 |
| 4gGOz | 174 | 9,31783 |
| 7QagF | 186 | 9,01218 |
| 8cGuj | 195 | 9,01018 |
| Q6JIK8 | 163 | 9,41008 |
| A0A0S0N859 | 186 | 9,01218 |
| A0A514CTT7 | 156 | 9,22989 |
| A0A514CW31 | 156 | 9,22989 |
| A0A5B9N6W7 | 186 | 9,13609 |
| Q8W6S6 | 134 | 9,68123 |
| A0A9X9NYL1 | 186 | 9,01231 |
| A0AA96KH54 | 162 | 9,46526 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_11002
4Mz75
|
307 | 33,5% | 143 | 2.649E-39 |
| 2 |
phalp2_35864
4Lj4w
|
95 | 31,4% | 181 | 2.363E-35 |
| 3 |
phalp2_4305
8aoFg
|
2477 | 28,7% | 153 | 9.948E-31 |
| 4 |
phalp2_7028
8i7gz
|
89 | 26,2% | 145 | 7.617E-26 |
| 5 |
phalp2_9410
8Hj1q
|
5109 | 31,0% | 145 | 3.621E-25 |
| 6 |
phalp2_26525
83EoO
|
66 | 26,7% | 172 | 2.637E-20 |
| 7 |
phalp2_21319
1KsyR
|
184 | 30,2% | 149 | 1.694E-19 |
| 8 |
phalp2_30973
1fTML
|
10 | 25,0% | 168 | 5.099E-18 |
| 9 |
phalp2_3556
4E0oc
|
53 | 28,8% | 180 | 4.503E-15 |
| 10 |
phalp2_34995
4LMnK
|
728 | 23,6% | 186 | 4.503E-15 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Stanholtvirus sv6442 [NCBI] |
431891 | Stanholtvirus > |
| Host |
Burkholderia pseudomallei [NCBI] |
28450 | Proteobacteria > Betaproteobacteria > Burkholderiales > Burkholderiaceae > Burkholderia > pseudomallei group |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
CP000625
[NCBI]
CDS location
range 21735 -> 22250
strand +
strand +
CDS
GTGGGTGTTGTGGGGGCTGCTGCGGCAGCCCTTCTTTTTTCCGTCGTCCCGAAGTTCGAGGGCGTCAAACTGGTCGGGTATCTCGATCCGGTCGGCATCCCGACAAAGTGCATGGGCGACACGCGCGATGTCGTCGTCGGCCGGGCGTACAGCGAGGCCGAGTGTCGCTCGTCACTCGAAACGCAACTGATCGCCCACGCCGAACCCGTGCTGCGTTGCACGCCGGGGCTGAGAGGTCGTCCGTATCAGCTCGCGGCGGCCGTGAGCTTTGCATACAACGTCGGCGCGCATGCCTACTGCAACAGCACGACGGCGAAGCGCTTCAACGCGGGCGACCTGCGCGGCGCGTGCCGCGCGATCAACGAATCCGATAGCGGTCGGCGGCAGTGGGTCTTTGCGAACTGCCGGACCGTTATCGACCCGAAAACGAAAAAGCCTCTGCCGGTATGCGACACGCTACCGGGTCTGGTGAAGCGGCGTGCGGAAGAGCGCGCGATCTGCGAGCGGGGGCTCTGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0003796 | lysozyme activity | molecular function | None (UniProt) |
| GO:0009253 | peptidoglycan catabolic process | biological process | None (UniProt) |
| GO:0016998 | cell wall macromolecule catabolic process | biological process | None (UniProt) |
| GO:0030430 | host cell cytoplasm | cellular component | None (UniProt) |
| GO:0042742 | defense response to bacterium | biological process | None (UniProt) |
| GO:0044659 | viral release from host cell by cytolysis | biological process | None (UniProt) |
Enzymatic activity
| EC Number | Entry Name | Reaction Catalyzed | Classification | Evidence | Source |
|---|---|---|---|---|---|
| 3.2.1.17 | None | Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. |
match to sequence model evidence used in automatic assertion
ECO:ECO:0000256 |
ARBA:ARBA00000632 |
Tertiary structure
PDB ID
upi0000f597cf_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(8jI3j)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50