Protein

Protein accession
6Bgp4 [EnVhog]
Representative
4IiQ
Source
EnVhog (cluster: phalp2_21013)
Protein name
6Bgp4
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKKLILIFAICASIAQGATLTDLIIALEKVESNCNPSAYNESEKALGSLQIRPIMLDDYNRIYGTNVKHQQMSNRLTSHNIAIGIFRHYEKYIKRNGEKLNAKHLAFIWNGGGSAWKRVANPKSDQKQKNLEIYWTKVSKHL
Physico‐chemical
properties
protein length:142 AA
molecular weight:16186,5 Da
isoelectric point:9,73
hydropathy:-0,41
Representative Protein Details
Accession
4IiQ
Protein name
4IiQ
Sequence length
168 AA
Molecular weight
19687,77390 Da
Isoelectric point
9,11088
Sequence
MKKMERIFQIALLVAFIFMCVKACTTSMFAKSIYELTDGNPPSLEEFQMLEKTIEAISMVESQNQCGAYNQAENALGCMQIRPIMLADYNRISGENRFMYEVRDRRVAYMIAKVIFLHYTKDIKNPTPQHYSFVWNGGGDAWKRVKRPKNDLKQKNLDSYWEKVKLYL
Other Proteins in cluster: phalp2_21013
Total (incl. this protein): 26 Avg length: 151,6 Avg pI: 8,58

Protein ID Length (AA) pI
4IiQ 168 9,11088
10Mrk 150 9,05653
1Oh4Z 153 9,45875
1P5i6 156 8,61228
1S7OO 160 8,56129
1VP0r 150 9,18566
1weiX 152 8,83160
28QqB 152 5,90008
3oIyA 144 9,45908
4RfA1 152 8,81510
5p14Z 144 9,43232
6B5HL 142 9,72552
6FOHf 152 6,89908
6G3vK 141 9,03623
6NODZ 142 9,57518
7Epua 156 7,89030
82mvp 165 6,18553
88jLr 160 7,84472
8AD4l 154 6,58613
8AOpY 150 9,20629
8BEPB 144 9,45908
8F07n 144 9,64803
8etcO 158 5,77566
8jVNh 168 9,22873
RrAd 143 9,89972
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3273
2zg0L
96 38,6% 119 1.284E-32
2 phalp2_23582
1f8D
174 37,9% 116 6.026E-20
3 phalp2_23047
4ccMQ
55 32,2% 121 3.930E-19
4 phalp2_20516
4eE8g
949 35,3% 116 1.218E-17
5 phalp2_46
7Exor
250 31,4% 124 2.751E-16
6 phalp2_3524
4udWf
232 30,3% 112 2.922E-14
7 phalp2_9165
6AJjn
149 31,8% 138 8.860E-13
8 phalp2_11808
8aeor
7 29,4% 129 5.674E-12
9 phalp2_11904
2rHNT
22 32,5% 123 1.053E-11
10 phalp2_33104
4JGEd
92 31,0% 132 1.053E-11

Domains

Domains
Unannotated
Representative sequence (used for alignment): 4IiQ (168 AA)
Member sequence: 6Bgp4 (142 AA)
1 168 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4IiQ) rather than this protein.
PDB ID
4IiQ
Method AlphaFoldv2
Resolution 93.90
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50