Protein

Protein accession
3ggKK [EnVhog]
Representative
2363R
Source
EnVhog (cluster: phalp2_20113)
Protein name
3ggKK
Lysin probability
99%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MTDFNFSPGAGGPKPYNPADYGLPVWRDEDHLPPKETETAEQEPQTDAPAPVSAEPAAPGAIQRSVEWQLEKAADNRHGYSQADRWGPDYDCSSFVITAWEQAGVPVKQAGATYTGNMYEAFIRCGFEDVTAVCDLTTGAGLKYGDALLNHQRHTATYIGNGQLVHARSSEGNSLQGDQSGNEIRVQPYYNGPWDRVLRYTGANAGSIPAPEKRPILKKGMIGEDVRELQDMLVKLGYDTGGTDGKYGSKTFIAVAKLQEENHITPVDGEAGPVTMAVIDSLLEELPAVAESTPALHDTLAALAAYLQTEEFQQAFNDYIERSKEK
Physico-chemical properties
Protein length: 326 AA
Molecular weight: 35453,8 Da
Isoelectric point: 4,66
Hydropathy: -0,57
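
The hydropathy value is presumably a GRAVY score, that is, the mean Kyte-Doolittle hydropathy per residue; the page does not name the scale, so this is an assumption. A minimal sketch of how length and GRAVY could be recomputed from a sequence follows. The function name `gravy` is hypothetical, and only the first 12 residues of the 3ggKK sequence are used as input.

```python
# Kyte-Doolittle hydropathy scale (assumed; the page does not state which scale it uses).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy per residue (GRAVY)."""
    if not sequence:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence contains a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# First 12 residues of the 3ggKK sequence, used only as a short illustration.
fragment = "MTDFNFSPGAGG"
print(len(fragment), round(gravy(fragment), 2))
```

Passing the full protein sequence instead of the fragment should give values comparable to the length and hydropathy listed above, provided the scale assumption holds.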
Representative Protein Details
Accession
2363R
Protein name
2363R
Sequence length
332 AA
Molecular weight
35674,21830 Da
Isoelectric point
5,25405
Sequence
MTKIEAAVQWAEKMAADNSHGYSQADRWGPDYDCSSFVIAAWEAAGVRVKSAGASYTGNMRGAFLAMGFVDVTYECGLSTGYGIQRGDVLLNYSAHTCLAAGNGKVINCRTDEGNPQAGDQSGNEIRVQSYWNYPWDCVLRYKKDDNATGSTGSVSGSAPAEDDGTLHYGAKGEAVKAMQQKLIDLGYSCGSCGADGVYGYDTISAVRKFQKDHGLPVSAGANKKTQEAIQAAKKEAPAIATEPEKEEQSAPVSAPVEEHDWNPEPLKNGVKYSRAIVVLQALLNVRGFNCGNPDGYYGVMTEAAVNHAKRYYGMPQDGECSTELWDKLLGR
Other Proteins in cluster: phalp2_20113
Total (incl. this protein): 12; Avg length: 285,8 AA; Avg pI: 5,18

Protein ID Length (AA) pI
2363R 332 5,25405
3TPEt 315 4,56482
3WEWK 326 4,66412
3ZFLo 237 6,49956
3fQB1 191 5,95766
406ns 274 5,37585
40CtJ 312 4,78007
40h1F 247 5,04346
4k6kc 315 4,79024
4kn4p 272 5,88121
67ZM 283 4,73818
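
The cluster averages can be checked against the listed members. The sketch below assumes that the twelfth member is this protein (3ggKK, 326 AA, pI 4,66 as given in its physico-chemical properties) and that the averages are plain arithmetic means; the helper name `parse_decimal` is hypothetical.

```python
# Rows copied from the cluster table: protein ID, length (AA), pI with a decimal comma.
ROWS = """\
2363R 332 5,25405
3TPEt 315 4,56482
3WEWK 326 4,66412
3ZFLo 237 6,49956
3fQB1 191 5,95766
406ns 274 5,37585
40CtJ 312 4,78007
40h1F 247 5,04346
4k6kc 315 4,79024
4kn4p 272 5,88121
67ZM 283 4,73818"""


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma number such as '5,25405' to a float."""
    return float(text.replace(",", "."))


members = [
    (protein_id, int(length), parse_decimal(pi))
    for protein_id, length, pi in (line.split() for line in ROWS.splitlines())
]

# Assumption: the 12th cluster member is this protein, which the table does not list.
members.append(("3ggKK", 326, parse_decimal("4,66")))

avg_length = sum(m[1] for m in members) / len(members)
avg_pi = sum(m[2] for m in members) / len(members)
print(len(members), round(avg_length, 1), round(avg_pi, 2))  # 12 285.8 5.18
```

Under these assumptions the result agrees with the reported totals (12 members, average length 285,8, average pI 5,18).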
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36427 (10262) 294 52,6% 241 8.026E-83
2 phalp2_17255 (3VKDd) 14 42,2% 246 7.740E-66
3 phalp2_22955 (3fPKW) 30 34,3% 317 6.593E-62
4 phalp2_20155 (41eQz) 91 33,8% 242 2.062E-33
5 phalp2_33777 (19alW) 272 26,5% 332 4.157E-22
6 phalp2_32958 (3ZDZv) 229 28,8% 208 2.901E-19
7 phalp2_9688 (23BCv) 75 28,1% 217 3.901E-19
8 phalp2_6753 (406qp) 2 26,5% 226 4.901E-14
9 phalp2_26724 (2YH31) 2 24,5% 265 1.497E-07
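
To work with the pHMM hits programmatically, the rows can be parsed into records and filtered. This is a minimal sketch: the cut-offs `MIN_IDENTITY` and `MAX_EVALUE` are hypothetical and do not come from the page, and the meaning of the protein ID listed with each cluster is not stated there, so it is parsed but not used.

```python
# Rows transcribed from the table, one hit per line:
# rank, cluster, protein ID listed with the cluster, members, identity, alignment length, E-value.
ROWS = """\
1 phalp2_36427 10262 294 52,6% 241 8.026E-83
2 phalp2_17255 3VKDd 14 42,2% 246 7.740E-66
3 phalp2_22955 3fPKW 30 34,3% 317 6.593E-62
4 phalp2_20155 41eQz 91 33,8% 242 2.062E-33
5 phalp2_33777 19alW 272 26,5% 332 4.157E-22
6 phalp2_32958 3ZDZv 229 28,8% 208 2.901E-19
7 phalp2_9688 23BCv 75 28,1% 217 3.901E-19
8 phalp2_6753 406qp 2 26,5% 226 4.901E-14
9 phalp2_26724 2YH31 2 24,5% 265 1.497E-07"""


def parse_hit(line: str) -> dict:
    """Split one table row into typed fields; identity uses a decimal comma."""
    rank, cluster, listed_id, members, identity, aln_len, evalue = line.split()
    return {
        "cluster": cluster,
        "members": int(members),
        "identity": float(identity.rstrip("%").replace(",", ".")),
        "alignment_length": int(aln_len),
        "evalue": float(evalue),
    }


hits = [parse_hit(line) for line in ROWS.splitlines()]

MIN_IDENTITY = 30.0  # hypothetical cut-off, in percent
MAX_EVALUE = 1e-30   # hypothetical cut-off

close = [
    h["cluster"]
    for h in hits
    if h["identity"] >= MIN_IDENTITY and h["evalue"] <= MAX_EVALUE
]
print(close)
```

With these illustrative cut-offs, only the first four clusters in the table pass the filter.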

Domains

Domain labels: Unannotated, PG_1, PG_1
Representative sequence (used for alignment): 2363R (332 AA)
Member sequence: 3ggKK (326 AA)
Axis: residues 1 to 332 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01471

Taxonomy

Phage: Unknown from Metagenome (Taxonomy ID: UNKNOWN_ENVHOG; Lineage: No lineage information)
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2363R) rather than this protein.
PDB ID 2363R
Method AlphaFoldv2
Resolution 79.50
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
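
The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping is given below. The legend does not say which band the exact boundary values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption, as are the function name `plddt_band` and the example scores.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:  # assumption: exactly 90 falls into "High"
        return "High"
    if plddt > 50:  # assumption: exactly 70 falls into "Low"
        return "Low"
    return "Very low"  # assumption: exactly 50 falls into "Very low"


# Hypothetical per-residue scores, for illustration only.
for score in (95.2, 82.0, 61.3, 34.9):
    print(score, plddt_band(score))
```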