Protein

Protein accession
3kfXl [EnVhog]
Representative
6gtqm
Source
EnVhog (cluster: phalp2_7646)
Protein name
3kfXl
Lysin probability
99%
PhaLP type
endolysin
Probability: 93% (predicted by ML model)
Protein sequence
MFNPETVKAGDKNTSVLLLQEILRARGFKGKNGKALKLTWTADANTIYALKAYQESRKEVLEVDGICGSATWKDLIAI
Physico‐chemical
properties
protein length:78 AA
molecular weight:8613,9 Da
isoelectric point:9,17
hydropathy:-0,21
Representative Protein Details
Accession
6gtqm
Protein name
6gtqm
Sequence length
78 AA
Molecular weight
8563,88160 Da
Isoelectric point
9,03120
Sequence
MFNPETVKAGDKNTSVLLLQEILRARGFKGKNGKALKLTWTADANTICALKAYQESRKEVLEVDGICGPATWKDLIAI
Other Proteins in cluster: phalp2_7646
Total (incl. this protein): 10 Avg length: 70,2 Avg pI: 9,25

Protein ID Length (AA) pI
6gtqm 78 9,03120
3wY9L 69 8,83309
5MsyI 53 9,25465
5Wsus 53 9,10424
67pKD 77 9,17077
68BZs 63 9,25342
6cxs3 78 9,16658
7VqWK 71 9,89417
zbx0 82 9,67382
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_22989
3B6sG
2 94,8% 78 4.647E-42
2 phalp2_19553
3PyzH
44 45,8% 72 4.929E-12
3 phalp2_1201
8ENBB
6 36,1% 72 1.404E-08
4 phalp2_35175
5YfIt
13 30,2% 76 1.404E-08
5 phalp2_4363
N2SL
71 32,1% 84 1.930E-08
6 phalp2_8066
5TREZ
3 29,5% 71 1.793E-07
7 phalp2_8302
8ztw3
1 34,7% 69 6.408E-07
8 phalp2_15416
23BCo
2 30,1% 73 2.291E-06
9 phalp2_28190
8EvJv
1 36,5% 63 2.930E-05
10 phalp2_31076
1Nlkd
2 31,8% 69 7.619E-05

Domains

Domains
Unannotated
Representative sequence (used for alignment): 6gtqm (78 AA)
Member sequence: 3kfXl (78 AA)
1 78 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6gtqm) rather than this protein.
PDB ID
6gtqm
Method AlphaFoldv2
Resolution 95.23
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50