Protein

Protein accession
6SY7F [EnVhog]
Representative
49aGX
Source
EnVhog (cluster: phalp2_14340)
Protein name
6SY7F
Lysin probability
55%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MMKTIALCLAITMYREARSEPISTQIAVAKVLMNRAVKEKITPCKALEKAYQWAWVKKYKVVTPDANGKIDDCVWQQTKRIAHDIKK
Physico‐chemical
properties
protein length:87 AA
molecular weight:10008,9 Da
isoelectric point:9,72
hydropathy:-0,26
Representative Protein Details
Accession
49aGX
Protein name
49aGX
Sequence length
78 AA
Molecular weight
8974,34390 Da
Isoelectric point
8,92760
Sequence
MIDQALACLAQTIFMEARGESITGQIAVGYVLYRRADFKPENVCIEMKKPYQFSWYGKLKPPSPQALKNTSYYKIAYE
Other Proteins in cluster: phalp2_14340
Total (incl. this protein): 6 Avg length: 98,2 Avg pI: 8,41

Protein ID Length (AA) pI
49aGX 78 8,92760
2EDJl 130 9,20752
4cEuY 129 8,81523
6KECN 94 6,07111
mvzV 71 7,72211
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_25016
SNJn
2 57,4% 54 3.855E-17
2 phalp2_8307
8Asy8
29 42,6% 68 2.242E-10
3 phalp2_21853
4tf87
1 40,3% 52 2.465E-07
4 phalp2_6737
26WZo
49 32,4% 74 2.465E-07
5 phalp2_12091
5qIjA
11 35,8% 67 6.408E-07
6 phalp2_34810
3FopW
4 39,1% 69 2.130E-05
7 phalp2_22991
3BMOW
57 31,4% 70 2.930E-05
8 phalp2_37344
8tT1g
1 33,3% 63 7.619E-05
9 phalp2_12179
6Bd6L
1 38,0% 50 5.153E-04
10 phalp2_12408
8Erth
12 33,3% 66 7.087E-04

Domains

Domains
Unannotated
Representative sequence (used for alignment): 49aGX (78 AA)
Member sequence: 6SY7F (87 AA)
1 78 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (49aGX) rather than this protein.
PDB ID
49aGX
Method AlphaFoldv2
Resolution 96.31
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50