Protein

Protein accession
4l7TW [EnVhog]
Representative
2hQSl
Source
EnVhog (cluster: phalp2_2016)
Protein name
4l7TW
Lysin probability
99%
PhaLP type
endolysin
Probability: 83% (predicted by ML model)
Protein sequence
MNFLAAAKATTKPTPLPHQQAAWNWAWELLAPDEQATFLDKFRSDPAPKPTLAWEPAAALIREFEGFSSVVYKCPAGVPTIGWGTTRWPDGAAVKIGDTITRD
Physico‐chemical
properties
protein length:103 AA
molecular weight:11280,7 Da
isoelectric point:5,74
hydropathy:-0,26
Representative Protein Details
Accession
2hQSl
Protein name
2hQSl
Sequence length
145 AA
Molecular weight
15862,94650 Da
Isoelectric point
6,76568
Sequence
VSNFLAAVKATTKPTPLPHQQAAWNWAWELLAPDEQKTFLDKFRADPKPKAALAWEPAAKLIREFEGFSDVAYICPAGVPTIGWGTTRWPDGAAVKIGDTITRDAADGLLDNMLETQVVPALAKSIPSWKTLSAQRQNALISFAY
Other Proteins in cluster: phalp2_2016
Total (incl. this protein): 5 Avg length: 131,8 Avg pI: 5,69

Protein ID Length (AA) pI
2hQSl 145 6,76568
1tMQH 122 4,80815
SuAR 109 5,54739
hVR3 180 5,56462
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_15231
SBta
7 87,7% 98 2.089E-54
2 phalp2_8851
2IyxA
73 50,7% 132 7.760E-31
3 phalp2_3745
5wa2f
396 44,6% 132 1.009E-23
4 phalp2_9019
535M3
35 45,6% 92 1.013E-14
5 phalp2_33687
Evty
71 42,0% 126 2.063E-12
6 phalp2_192
5FaJe
1 45,6% 92 8.705E-11
7 phalp2_31263
8mZo1
167 42,5% 94 1.257E-08
8 phalp2_20450
3rDzg
26 41,1% 90 1.496E-07
9 phalp2_28762
4NrrK
6 40,6% 96 7.006E-07
10 phalp2_37312
8djZH
4 37,7% 106 7.033E-05

Domains

Domains
Disordered region
GH24
Representative sequence (used for alignment): 2hQSl (145 AA)
Member sequence: 4l7TW (103 AA)
1 145 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2hQSl) rather than this protein.
PDB ID
2hQSl
Method AlphaFoldv2
Resolution 86.00
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50