Protein

Protein accession
4DASL [EnVhog]
Representative
4wMbz
Source
EnVhog (cluster: phalp2_33051)
Protein name
4DASL
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKPSSKKTKILLIFLAVLVLTTSIWLIKKNIRNIKSSLVQLAVVENHQNDGYYEEMVKRLADCESSNNQYAINPYDSKTSSLGLYQFKLETFIKYAKKYGVIGDNTSLNWIITLVFDRKTNEYLVLQMLKNEDMETLKNLWGNCIRKIGFRK
Physico-chemical properties
protein length: 152 AA
molecular weight: 17694,5 Da
isoelectric point: 9,52
hydropathy: -0,25
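The page does not say how the hydropathy value is computed. A common choice is the GRAVY score, the mean Kyte-Doolittle hydropathy index over all residues. The sketch below assumes that definition. The names `KYTE_DOOLITTLE`, `gravy` and `SEQ_4DASL` are illustrative and do not come from the database.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the page's "hydropathy" is a GRAVY score; this is not
    confirmed by the source.
    """
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Sequence of 4DASL as listed on this page.
SEQ_4DASL = (
    "MKPSSKKTKILLIFLAVLVLTTSIWLIKKNIRNIKSSLVQLAVVENHQND"
    "GYYEEMVKRLADCESSNNQYAINPYDSKTSSLGLYQFKLETFIKYAKKYG"
    "VIGDNTSLNWIITLVFDRKTNEYLVLQMLKNEDMETLKNLWGNCIRKIGFRK"
)

print("length:", len(SEQ_4DASL), "AA")
print("GRAVY:", round(gravy(SEQ_4DASL), 2))
```

A negative GRAVY indicates a protein that is hydrophilic on average. Molecular weight and isoelectric point depend on the residue masses and pKa set chosen, so they are not reproduced here.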
Representative Protein Details
Accession
4wMbz
Protein name
4wMbz
Sequence length
139 AA
Molecular weight
16380,77350 Da
Isoelectric point
6,83076
Sequence
MENKKETKIFFILIIISYVVMAGGFAFIGMQNKKPIQKEIDWHMEAYLTAIEFCETSGYHEIVQMDSNDKLSYGAFQFQFETFRDYGKKYGLIKKEASDNEVKNMIMDYDLQRSIAREMVKEGLDSQHWKICYKKLNGK
Other Proteins in cluster: phalp2_33051
Total (incl. this protein): 12; Avg length: 151,8 AA; Avg pI: 8,83

Protein ID Length (AA) pI
4wMbz 139 6,83076
1NSDW 148 8,70428
2u0uc 150 8,91200
4Mw1n 147 7,58530
4wdzO 152 9,26154
5hFqK 150 9,40905
7E9G1 153 8,83567
hJaS 162 9,43168
kgpV 149 8,57360
kgqW 153 9,53947
kh2N 166 9,30184
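The cluster totals include this protein (4DASL, 152 AA, pI 9,52), while the table lists only the other 11 members. The sketch below recomputes both averages from the values on this page. It assumes the averages are plain arithmetic means. `parse_decimal` is a hypothetical helper for the decimal-comma notation used here.

```python
def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '8,83' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI) for the 11 members listed in the table.
OTHER_MEMBERS = [
    ("4wMbz", 139, "6,83076"),
    ("1NSDW", 148, "8,70428"),
    ("2u0uc", 150, "8,91200"),
    ("4Mw1n", 147, "7,58530"),
    ("4wdzO", 152, "9,26154"),
    ("5hFqK", 150, "9,40905"),
    ("7E9G1", 153, "8,83567"),
    ("hJaS", 162, "9,43168"),
    ("kgpV", 149, "8,57360"),
    ("kgqW", 153, "9,53947"),
    ("kh2N", 166, "9,30184"),
]

# This page's protein, omitted from the table but counted in the totals.
THIS_PROTEIN = ("4DASL", 152, "9,52")

members = OTHER_MEMBERS + [THIS_PROTEIN]
avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(parse_decimal(pi) for _, _, pi in members) / len(members)

print(f"{len(members)} members, avg length {avg_length:.2f}, avg pI {avg_pi:.2f}")
```

Under these assumptions the means come out at 151.75 AA and about 8.83, which agrees with the rounded totals shown above. The pI of 4DASL is given to only two decimals on this page, so the last digits of the mean pI are approximate.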
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_34952 (4Dsud) 16 36,0% 97 2.108E-17
2 phalp2_17923 (UMZJ) 12 38,6% 101 2.602E-16
3 phalp2_22614 (1MavD) 11 34,1% 129 6.608E-13
4 phalp2_25243 (41xBv) 4 29,4% 102 8.075E-12
5 phalp2_29073 (6SdDJ) 10 38,2% 89 1.104E-11
6 phalp2_13086 (8rpjS) 2 28,7% 108 1.509E-11
7 phalp2_30772 (8yRd) 3 35,7% 95 3.035E-09
8 phalp2_29908 (80b6u) 1 27,7% 101 6.799E-08
9 phalp2_13256 (38QAx) 2 38,4% 91 2.351E-07
10 phalp2_12892 (2v9z8) 3 30,8% 107 2.052E-06

Domains

Disordered region
Unannotated
Representative sequence (used for alignment): 4wMbz (139 AA)
Member sequence: 4DASL (152 AA)
Domain positions follow the representative sequence above (axis: 1 to 139 AA); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage: Unknown from Metagenome [NCBI]
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4wMbz) rather than this protein.
PDB ID: 4wMbz
Method: AlphaFoldv2
Resolution: 90.82
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
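The confidence legend can be read as a simple banding of pLDDT scores. The sketch below maps a score to its band. Two things are assumptions: the legend does not say which band the exact boundary values 90, 70 and 50 belong to (they are placed in the lower band here), and the page does not state that the 90.82 listed under Resolution is a mean pLDDT, although that is a plausible reading for an AlphaFold model. `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: scores of exactly 90, 70 or 50 fall into the lower band,
    since the legend uses strict inequalities on both sides.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Assumption: treating the listed value 90.82 as a mean pLDDT.
print(plddt_band(90.82))
```

Per-residue pLDDT values, when available, give a finer picture than a single mean, because a high average can hide a poorly predicted terminus or linker.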