Protein

Protein accession
4LW5L [EnVhog]
Representative
3Qgcc
Source
EnVhog (cluster: phalp2_16047)
Protein name
4LW5L
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKDFKISEHFTFFELTKTRDFPNLLDENREYFAKEPYISRLTVACEYLLENIRIEIDKPIIVNNGGRFPALNEAVGGVWNSQHLFGGFQDGAFDFYCPQMSVAELGKYITDQSNLNWHQLRIYLSQNFIHLGMPLGHNDGQVTVIK
Physico-chemical properties
protein length: 146 AA
molecular weight: 16908.0 Da
isoelectric point: 5.46
hydropathy: -0.29
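The length and hydropathy values above can be rechecked from the sequence itself. The following is a minimal sketch, assuming the hydropathy figure is a GRAVY score (mean Kyte-Doolittle hydropathy per residue); the page does not state which scale it uses, and the names `KYTE_DOOLITTLE`, `SEQUENCE` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy scale. Assumption: the record does not say
# which scale was used to produce its "hydropathy" value.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of 4LW5L as listed in this record.
SEQUENCE = (
    "MKDFKISEHFTFFELTKTRDFPNLLDENREYFAKEPYISRLTVACEYLLENIRIEIDKPI"
    "IVNNGGRFPALNEAVGGVWNSQHLFGGFQDGAFDFYCPQMSVAELGKYITDQSNLNWHQL"
    "RIYLSQNFIHLGMPLGHNDGQVTVIK"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(f"length: {len(SEQUENCE)} AA")
    print(f"GRAVY:  {gravy(SEQUENCE):.2f}")
```

Under that assumption the score should land close to the listed -0.29; a clearly different result would suggest the page uses another scale.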
Representative Protein Details
Accession
3Qgcc
Protein name
3Qgcc
Sequence length
157 AA
Molecular weight
18073.53920 Da
Isoelectric point
8.66798
Sequence
MKLKAGDFWLSKHFSFFECTKSRDHPELVKANREYFSHQPYLDRLIFGSEYMLEGIREIVDAPVIVNNGGRFPELNAAVGGVSTSQHLFARMNDGAYDITVPGQKVEMVAFKIFNHGLSFYQMRVYTKIGFIHIGMPRRYRDMQISFPESEAPGWAK
Other Proteins in cluster: phalp2_16047
Total (incl. this protein): 6; Avg length: 157.7 AA; Avg pI: 6.81

Protein ID   Length (AA)   pI
3Qgcc        157           8.66798
174jF        163           6.29307
4Hb83        159           7.09467
4Hmaf        166           6.23230
d7d0         155           7.12110
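The table lists the five other members, while the totals count six proteins including this one. As a rough consistency check, the sketch below recomputes the cluster averages, assuming they are plain arithmetic means over the five listed members plus 4LW5L (146 AA, pI 5.46 from this record); the variable names are illustrative.

```python
# (protein ID, length in AA, isoelectric point) for all six cluster members.
# The first five rows come from the cluster table; 4LW5L is this protein,
# with the values listed under its physico-chemical properties.
members = [
    ("3Qgcc", 157, 8.66798),
    ("174jF", 163, 6.29307),
    ("4Hb83", 159, 7.09467),
    ("4Hmaf", 166, 6.23230),
    ("d7d0", 155, 7.12110),
    ("4LW5L", 146, 5.46),
]

# Assumption: the page reports unweighted arithmetic means.
avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(pi for _, _, pi in members) / len(members)

if __name__ == "__main__":
    print(f"members:    {len(members)}")
    print(f"avg length: {avg_length:.1f} AA")
    print(f"avg pI:     {avg_pi:.2f}")
```

With these inputs the means round to 157.7 AA and 6.81, which agrees with the totals line and supports reading "incl. this protein" as adding 4LW5L to the five listed rows.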
Similar Clusters (pHMM search)
#   Cluster (accession)     # Members  Identity (%)  Alignment Length  E-value
1   phalp2_9595 (1rj6b)     47         30.7          143               5.057E-25
2   phalp2_23143 (4GhkT)    33         34.3          131               6.228E-21
3   phalp2_10381 (1KRjC)    51         42.1          102               7.649E-20
4   phalp2_21194 (151UP)    2          29.3          150               1.958E-19
5   phalp2_30101 (38zFk)    347        31.4          162               4.487E-18
6   phalp2_21393 (3LPeU)    1997       34.1          123               1.024E-16
7   phalp2_32134 (6FuCt)    10         31.8          160               6.679E-16
8   phalp2_1343 (17yCd)     10947      29.2          147               1.247E-15
9   phalp2_25524 (3j7um)    5          27.0          133               5.940E-15
10  phalp2_37718 (4e1ey)    787        31.8          132               1.514E-14

Domains

Representative sequence (used for alignment): 3Qgcc (157 AA)
Member sequence: 4LW5L (146 AA)
Domain positions follow the representative sequence above (axis: residues 1 to 157); the member sequence bar is scaled to the same axis.
Legend: EAD (enzymatically active domain), CBD (cell wall binding domain), Linker, Disordered, Unannotated
Pfam accessions: PF08291

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3Qgcc) rather than this protein.
PDB ID 3Qgcc
Method AlphaFoldv2
Resolution 96.84
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
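The confidence legend can be expressed as a small lookup. This is a sketch only: the function name `plddt_band` is hypothetical, the legend does not say which band the exact boundary values 90, 70 and 50 belong to (they are placed in the lower band here), and treating the 96.84 listed under "Resolution" as a mean pLDDT score is an assumption, since the record does not say so.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: scores exactly at 90, 70 or 50 fall into the lower band,
    because the legend uses strict inequalities on both sides.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT scores are expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # 96.84 is the value shown under "Resolution" for the AlphaFold model;
    # reading it as a mean pLDDT is an assumption, not stated by the record.
    print(plddt_band(96.84))
```

If that reading is correct, the representative model sits in the "Very high" band.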