Protein

Protein accession
4UWk1 [EnVhog]
Representative
4Q7iM
Source
EnVhog (cluster: phalp2_3622)
Protein name
4UWk1
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VKDNLAYGFKVTMLAEGVEFVNDQNDRGGPTKMGLTLGLMKNLRLDLNNDGSVTVDDVHLVTEEVVLDVYRKEFWNRVKADTLPTGQDVQTADFGFNAGPSRARDLLRKSPNMNGFALHQIAYYLSLADRLPGYVGFFRGLTRRALSVLVASAAICEYYETSQAITRIRFMLSEADKDSTFRGIFREKVRSVLYG
Physico‐chemical
properties
protein length:195 AA
molecular weight:21872,7 Da
isoelectric point:7,98
hydropathy:-0,21
Representative Protein Details
Accession
4Q7iM
Protein name
4Q7iM
Sequence length
195 AA
Molecular weight
21898,80340 Da
Isoelectric point
7,75809
Sequence
MKENLVYGFRTTMLAEGVEFVNDPNDRGGPTRMGLTLGLMKNLGLDLNCDGRVTVDDVHLVTDEVVLDVYKKEFWNRVRADTLPPGQDVQTTDFAFNSGPFRAKELLRKSPNINGFALHQIAYYLSLADRLPGYVGFFRGLTRRALSVLAASAALCEWYETSQAVTRIRLMLAEADKDTTFRGIFREKVRSVLNG
Other Proteins in cluster: phalp2_3622
Total (incl. this protein): 6 Avg length: 195,0 Avg pI: 8,12

Protein ID Length (AA) pI
4Q7iM 195 7,75809
3LiF 195 8,69932
4Glcn 195 7,85491
4PyZF 195 7,87567
8bLH 195 8,52989
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14453
4GAEG
12 36,1% 119 1.518E-17
2 phalp2_14059
8aO3S
400 32,5% 175 1.160E-13
3 phalp2_21492
8pawf
134 30,1% 156 7.142E-11
4 phalp2_18877
1MrjM
564 28,0% 182 1.101E-09
5 phalp2_20698
4XKIk
408 30,5% 170 3.698E-09
6 phalp2_13016
426A1
31 26,2% 164 1.239E-08
7 phalp2_25028
ZsUq
3460 28,4% 169 1.516E-06
8 phalp2_20399
37U0b
4 26,2% 156 5.000E-06

Domains

Domains
Representative sequence (used for alignment): 4Q7iM (195 AA)
Member sequence: 4UWk1 (195 AA)
1 195 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05838

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4Q7iM) rather than this protein.
PDB ID
4Q7iM
Method AlphaFoldv2
Resolution 85.29
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50