Protein

Protein accession
4gfUe [EnVhog]
Representative
86pNA
Source
EnVhog (cluster: phalp2_4294)
Protein name
4gfUe
Lysin probability
99%
PhaLP type
endolysin
Probability: 92% (predicted by ML model)
Protein sequence
MIRLLLLVFFCSFLLIKVTEKYEQKGYKELTERLSGLGMRFPTIFLSQVYLETGNFTSRIYKENNNMMGMKLPFYRESLAIGENLGHAVYENTGDCILDYIKWQEYWLPRYEENNGKLETDEDYLAFLNAIGYAEDKKYLDKIRNINKTVRKKLNLN
Physico‐chemical
properties
protein length:157 AA
molecular weight:18642,3 Da
isoelectric point:7,67
hydropathy:-0,41
Representative Protein Details
Accession
86pNA
Protein name
86pNA
Sequence length
157 AA
Molecular weight
17920,41460 Da
Isoelectric point
8,69203
Sequence
MKTTLASLLIIAGTLLIIKSYGVSEEVQEISSSSVLQSLPFAEVIIAQGLLESNYFSSAIFKENNNWLGLKCAEFRSTYCTGTNRGHAVFSSTTDCLLDYIEWQKKYLPRYEKNFGQVDSVDRYIDFLVKYRYAEDPHYGSKLKRLLPLARLLLKKN
Other Proteins in cluster: phalp2_4294
Total (incl. this protein): 3 Avg length: 166,7 Avg pI: 7,52

Protein ID Length (AA) pI
86pNA 157 8,69203
1LrHn 186 6,21417
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24545
7HQNr
274 36,0% 111 7.672E-26
2 phalp2_22607
1LayP
7 37,1% 113 6.929E-22
3 phalp2_32846
31z3H
398 35,1% 128 8.522E-21
4 phalp2_29907
82GaP
23 36,8% 103 2.145E-17
5 phalp2_1958
4xKIA
4 31,3% 102 2.068E-14
6 phalp2_38973
8d1wC
1 26,6% 105 7.196E-14
7 phalp2_34515
4NK1T
22 32,0% 106 3.414E-13
8 phalp2_15505
7ZVMu
112 28,8% 135 1.423E-11
9 phalp2_6776
83YaD
163 31,6% 117 2.317E-10
10 phalp2_37081
14SWF
8 34,5% 110 2.018E-09

Domains

Domains
Representative sequence (used for alignment): 86pNA (157 AA)
Member sequence: 4gfUe (157 AA)
1 157 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01832

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (86pNA) rather than this protein.
PDB ID
86pNA
Method AlphaFoldv2
Resolution 93.91
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50