Protein

Protein accession
4Ng0R [EnVhog]
Representative
107NP
Source
EnVhog (cluster: phalp2_21174)
Protein name
4Ng0R
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VKRKILIALLWLGIVFAYLTWLVSGVSSQRHRRLVPADGTAVHTKMPVAVVQEPQDSLTLEQRDSLVVRTARRWRVNPTLALAVSHVENWRAKPEAVSPAGAIGLMQIMPVHLNAFPECAPAAPLAGILNVQIVLQLADLYVPEINVCYGIRILGNYLRRHENRERALAAYNGAVTDWKAALYIRQVAMAQQRLIL
Physico‐chemical
properties
protein length:196 AA
molecular weight:21852,4 Da
isoelectric point:10,07
hydropathy:0,21
Representative Protein Details
Accession
107NP
Protein name
107NP
Sequence length
150 AA
Molecular weight
16636,70500 Da
Isoelectric point
9,09393
Sequence
MNESPAIQGEPDESLLARLRETPQQLRTRLVYAEARRQGISPELALAVSQVENWGADTAAISKAGAVGLMQVMPFWTANESLSAYCGGRELRDPQINICFGVAILRDYLTRHHTQDQALRAYNGSLRLRLAGARYVRLVNNKLKETQANG
Other Proteins in cluster: phalp2_21174
Total (incl. this protein): 9 Avg length: 169,0 Avg pI: 8,65

Protein ID Length (AA) pI
107NP 150 9,09393
1b5xY 162 9,71863
1b7wX 187 8,52287
1bdAB 187 9,32988
1bgBq 172 7,11399
1dqxm 176 7,91338
4GuBo 126 9,60419
gvBO 165 6,50348
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_27235
3QwTC
2 37,2% 145 6.168E-33
2 phalp2_38837
1DV7N
1 44,0% 118 4.704E-27
3 phalp2_25662
4qSKS
2 32,8% 125 5.732E-14
4 phalp2_3112
44dgn
1 38,1% 110 6.962E-13
5 phalp2_27552
5rxEe
14 39,3% 94 8.423E-12
6 phalp2_29884
8cUTh
33 34,4% 116 8.915E-10
7 phalp2_36497
1lkPy
4 29,6% 118 1.447E-08
8 phalp2_9329
7BCrF
25 24,3% 193 1.447E-08
9 phalp2_3385
3Alz4
5 32,4% 114 2.686E-08
10 phalp2_32795
2xoLt
8 31,6% 139 2.686E-08

Domains

Domains
Representative sequence (used for alignment): 107NP (150 AA)
Member sequence: 4Ng0R (196 AA)
1 150 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (107NP) rather than this protein.
PDB ID
107NP
Method AlphaFoldv2
Resolution 92.79
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50