Protein

Protein accession
2gNV5 [EnVhog]
Representative
8u4wJ
Source
EnVhog (cluster: phalp2_29656)
Protein name
2gNV5
Lysin probability
98%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MSVFLTVFVSLAASHAPVPSYHPIDEAVFVDTLRTYAKQTHRKLTQEEAVSLTRQFDHAGAMADVDPLFFAGLTYAESRWRNRAIGDGGNSRGIWQMSVSCVRAVMGRVTKREALEVIANKHARTWVAGAFWRRLLRKWSRREAATVYNCGPIRCKQSNGTQRKTTPAVRGYFKHFRRMLAIMGEFSS
Physico‐chemical
properties
protein length:188 AA
molecular weight:21311,3 Da
isoelectric point:10,58
hydropathy:-0,29
Representative Protein Details
Accession
8u4wJ
Protein name
8u4wJ
Sequence length
178 AA
Molecular weight
19884,91260 Da
Isoelectric point
10,25081
Sequence
MIAILAAWVMTHPPIPAYQSLPLEDLSYAINAHAERKYKALPPGVSYELAREFDAVGRLADVDPLFIAGLTYTESRWSGGALGDGGASVGLYQMVASSVRSVLPRLSRKQARKALADPLQRVLCAGYYWRRLIRRYGRTRAAVVFNCGPVRCKKLKHTRVTRGYFRNYKQIRGGLKGE
Other Proteins in cluster: phalp2_29656
Total (incl. this protein): 13 Avg length: 178,9 Avg pI: 9,71

Protein ID Length (AA) pI
8u4wJ 178 10,25081
1SLFo 199 10,67076
1TxsY 197 10,09203
2rvND 162 9,33059
2rwa0 162 9,29977
3KZLn 164 9,19482
5s4zV 165 9,02926
7Tl9U 205 9,66093
7VN2h 185 10,45292
8hUtD 164 8,76971
8l1Br 195 9,77026
rEC8 162 9,15672
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_20134
27Dys
3 29,3% 133 1.577E-19
2 phalp2_8588
1Q5Su
11 19,8% 131 2.908E-08
3 phalp2_8654
44Rd9
6 22,8% 162 7.239E-08
4 phalp2_37324
8gjrX
7 27,9% 129 1.799E-07
5 phalp2_36497
1lkPy
4 25,6% 121 4.998E-06
6 phalp2_29431
1lokX
13 27,0% 111 8.078E-04

Domains

Domains
Representative sequence (used for alignment): 8u4wJ (178 AA)
Member sequence: 2gNV5 (188 AA)
1 178 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (8u4wJ) rather than this protein.
PDB ID
8u4wJ
Method AlphaFoldv2
Resolution 87.89
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50