Protein

Protein accession
891Ul [EnVhog]
Representative
44NDP
Source
EnVhog (cluster: phalp2_22690)
Protein name
891Ul
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSRYTWGLCMALAVLPAVASAGVDEMLTKAANRHGIPADIVHAVAHVESTKRCGVRNGGSVGIMQVQRGAAKEVGVAWPFRSCEDEIEAGVRYLRKALDKGGYDCKGITLYNTGIYARPHCSAYGQKVMAARKKYQ
Physico‐chemical
properties
protein length:136 AA
molecular weight:14702,9 Da
isoelectric point:9,42
hydropathy:-0,15
Representative Protein Details
Accession
44NDP
Protein name
44NDP
Sequence length
129 AA
Molecular weight
13842,90330 Da
Isoelectric point
9,50788
Sequence
MKALAVIAVLSTGQVDVLLQQAAQRHGVPANIVEAVAYIESTKRCGILNGKHRGIMQVGKDAANEVGAAWPPKTCADEIEIGVMYLKLALQRGGNGCNGATLYNSGINARPHCSEYGRKVMRRANRKIE
Other Proteins in cluster: phalp2_22690
Total (incl. this protein): 12 Avg length: 134,2 Avg pI: 9,01

Protein ID Length (AA) pI
44NDP 129 9,50788
18tBC 133 9,64681
3zCrJ 136 9,22325
46eAF 134 8,96480
47twZ 131 8,91341
48ToG 137 7,64430
48Xnz 137 9,18205
49NuY 133 9,50652
4ggvD 136 6,81462
5lwBg 133 9,50652
KWZC 135 9,77690
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_22914
30gdj
13 51,3% 109 1.336E-33
2 phalp2_23933
1Xbk7
7 39,0% 110 1.053E-27
3 phalp2_32909
3znVu
1 40,9% 105 9.319E-20
4 phalp2_34972
4FAhV
74 33,3% 126 6.963E-17
5 phalp2_33520
6j6f
19 33,3% 114 5.707E-15
6 phalp2_21100
mnNR
41 31,2% 112 2.750E-14
7 phalp2_6507
ADdR
1 35,4% 93 1.806E-10
8 phalp2_20713
5a1Cs
2 32,4% 111 8.646E-10
9 phalp2_4421
2GvBa
2 25,6% 121 7.721E-09
10 phalp2_37089
16u4b
9 32,8% 125 1.055E-08

Domains

Domains
Unannotated
Representative sequence (used for alignment): 44NDP (129 AA)
Member sequence: 891Ul (136 AA)
1 129 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (44NDP) rather than this protein.
PDB ID
44NDP
Method AlphaFoldv2
Resolution 92.73
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50