Protein

Protein accession
4c33W [EnVhog]
Representative
4Uj8G
Source
EnVhog (cluster: phalp2_14550)
Protein name
4c33W
Lysin probability
52%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MRVAIIILGLACLPASISWAQPWPAAPPRLGQLSMPQNYWDWTLEAAREYGVSPYVIQGFMAIESRYDPAAMSGRGRCIGLMQLDRGVARGLGVDPWNPRENIHGGARVLAALLKKHRGNLARAARAYNGPGCPQAYVREVLRAVRQAEKTGAESVGQ
Physico‐chemical
properties
protein length:158 AA
molecular weight:17220,7 Da
isoelectric point:9,94
hydropathy:-0,17
Representative Protein Details
Accession
4Uj8G
Protein name
4Uj8G
Sequence length
124 AA
Molecular weight
13595,49910 Da
Isoelectric point
10,32721
Sequence
MPQNYWDWTLEAAREHGVSPYVIQGFMAIESRYDPAAMSGRGRCIGLMQLDRGVARGLGVDPWNPRENIHGGARVLAGLLKKHRGNLARVARAYNGPGCPPAYVREVLRAVRQAKKTGAKSVGQ
Other Proteins in cluster: phalp2_14550
Total (incl. this protein): 9 Avg length: 151,9 Avg pI: 9,82

Protein ID Length (AA) pI
4Uj8G 124 10,32721
1Zt5k 147 10,32237
1hmSJ 167 10,31141
35XEC 151 9,99391
4GIUQ 158 10,01885
4GlMO 158 9,44824
6WE35 152 8,51706
hGvA 152 9,54398
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_37089
16u4b
9 38,1% 110 3.053E-17
2 phalp2_40101
7ZJzm
4 36,7% 117 2.773E-16
3 phalp2_4421
2GvBa
2 34,7% 121 9.781E-16
4 phalp2_31447
3o77A
181 27,5% 109 1.666E-14
5 phalp2_17514
5x9xq
2 27,9% 118 2.067E-13
6 phalp2_664
1m3Kq
134 30,8% 120 3.878E-13
7 phalp2_26973
4Hvkb
3 36,4% 96 5.312E-13
8 phalp2_16618
8yg21
13 31,2% 125 1.365E-12
9 phalp2_21025
8KK9
3 24,7% 105 1.688E-11
10 phalp2_22639
1Wy8e
1 37,8% 95 5.929E-11

Domains

Domains
Representative sequence (used for alignment): 4Uj8G (124 AA)
Member sequence: 4c33W (158 AA)
1 124 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4Uj8G) rather than this protein.
PDB ID
4Uj8G
Method AlphaFoldv2
Resolution 95.15
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50