Protein

Protein accession
4emoE [EnVhog]
Representative
80CcD
Source
EnVhog (cluster: phalp2_14050)
Protein name
4emoE
Lysin probability
92%
PhaLP type
VAL
Probability: 88% (predicted by ML model)
Protein sequence
MTTTRLDPPHVRAQELLGIPWKEGGRTRLGADCLGAALMMHRLLGIVARDPWVAWSERWRAGARFADLADELAGWRAVPVDAERIVGDTAVWSNGSHVAVYVGAGWWLHSTREIGTYLAEDRFVRARIASVWRPIP
Physico‐chemical
properties
protein length:136 AA
molecular weight:15195,2 Da
isoelectric point:9,17
hydropathy:-0,10
Representative Protein Details
Accession
80CcD
Protein name
80CcD
Sequence length
133 AA
Molecular weight
14325,10720 Da
Isoelectric point
6,94734
Sequence
MADHVHRAALELIGIPFREGGRDFRQVLGGTDCAGVCWEFCRRAGIDARDPWLIIADQWRAQEIGVDAGLRAGGWVLCDSAVRAVGDIGESADAGHVCVYVGGGRALHARRGSVSFLASIVHAPAARWWRYAA
Other Proteins in cluster: phalp2_14050
Total (incl. this protein): 12 Avg length: 136,1 Avg pI: 7,27

Protein ID Length (AA) pI
80CcD 133 6,94734
11lvG 136 10,04780
1biLv 146 7,06733
1bqRK 143 5,68836
4Q8EL 130 6,39793
4euUb 129 8,03426
4gQNl 124 5,28122
5kfbF 130 5,76532
8sxjh 139 6,28846
YhCs 145 7,88302
lAA1 142 8,63285
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_7317
4eSpu
7 30,4% 128 2.083E-16
2 phalp2_22202
6VbUQ
26 30,8% 120 1.240E-14
3 phalp2_345
6Xoqn
767 29,6% 108 2.823E-10
4 phalp2_14049
80BsD
1 24,2% 140 1.347E-09
5 phalp2_21754
3QymU
8 25,7% 132 2.516E-09
6 phalp2_27353
4yLib
52 30,1% 106 1.198E-08
7 phalp2_12585
1l6Kj
132 29,1% 127 1.976E-07
8 phalp2_5892
5HKSo
183 23,2% 116 3.679E-07
9 phalp2_38609
25FGT
2035 28,3% 134 9.343E-07
10 phalp2_27198
3iRJT
385 28,4% 116 1.274E-06

Domains

Domains
Unannotated
Representative sequence (used for alignment): 80CcD (133 AA)
Member sequence: 4emoE (136 AA)
1 133 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (80CcD) rather than this protein.
PDB ID
80CcD
Method AlphaFoldv2
Resolution 73.39
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50