Protein

Protein accession
4vI8x [EnVhog]
Representative
5Oe2c
Source
EnVhog (cluster: phalp2_33284)
Protein name
4vI8x
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIARIFQGYYLTRDGRISHFTRSELACRNSCGLADMNERTIDALEASRLEAERALFPTSGVRCGAHNTAVGGSVDSPHLRGLAVDLTWSADIFSLWELLRKHFLRVCMYSRSKGGHCHGDVWITDRVIWDVVTSDGAFLIDDFLRSYTISLTREAVNQVLEAT
Physico-chemical properties
protein length: 163 AA
molecular weight: 18297.5 Da
isoelectric point: 6.29
hydropathy: -0.07
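The length and hydropathy fields can be recomputed from the sequence itself. The sketch below assumes the hydropathy value is a GRAVY score, i.e. the mean Kyte-Doolittle hydropathy over all residues; the page does not name the scale, so that is an assumption. `KD`, `SEQ`, and `gravy` are illustrative names, not part of the database.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of 4vI8x, copied from the record.
SEQ = (
    "MIARIFQGYYLTRDGRISHFTRSELACRNSCGLADMNERTIDALEASRLEAERALFPTSG"
    "VRCGAHNTAVGGSVDSPHLRGLAVDLTWSADIFSLWELLRKHFLRVCMYSRSKGGHCHGD"
    "VWITDRVIWDVVTSDGAFLIDDFLRSYTISLTREAVNQVLEAT"
)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy; assumes only the 20 standard residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KD[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(f"length: {len(SEQ)} AA")
    print(f"GRAVY (Kyte-Doolittle, assumed scale): {gravy(SEQ):.2f}")
```

A value far from the listed hydropathy would suggest the page uses a different scale or averaging window.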
Representative Protein Details
Accession
5Oe2c
Protein name
5Oe2c
Sequence length
128 AA
Molecular weight
14438.49020 Da
Isoelectric point
9.24820
Sequence
FQGRFRKQEAERFWKDIRYFSRNEPYIACSCGKCGGFPVEPAEKLMRLADAVREAAGKPMVPTSTVRCKTHNAEVGGVWNSRHLLGHAMDFRIPGLSAAEVLSIVRKQKNVVYCYAIDAQHVHMDIGN
Other Proteins in cluster: phalp2_33284
Total (incl. this protein): 4; Avg length: 154.3 AA; Avg pI: 7.36

Protein ID Length (AA) pI
5Oe2c 128 9.24820
2qTYy 163 6.94950
3QkL7 163 6.94950
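The cluster summary can be cross-checked against the member list. This is a minimal sketch, assuming the averages are plain arithmetic means over all four members, including 4vI8x itself (163 AA, pI 6.29, taken from its properties block). `MEMBERS` and `cluster_summary` are illustrative names.

```python
from statistics import mean

# (protein ID, length in AA, pI) copied from the record.
# 4vI8x is "this protein"; the other three come from the member table.
MEMBERS = [
    ("4vI8x", 163, 6.29),
    ("5Oe2c", 128, 9.24820),
    ("2qTYy", 163, 6.94950),
    ("3QkL7", 163, 6.94950),
]


def cluster_summary(members):
    """Return member count and unrounded mean length / mean pI."""
    lengths = [length for _, length, _ in members]
    pis = [pi for _, _, pi in members]
    return {
        "total": len(members),
        "avg_length": mean(lengths),
        "avg_pi": mean(pis),
    }


summary = cluster_summary(MEMBERS)

if __name__ == "__main__":
    print(summary)
```

The unrounded means are 154.25 AA and about 7.3593, which agree with the reported averages once rounded to the displayed precision.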
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_24008 (45fcF) 278 59.3 118 3.036E-39
2 phalp2_40197 (2bNeh) 8 35.0 120 1.012E-17
3 phalp2_835 (7WE8l) 7 37.2 102 1.901E-17
4 phalp2_6873 (2aTQG) 108 39.3 89 9.181E-17
5 phalp2_14040 (85wjf) 1353 38.0 92 6.808E-14
6 phalp2_29133 (79Li3) 3905 37.8 82 1.152E-12
7 phalp2_36487 (1iSG1) 11 38.8 85 2.389E-10
8 phalp2_11545 (XXz6) 1240 30.4 105 4.470E-10
9 phalp2_19392 (2nERN) 10 32.0 103 1.397E-08
10 phalp2_19324 (8pvH4) 112 28.7 94 6.660E-08
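The pHMM hits can be parsed and filtered programmatically. The sketch below handles the decimal-comma percentages used on the page and keeps hits under an E-value cutoff and above an identity cutoff. The cutoffs (1e-15 and 36%) are arbitrary illustrative values, not thresholds used by the database, and `HITS`, `parse_percent`, and `filter_hits` are hypothetical names.

```python
# (cluster, members, identity, alignment length, E-value) as shown in the table.
HITS = [
    ("phalp2_24008", 278, "59,3%", 118, "3.036E-39"),
    ("phalp2_40197", 8, "35,0%", 120, "1.012E-17"),
    ("phalp2_835", 7, "37,2%", 102, "1.901E-17"),
    ("phalp2_6873", 108, "39,3%", 89, "9.181E-17"),
    ("phalp2_14040", 1353, "38,0%", 92, "6.808E-14"),
    ("phalp2_29133", 3905, "37,8%", 82, "1.152E-12"),
    ("phalp2_36487", 11, "38,8%", 85, "2.389E-10"),
    ("phalp2_11545", 1240, "30,4%", 105, "4.470E-10"),
    ("phalp2_19392", 10, "32,0%", 103, "1.397E-08"),
    ("phalp2_19324", 112, "28,7%", 94, "6.660E-08"),
]


def parse_percent(text: str) -> float:
    """Convert a percentage such as '59,3%' (decimal comma) to 59.3."""
    return float(text.rstrip("%").replace(",", "."))


def filter_hits(hits, max_evalue=1e-15, min_identity=36.0):
    """Return cluster names passing both (illustrative) cutoffs, in table order."""
    kept = []
    for cluster, _members, identity, _aln_len, evalue in hits:
        if float(evalue) <= max_evalue and parse_percent(identity) >= min_identity:
            kept.append(cluster)
    return kept


if __name__ == "__main__":
    print(filter_hits(HITS))
```

With these example cutoffs, three clusters remain: phalp2_24008, phalp2_835, and phalp2_6873.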

Domains
Representative sequence (used for alignment): 5Oe2c (128 AA)
Member sequence: 4vI8x (163 AA)
Domain positions follow the representative sequence above (axis: 1 to 128 AA); the member sequence bar is scaled to the same axis.
Legend: EAD (enzymatically active domain), CBD (cell wall binding domain), Linker, Disordered, Unannotated
Pfam accessions: PF08291

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5Oe2c) rather than this protein.
PDB ID
5Oe2c
Method AlphaFoldv2
Resolution 93.24
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
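The confidence bands above map directly to a small lookup. This is a sketch under one assumption: the legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong, so the code places them in the lower band. `plddt_band` is an illustrative name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the page's confidence label.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band,
    since the legend leaves them unassigned.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (93.24, 75.0, 60.0, 10.0):
        print(value, plddt_band(value))
```

If the 93.24 listed for the AlphaFold model is a mean pLDDT (the page labels it as resolution, so this is only a guess), it would fall in the "Very high" band.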