Protein

Protein accession: 871Uj [EnVhog]
Representative: 5l2Hd
Source: EnVhog (cluster: phalp2_35107)
Protein name: 871Uj
Lysin probability: 82%
PhaLP type: endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTAAPDLAASFLKPIEGFSATPYPDSGGVWTIGYGTTYISGNPVTRYTQACTEELATAWLMSDMSDAWATVRNGVIATAKDCELAAFISFAYNEGDRAFLGSTLLARFNAGDIAGCEAQFSRWIYVSEGGRLVSVPGLVNRRAAEVALFRGVWKPPVAAPVVFAIGSKGHDVSLLQMALIGAGCLVPPPDGDFGEQTRAAVEKFQAANGLAVDGKVGPETAAVLGFDL
Physico-chemical properties
protein length: 228 AA
molecular weight: 23888,9 Da
isoelectric point: 4,72
hydropathy: 0,21
Representative Protein Details
Accession: 5l2Hd
Protein name: 5l2Hd
Sequence length: 243 AA
Molecular weight: 27583,87100 Da
Isoelectric point: 8,96783
Sequence
MKLSASGLLFLKSLEDLRLKSYPDEGGVWTIGYGTTKNVVPDMEITEEKAEEFLIRDVRESEDCINQHVVAKLNQNQFDALVSFVFNVGINAFRQSTLLKKINCLKFDEVPAQMRRWIYVKNHINKGLINRRNFEINLFTESVVVKEAKKLVIKTNQVVNDVVVRTMFERIPLLAGLFKFIDGKKVLLGRLGLFATAILQAVIELYPDGPLGQSAVLALGALSWFLTEFGIRHKQDKELRGVE
Other Proteins in cluster: phalp2_35107
Total (incl. this protein): 3; Avg length: 215,3; Avg pI: 7,63

Protein ID   Length (AA)   pI
5l2Hd        243           8,96783
A0A6G5Y728   175           9,21061
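The cluster averages reported above can be reproduced from the three member records. The sketch below is only an illustration: it assumes the averages are plain arithmetic means over all three members (this protein included), takes the values for 871Uj from its physico-chemical property list (where the pI is given only to two decimals), and converts decimal commas to points. The variable names are hypothetical.

```python
# Members of cluster phalp2_35107 as (protein ID, length in AA, pI).
# Values are copied from this page; decimal commas are written as points.
members = [
    ("871Uj", 228, 4.72),          # this protein (pI listed with 2 decimals)
    ("5l2Hd", 243, 8.96783),       # cluster representative
    ("A0A6G5Y728", 175, 9.21061),
]

# Assumption: the page reports simple arithmetic means over all members.
avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(pi for _, _, pi in members) / len(members)

print(f"Avg length: {avg_length:.1f}")  # Avg length: 215.3
print(f"Avg pI: {avg_pi:.2f}")          # Avg pI: 7.63
```

Both results match the summary line for the cluster, which is consistent with the averages including this protein.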
Similar Clusters (pHMM search)
#   Cluster                # Members   Identity (%)   Alignment Length   E-value
1   phalp2_33221 (5jbqV)   447         43,0%          151                5.862E-35
2   phalp2_28113 (7zmZV)   50          42,5%          155                8.003E-35
3   phalp2_38866 (1Mo76)   29          37,1%          242                6.930E-29
4   phalp2_22247 (7g4te)   1           36,7%          223                2.394E-28
5   phalp2_34307 (3A2Hz)   178         40,0%          165                1.164E-25
6   phalp2_38353 (ATmM)    19          35,8%          156                7.535E-23
7   phalp2_34948 (4CY2e)   4           32,9%          167                7.169E-17
8   phalp2_4127 (1EJtW)    7           33,7%          178                3.392E-13

Domains

Annotated domains: GH24 (remaining regions unannotated)
Representative sequence (used for alignment): 5l2Hd (243 AA)
Member sequence: 871Uj (228 AA)
[Domain diagram: positions 1 to 243 AA follow the representative sequence; the member sequence is drawn to the same axis. Legend: EAD, CBD, Linker, Disordered, Unannotated]
Pfam accessions: PF00959

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 871Uj
Method: AlphaFoldv2
Resolution: 93.10
Chain position: -
Model Confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50

The structures below correspond to the cluster representative (5l2Hd) rather than this protein.
PDB ID: 5l2Hd
Method: AlphaFoldv2
Resolution: 84.13
Chain position: -
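
The Model Confidence bands can be applied to a score programmatically. The sketch below is a minimal illustration under two assumptions the page does not confirm: that the value listed under "Resolution" for these AlphaFold models is a mean pLDDT score, and that a score lying exactly on a threshold (90, 70 or 50) falls into the lower band, since the legend uses strict inequalities and leaves those cases undefined. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must lie between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: exact threshold values drop to the lower band,
    # so a score of exactly 50 is reported as "Very low".
    return "Very low"


# Values listed on this page, read here as mean pLDDT (an assumption).
for model_id, score in [("871Uj", 93.10), ("5l2Hd", 84.13)]:
    print(model_id, plddt_band(score))
```

Under these assumptions the model for 871Uj falls in the "Very high" band and the model for the representative 5l2Hd in the "High" band.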