Protein

Protein accession
5dyqq [EnVhog]
Representative
2g6Rl
Source
EnVhog (cluster: phalp2_20307)
Protein name
5dyqq
Lysin probability
99%
PhaLP type
endolysin
Probability: 88% (predicted by ML model)
Protein sequence
MEKQFEYLLKEGAPKILVNALRLYGTAEIVGSKHNPVILDWAKGLGLEKTYTNDE
Physico‐chemical
properties
protein length:55 AA
molecular weight:6206,1 Da
isoelectric point:5,74
hydropathy:-0,32
Representative Protein Details
Accession
2g6Rl
Protein name
2g6Rl
Sequence length
77 AA
Molecular weight
8423,51280 Da
Isoelectric point
5,60850
Sequence
MDWLDTIGTLPRMIVEARRLVGTVERAGTANSPTILAWADETGLRASDGYNADSIPWCGLFMAVVAQRAGYTYPKHP
Other Proteins in cluster: phalp2_20307
Total (incl. this protein): 21 Avg length: 81,3 Avg pI: 8,26

Protein ID Length (AA) pI
2g6Rl 77 5,60850
1WAvX 86 9,05273
1xR2A 59 8,88550
4Mnax 81 5,76310
4Nzfp 62 6,55282
4oObJ 84 8,80865
4qkMO 113 9,01205
5eyAt 74 9,50665
5eyAv 61 8,88466
5tEvN 55 4,69737
6LYtv 69 8,57270
7BEgu 106 9,51329
7Gmqr 93 8,88769
7dam1 106 9,51329
7dd4W 72 8,66205
7dj3N 80 9,22976
7dt9B 103 9,54669
7dtIZ 88 8,97582
NKju 107 9,48390
jsz8 76 8,63955
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_20416
3dJBJ
17 44,2% 70 5.642E-30
2 phalp2_15008
70e39
2 34,0% 47 1.814E-12
3 phalp2_38373
LBuT
1 19,6% 66 5.711E-06
4 phalp2_5598
4cukV
91 31,3% 67 1.006E-04

Domains

Domains
Unannotated
Representative sequence (used for alignment): 2g6Rl (77 AA)
Member sequence: 5dyqq (55 AA)
1 77 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2g6Rl) rather than this protein.
PDB ID
2g6Rl
Method AlphaFoldv2
Resolution 95.95
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50