Protein

Protein accession: 5DS2x [EnVhog]
Representative: 4EL04
Source: EnVhog (cluster: phalp2_37813)
Protein name: 5DS2x
Lysin probability: 95%
PhaLP type: endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MGTSAQGKDFSSYQAPVTSADLDGLAFSFTKATEGPGVTDPHFTANWATIKAKGIHRGAYHELWSAGSAPVASQAAHFLATVKAAGLERGDMLAVVASDYGGVTGAEVKAWCDIVAAACPGCKVLVYSDLSTLAALTSCTGYPLWAAWPSGTAPESVAPWDRWTFWQWGSPGGVDCDAFNGTPEELTAWLGSVAAPAPAPAANWTEKLVNELPQITTGSANANAVRTVQGLCGARGHAVPVDGSFGPVTEAAVRDVQAAAKVTVDWIVGPVTWGALLGV
Physico-chemical properties
protein length: 279 AA
molecular weight: 28687.9 Da
isoelectric point: 4.90
hydropathy: 0.13
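The page does not say how the hydropathy value is computed. It is presumably a grand average of hydropathy (GRAVY), that is, the mean per-residue hydropathy index over the sequence. The following is a minimal sketch under that assumption, using the Kyte-Doolittle scale (also an assumption). The function name `gravy` is illustrative and is not part of PhaLP.

```python
# Kyte-Doolittle hydropathy index per residue.
# Assumption: the page does not state which hydropathy scale it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence contains a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Example on the first ten residues of the sequence listed on this page.
print(round(gravy("MGTSAQGKDF"), 2))
```

Passing the full 279-residue sequence to `gravy` gives a value that can be compared with the reported hydropathy. A mismatch would suggest the page uses a different scale.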
Representative Protein Details
Accession: 4EL04
Protein name: 4EL04
Sequence length: 338 AA
Molecular weight: 36476.81970 Da
Isoelectric point: 5.63760
Sequence
MSAMGVDLSNFQTIRVGEWAGLDFGFVKATEGLGFIDSTLRARWATMRGALPCQGAYHFLHPGESGRAQAEFMWAVLAPLGVHKGDMIMCDSEITAGVIKALRGKRMHAGPNRQDVATRAPFTTTTVGFQTKAFLDRMRELAPAGVRIGVYTNLTVAQSLGNCTNYPLWIARPGSSWPSSVFPWVRGPKTNPFWQFDFTPRDQDAYDGTRRELLDWLGLTITPAPDPEPPAPSDDEDEDMPQLNTGTNAVTAIACTGGTKSFIGLLADPGQEGVNTVVLRVAVFSNSKGWSQIVDKVTIGTGKDSKQSVGFTEPDVAGVAITRISNGAADLTSVSYNW
Other Proteins in cluster: phalp2_37813
Total (incl. this protein): 15; Avg length: 310.5; Avg pI: 5.52

Protein ID  Length (AA)  pI
4EL04       338          5.63760
35lWw       280          4.62876
3LAGY       284          4.71402
3g4Iq       281          5.68415
4C3a7       348          8.52680
4E2wg       292          5.02789
4EIOr       331          6.44534
4EcTa       294          4.82281
4EiPL       380          8.31682
4EjhB       306          4.75011
4QSsO       301          4.69805
4eFDj       307          5.13611
5orGT       283          4.70305
5y0nz       354          4.84816
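The cluster summary counts 15 proteins, while the table lists 14: the difference is this protein (5DS2x, 279 AA, pI 4.90), which the summary includes. As a quick check, the short sketch below recomputes the averages from the table rows plus this protein's own values. The variable names are illustrative.

```python
# (protein ID, length in AA, pI) for the 14 other cluster members, copied from the table.
others = [
    ("4EL04", 338, 5.63760), ("35lWw", 280, 4.62876), ("3LAGY", 284, 4.71402),
    ("3g4Iq", 281, 5.68415), ("4C3a7", 348, 8.52680), ("4E2wg", 292, 5.02789),
    ("4EIOr", 331, 6.44534), ("4EcTa", 294, 4.82281), ("4EiPL", 380, 8.31682),
    ("4EjhB", 306, 4.75011), ("4QSsO", 301, 4.69805), ("4eFDj", 307, 5.13611),
    ("5orGT", 283, 4.70305), ("5y0nz", 354, 4.84816),
]

# This protein, taken from its physico-chemical properties on this page.
this_protein = ("5DS2x", 279, 4.90)

members = others + [this_protein]
avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(pi for _, _, pi in members) / len(members)

print(len(members), round(avg_length, 1), round(avg_pi, 2))  # 15 310.5 5.52
```

The recomputed values agree with the reported summary, which confirms that the averages are taken over all 15 members, not only the 14 listed.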
Similar Clusters (pHMM search)
#   Cluster               # Members  Identity (%)  Alignment Length  E-value
1   phalp2_33074 (4Ed1q)  2          32.5%         258               3.318E-73
2   phalp2_38483 (1lMz0)  2          32.0%         234               6.392E-60
3   phalp2_16368 (5DRpK)  26         37.6%         242               2.682E-58
4   phalp2_39912 (1jXPT)  300        24.4%         241               3.257E-33
5   phalp2_20360 (2RVWF)  69         26.0%         219               2.031E-32
6   phalp2_17720 (7d39Z)  47         25.3%         229               1.365E-28
7   phalp2_4947 (6Eg7U)   122        27.9%         236               4.248E-26
8   phalp2_25079 (1fQ0R)  67         27.2%         220               4.248E-26
9   phalp2_25051 (16dGB)  2          25.7%         214               6.374E-25
10  phalp2_38445 (1ecsY)  3          27.6%         257               1.162E-24

Domains
Domains shown: GH25, Unannotated
Representative sequence (used for alignment): 4EL04 (338 AA)
Member sequence: 5DS2x (279 AA)
Axis: 1 to 338 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01183

Taxonomy

       Name                     Taxonomy ID     Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4EL04) rather than this protein.
PDB ID: 4EL04
Method: AlphaFold v2
Resolution: 76.38
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
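
The confidence legend above maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows. The function name `plddt_band` is illustrative, and the handling of scores exactly at 90, 70, or 50 is an assumption, since the legend uses strict inequalities and leaves those values unassigned.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    # Assumption: boundary values (exactly 90, 70, 50) fall into the lower band.
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.0, 76.38, 60.0, 10.0):
    print(score, plddt_band(score))
```

If the value 76.38 reported for the representative model is in fact a mean pLDDT (an assumption; the page labels it as a resolution), it would fall in the "High" band.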