Protein

Protein accession
71gvo [EnVhog]
Representative
3emtD
Source
EnVhog (cluster: phalp2_13271)
Protein name
71gvo
Lysin probability
91%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MYLEIGTMGTDVAQLQEDLNFCAYDAGPVDGDFGSQTQEAVLALQAYHGLEPDGIYGNLTDAALMGEIKTIQEALTKNGYCIAVDGAIGSETLAATTGFQKKNGLSVDGIVGNATLKALGIEIPPAANQTGTTETPSIVKPLPAGNGNGQLVCINPGHGGSDPGACGNLQEKDMNLVVSMRLGQLLKERGFSVAYTRTDDRWMALSDRPAIANESNADIFVSIHHNGSANPESSGTLVICYPGSTNGLRLAQLVLYGMCNCMGLANRGIIQRDDSDVTYSNMPAIITEGLFATSPSDCHFFNNGGAELEAQGILEGILAYFNS
Physico-chemical properties
protein length: 323 AA
molecular weight: 33877,6 Da
isoelectric point: 4,39
hydropathy: -0,07
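The record does not say how the hydropathy value is derived. A common convention is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues; the sketch below assumes that convention, which may differ from what this database actually uses. The function name `gravy` is illustrative and not part of any database API.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the record's "hydropathy" field is a GRAVY score.
    Raises KeyError on non-standard residue codes (e.g. X, B, Z).
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Toy peptide, not taken from the record: one Ala and one Arg.
print(round(gravy("AR"), 2))  # -1.35
```

Passing the full protein sequence from the record to `gravy` gives a value that can be compared with the listed hydropathy; a mismatch would suggest a different scale or averaging method.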
Representative Protein Details
Accession
3emtD
Protein name
3emtD
Sequence length
334 AA
Molecular weight
35340,09260 Da
Isoelectric point
5,12576
Sequence
MNYYTNIVQFNDEGDDVLFLQRMLRIVDCDPGVLDGDFGNATAAATGEFQADYGLEVDKICGPNTWHKLYDRVAEIQTALNKFGYKLVVDGNIGSSGVATIHALQNFQSTHGLDTDWICGPSTSAKLGIKATGKSAVAVQTANTAVAKSPTVKTGSLSGKKFYISQGHGGSDGGASGNGLTEKNITLDLGFKVGALLQQKGATVMLSRSGDYYKSLNSRTSEANSWGADYFVSIHENAFSDTSVNGTEIWYYAGSTVGAKMASSVCNSLVSTLGSKSRGIKTGDLWEINQTNMPAILCEGLFITNPIDAGKMDSDADLYKQAVAIVNGLVDAVS
Other Proteins in cluster: phalp2_13271
Total (incl. this protein): 11; Avg length: 318,6 AA; Avg pI: 6,06

Protein ID Length (AA) pI
3emtD 334 5,12576
1rwff 325 4,42136
4lLcd 320 4,56999
4xH0B 323 4,64650
71mCI 325 4,56937
7BmYX 300 9,87735
7hC2B 325 4,63871
7mw9G 336 4,59546
84M6T 297 9,90378
8pCp1 297 9,87799
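The cluster summary can be reproduced from the table plus this protein's own values (323 AA, pI 4,39). The sketch below is a minimal check, assuming the averages are plain arithmetic means over all 11 members; `parse_decimal` and `cluster_summary` are illustrative helper names, and the decimal-comma handling reflects the number format used in this record.

```python
from statistics import mean


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma number such as '4,39' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI) copied from the record.
# The first entry is this protein; the rest are the listed cluster members.
MEMBERS = [
    ("71gvo", 323, "4,39"),
    ("3emtD", 334, "5,12576"),
    ("1rwff", 325, "4,42136"),
    ("4lLcd", 320, "4,56999"),
    ("4xH0B", 323, "4,64650"),
    ("71mCI", 325, "4,56937"),
    ("7BmYX", 300, "9,87735"),
    ("7hC2B", 325, "4,63871"),
    ("7mw9G", 336, "4,59546"),
    ("84M6T", 297, "9,90378"),
    ("8pCp1", 297, "9,87799"),
]


def cluster_summary(members):
    """Return (member count, mean length, mean pI), rounded as in the record."""
    lengths = [length for _, length, _ in members]
    pis = [parse_decimal(pi) for _, _, pi in members]
    return len(members), round(mean(lengths), 1), round(mean(pis), 2)


print(cluster_summary(MEMBERS))  # (11, 318.6, 6.06)
```

The means match the summary line. Three members (7BmYX, 84M6T, 8pCp1) have a pI near 9,9 while the rest fall between 4,39 and 5,13, so the average pI of 6,06 is not typical of any single member.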
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_21129 (DfJM) 1 26,7% 280 6.389E-27
2 phalp2_17729 (7ikmD) 1 31,8% 276 5.287E-26

Domains

Representative sequence (used for alignment): 3emtD (334 AA)
Member sequence: 71gvo (323 AA)
Axis: residues 1 to 334 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01471, PF01520

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3emtD) rather than this protein.
PDB ID: 3emtD
Method: AlphaFoldv2
Resolution: 89.16
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
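
The confidence legend maps a pLDDT score to one of four bands. The sketch below encodes those bands as a lookup; the function name `plddt_band` is illustrative. The legend does not say which band the exact boundary values 90, 70 and 50 belong to, so placing them in the lower band is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend.

    Assumption: boundary values (exactly 90, 70 or 50) fall into the
    lower band, since the legend only uses strict inequalities.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


print(plddt_band(95.0))  # Very high
print(plddt_band(60.0))  # Low
```

The record lists 89.16 under "Resolution" for an AlphaFold model. If that figure is in fact a mean pLDDT (an assumption, since the record does not say so), `plddt_band(89.16)` would place the representative model in the "High" band.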