Protein

Protein accession
4DNQZ [EnVhog]
Representative
804oT
Source
EnVhog (cluster: phalp2_26528)
Protein name
4DNQZ
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MPFPEKSITGEYGTMSDYRRKNKMQAHSGTDWAPAGSNKGKTLIPAIADGTIKLAKWSDILGWVVVQTAADKNGKIWYIGYCHLSCAKHGINCKGGHDEDLALKVKVGDKVKAGDVSHGLTIGNTGLASSGAHLHATLGKTVKAVFGMTADKSDLKKAIIANGGAVAPKVTKKEAKAIIKGEAPKPAEKVEEKPVAKVEEVKPLVADTSRELTVEDWKKFQEILKRDHGYAGAIDGEPGVMTYKALQRSVVANGYNGAVDGIPGANTYKGVQRRLIAKGVYEGRVDGSWGPQTISALREAINKNLY
Physico‐chemical
properties
protein length:306 AA
molecular weight:32676,1 Da
isoelectric point:9,42
hydropathy:-0,38
Representative Protein Details
Accession
804oT
Protein name
804oT
Sequence length
305 AA
Molecular weight
32381,77410 Da
Isoelectric point
9,64152
Sequence
MSWVMPFPEKTITGEYGTLSAYRRKMKMQPHSGTDWAPGGSNKGKTLIPAIADGTIKLVQWSKVLGWVVVQTAADKDGKIWYIGYCHLACKKCGINCKGNHGADIALKVKVGDKVTAGDVTHGMTIGNTGNASSGAHLHATLGKAVKDVFGPTTAKSDLKKAIIANGGGAVAPKVDKKTAKAIIKGEAPKKAEETKPAEVKPLVADNSKELTVDDWKKFQQILQRDHGYTGAIDGDPGKKTWAAIQRSVAAFGYGGAVDGIPGPNTYKGVQRRLVARADYEGRIDGTWGPQTIAALRGAINSNKY
Other Proteins in cluster: phalp2_26528
Total (incl. this protein): 5 Avg length: 305,6 Avg pI: 9,55

Protein ID Length (AA) pI
804oT 305 9,64152
48AUE 310 9,54256
4l2so 303 9,63269
7JXY0 304 9,49228
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_27055
2drIt
1 36,8% 472 7.827E-87
2 phalp2_4378
2eAOD
228 32,2% 307 2.000E-18
3 phalp2_37479
36IRq
1 28,6% 293 6.586E-13
4 phalp2_428
7zQ20
48 29,5% 213 5.152E-07
5 phalp2_9460
zSNF
10 33,4% 206 1.612E-06
6 phalp2_30741
7uklG
2 25,3% 312 1.142E-04

Domains

Domains
Unannotated
Unannotated
Representative sequence (used for alignment): 804oT (305 AA)
Member sequence: 4DNQZ (306 AA)
1 305 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4DNQZ
Method AlphaFoldv2
Resolution 84.79
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (804oT) rather than this protein.
PDB ID
804oT
Method AlphaFoldv2
Resolution 85.65
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50