Protein

Protein accession
40dxp [EnVhog]
Representative
19ng1
Source
EnVhog (cluster: phalp2_23790)
Protein name
40dxp
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTSKLTRTAAGVIAAGMLATAPALAADPTDAMLRAAAEISAEYDLPAGLLLAVAEVESDFDPACRTGACLGLMQIHSKYAAGFARAAGMEDFDLYDPEDSLRIAASMLADYLTRYEGDLHFALMAYNLGEWGALAKLGAGVERTGYSRKVIGRMEKWACYGLPEPPKAEVTEAQAARARAFAVGVWERVREVLFR
Physico‐chemical
properties
protein length:195 AA
molecular weight:20912,8 Da
isoelectric point:5,03
hydropathy:0,09
Representative Protein Details
Accession
19ng1
Protein name
19ng1
Sequence length
184 AA
Molecular weight
20792,80640 Da
Isoelectric point
5,00049
Sequence
MKKVVVTIVVCILVVQFCMAASPDTNLVLMASEIEDKYDLPNGLLLAIAEVESDFNAGCKTGKCWGLMQIHSTYAPEYAKLAGMDEFDLFDPEDSMNIAAAMLRDYMDRYEGDIHFTLMAYNLGEWGARSRRSNGVQDTRYSRKVVSKIEEYANLETSTLARVMEDKITIMYGLRRTIQGVIFK
Other Proteins in cluster: phalp2_23790
Total (incl. this protein): 2 Avg length: 189,5 Avg pI: 5,01

Protein ID Length (AA) pI
19ng1 184 5,00049
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_664
1m3Kq
134 37,7% 122 1.090E-17
2 phalp2_31682
3e47a
166 29,7% 138 8.639E-14
3 phalp2_31447
3o77A
181 32,2% 124 5.498E-11
4 phalp2_22719
81sOI
7 30,0% 163 3.445E-10
5 phalp2_39704
bZih
3 31,5% 133 2.149E-09
6 phalp2_25082
1gCiB
6 32,4% 117 1.806E-08
7 phalp2_39988
1NzQM
1 32,7% 116 6.072E-08
8 phalp2_14786
6WPO0
6 29,6% 125 2.754E-07
9 phalp2_199
5H6GZ
1 28,0% 146 9.201E-07
10 phalp2_20872
6MzW9
10 28,5% 112 7.535E-06

Domains

Domains
Disordered region
SLT
Representative sequence (used for alignment): 19ng1 (184 AA)
Member sequence: 40dxp (195 AA)
1 184 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
40dxp
Method AlphaFoldv2
Resolution 85.41
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (19ng1) rather than this protein.
PDB ID
19ng1
Method AlphaFoldv2
Resolution 90.69
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50