Protein

Protein accession
22ydC [EnVhog]
Representative
xF2f
Source
EnVhog (cluster: phalp2_7945)
Protein name
22ydC
Lysin probability
99%
PhaLP type
endolysin
Probability: 66% (predicted by ML model)
Protein sequence
MSNESAQIEIDEGRVMESYVDTEGHLTGGVGHLLTDAEKALYPEGTPIPKEVVDEWYRIDLEEAEADADKFYKTDDPNLKAILTNMSFNLGSTRLGKFVKLKEALIAGDYEEAGAQMLDSKWAGQVKGRATRLIERMTSLAPEPEAQPEPEPEQPTYGQYRSRAKAAYNSGDKDQAKAILREAKGMGLSPSAEDTESFEVFSRKPLEK
Physico‐chemical
properties
protein length:208 AA
molecular weight:23049,4 Da
isoelectric point:4,69
hydropathy:-0,73
Representative Protein Details
Accession
xF2f
Protein name
xF2f
Sequence length
242 AA
Molecular weight
27457,46920 Da
Isoelectric point
4,92774
Sequence
MPRLPYPYEDENLIPEMLLKNKDAGYEVPQSIAREALVPEQTLQEKEDQAYWENSELNQIKFDEGFEPKVYPDSVGIDTIGYGFNLERPEAQTELDHAGINKTVADLRSKKESLTEAEADILIRSEVPKFESSAINFVGEETWNSLPKDKQNILTNMAFNLGSTRLNKFKDFRSALQAGDYEKAKNEMHDSTWRKQVKSRATRLMDRMKAPLQETAKAPINPNFNLQNAIAQALGSVMEQGG
Other Proteins in cluster: phalp2_7945
Total (incl. this protein): 3 Avg length: 220,3 Avg pI: 4,74

Protein ID Length (AA) pI
xF2f 242 4,92774
1ahA7 211 4,59375
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_11886
2cyBF
52 37,8% 161 1.488E-27
2 phalp2_30135
3hDmC
1 41,4% 176 4.472E-26
3 phalp2_8727
8l77Y
27 41,8% 172 3.886E-25
4 phalp2_19234
42usC
219 37,0% 151 8.492E-24
5 phalp2_13430
4B1BD
32 38,2% 157 3.961E-23
6 phalp2_2924
QDRo
8927 33,1% 151 6.309E-22
7 phalp2_16834
1uDRT
24 36,0% 150 1.358E-20
8 phalp2_26128
8oWY
1 33,3% 204 1.358E-20
9 phalp2_35438
8DSMF
6 34,3% 160 1.573E-19
10 phalp2_6283
6Psj3
6 35,7% 179 2.899E-19

Domains

Domains
Disordered region
GH24
Representative sequence (used for alignment): xF2f (242 AA)
Member sequence: 22ydC (208 AA)
1 242 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
22ydC
Method AlphaFoldv2
Resolution 87.29
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (xF2f) rather than this protein.
PDB ID
xF2f
Method AlphaFoldv2
Resolution 79.56
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50