Protein

Protein accession
UKws [EnVhog]
Representative
UEQo
Source
EnVhog (cluster: phalp2_37057)
Protein name
UKws
Lysin probability
95%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MFPLTSKILIRGARAHELAGLGVAADYVADYVDLFSPFEGTLRSYSGVQGGLWLELTRPTGERLKMAHLSRYIVSSGKVKEGQLIAITGNTGTITTGPHLHLECYNNQGIRLDPEKYQWYNTDMTCQEELIEAKKEIIKLNTEIGKTIVDRDEWQQKAEYNYAEWQKCLDKPNDCTDLQAKIDRAKADLA
Physico‐chemical
properties
protein length:190 AA
molecular weight:21455,1 Da
isoelectric point:5,62
hydropathy:-0,47
Representative Protein Details
Accession
UEQo
Protein name
UEQo
Sequence length
199 AA
Molecular weight
21549,27660 Da
Isoelectric point
7,78522
Sequence
MRFPLDQRILTRGAAAHVAAGLGAACDYRADHVPFFAPCEGTVYHFGDPTNGGGYWMGFRRGDGVKLELAHLSRRDVPNGATVREGQHCGITGNTGTITSGPHLHLQIIEPNGHRIDPELLNWTTLTPTQMAIQQVVDAQKKTLKNVQDGIVAGAVDASTGKAYIIKNSKKIEKPIIEVLASLYVPYVKQETLNDIPNG
Other Proteins in cluster: phalp2_37057
Total (incl. this protein): 5 Avg length: 193,8 Avg pI: 7,02

Protein ID Length (AA) pI
UEQo 199 7,78522
1bh4l 190 6,07429
2cwXN 189 7,81784
8aqAo 201 7,82035
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_39085
2cikn
2 43,4% 122 8.039E-44
2 phalp2_31166
42gc4
1 43,7% 135 2.535E-36
3 phalp2_12119
5Fqwt
2 41,8% 122 2.535E-36
4 phalp2_32978
49AHS
6 23,3% 167 9.280E-12
5 phalp2_27718
6SAtZ
6 32,3% 139 5.767E-11
6 phalp2_26514
41roI
79 29,9% 147 1.059E-10
7 phalp2_20028
1oELM
3 28,7% 139 5.426E-09
8 phalp2_7549
5jaLN
16 26,4% 151 7.337E-09
9 phalp2_11629
1p97o
4 29,3% 133 1.341E-08
10 phalp2_33595
jjDT
1 24,4% 168 1.341E-08

Domains

Domains
PET_M23
Unannotated
Unannotated
Representative sequence (used for alignment): UEQo (199 AA)
Member sequence: UKws (190 AA)
1 199 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
UKws
Method AlphaFoldv2
Resolution 81.25
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (UEQo) rather than this protein.
PDB ID
UEQo
Method AlphaFoldv2
Resolution 84.79
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50