Protein

Protein accession
1qOUK [EnVhog]
Representative
2S9MI
Source
EnVhog (cluster: phalp2_5428)
Protein name
1qOUK
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MPDLFTAKTMILGPRFMHDVGCQDFQSAGCFGNLGLESGGYLYLQEIGHHHGHGGWGLGMWTGARRTAMLNWCAAHKFSPSSDEGNYGFLLHELQTTEAGALAALKRARTLEEATSVFMSRFERPGKPNLRGRLAYARKALTAIRMGLGSHQAGLAKAAAPTPAPPPRARRKPIADKHRGHH
Physico‐chemical
properties
protein length:182 AA
molecular weight:19811,5 Da
isoelectric point:10,02
hydropathy:-0,41
Representative Protein Details
Accession
2S9MI
Protein name
2S9MI
Sequence length
251 AA
Molecular weight
26494,86230 Da
Isoelectric point
5,17982
Sequence
MSDRSFDMPPVSAGFDARGGWLLNRLMAEPELGLTNPIHAAAIVGNLGGESGLEAINERHPIEPGSRGGWSWAQWTGSRRDQFEAYAATRGLPLTSDQAAYEFLVKELLGTEAHALQQTKKTTALDAAVYTFEVLFERPSDPEGGLPDRMAFAKKALAAAGVKVFVPALPAIPAQPIEVPVMPSPSPAPPPPQSATFHPKFTASAIGGALALIVITELNRRGITIDGNEGASITLLIGALSAWLAPNLPSE
Other Proteins in cluster: phalp2_5428
Total (incl. this protein): 7 Avg length: 213,9 Avg pI: 6,81

Protein ID Length (AA) pI
2S9MI 251 5,17982
4Feci 222 5,40308
4FjXx 228 5,99460
6DtDf 232 5,29463
7lK2z 182 10,01615
gyvI 200 5,78311
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_6943
2SsmN
26 45,3% 183 3.363E-55
2 phalp2_17396
4LvTH
16 47,9% 171 1.948E-50
3 phalp2_20850
6DAdC
1 33,1% 211 1.087E-21
4 phalp2_3546
4AJc3
7 29,5% 193 2.380E-15
5 phalp2_31371
2Crlg
15 33,5% 176 1.959E-14
6 phalp2_14068
8qK0c
83 23,1% 186 5.757E-05

Domains

Domains
Representative sequence (used for alignment): 2S9MI (251 AA)
Member sequence: 1qOUK (182 AA)
1 251 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF18013

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2S9MI) rather than this protein.
PDB ID
2S9MI
Method AlphaFoldv2
Resolution 77.07
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50