Protein

Protein accession
86e1X [EnVhog]
Representative
op4C
Source
EnVhog (cluster: phalp2_26224)
Protein name
86e1X
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKINTEFISKKCTYAGQNSPKYIVIHETDNFSKGADAGRHAQAQAAGHLSTSVHYYAGSDGVYQAAEHTDGTYSVGREYGGAHAVKDATNRNTINIEICVNEDGDYTTARENAIELVKHLIQTTGIPAERVIRHFDAKGKYCPRKMMDKPELWEDFKRQIGQASVQQEHTKPDQVSEDKEKAVWYRVGTGWKNGICQKQTGAYHKKDFAIADCQPGQKVYDESGKVIYSAGKVEATDICPAYTQKNFIKDVQSATGSKVDGMAGDETIGNTITVSATKNRKHPVVVPLQKRLNTLNYNCGAVDGIAGPKFTAAVNTYQKKVLGYKNLDGEITAKKKMWKSLLGMI
Physico‐chemical
properties
protein length:345 AA
molecular weight:38037,4 Da
isoelectric point:8,90
hydropathy:-0,63
Representative Protein Details
Accession
op4C
Protein name
op4C
Sequence length
362 AA
Molecular weight
39727,80870 Da
Isoelectric point
8,55504
Sequence
MNINKDYISTQNTYTGKNSPKYIVIHETDNFSAGADAQRHASAQAAGHLSTSVHYYSGSDGVYQAASHTDGTLSVGREYGGNHAIHDAANRNTINIEICVNPDGDYAKARSNAIELVKYLIQQTGIPAERVIRHFDAKGKYCPRNMMDNPALWEDFKKQIGQAPAEQQESTKPAQPSQAPEDKKEVWYRVGSGWKNGICLNQTGAYHNKDFAIADCKPGQNVYDKKGTVIYSRENTAGKTSGENTPAADGTVYTQKQFILDVQKATGSNPDGIAGDETLGNTVTVSRSTHKYHAVVTPLERRMKALGYYRGEVEADRGEPPCFGKGMESAVISYQKKVLKYKNTDGEITAGKKMWKSLLGMI
Other Proteins in cluster: phalp2_26224
Total (incl. this protein): 2 Avg length: 353,5 Avg pI: 8,73

Protein ID Length (AA) pI
op4C 362 8,55504
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_17898
EGF4
130 33,8% 354 1.386E-39
2 phalp2_5925
64hKp
120 30,1% 362 5.446E-38
3 phalp2_5899
5KvkK
133 37,5% 232 2.882E-36
4 phalp2_6450
8Jtqs
137 31,1% 276 4.575E-27
5 phalp2_24835
7sl6i
39 29,5% 359 6.175E-27
6 phalp2_26828
3TOBT
44 31,3% 252 1.853E-19
7 phalp2_36615
23Gpv
52 28,2% 255 2.588E-18
8 phalp2_36746
6AI9B
16 28,5% 336 1.114E-17
9 phalp2_26510
41c23
2 29,6% 233 3.614E-11
10 phalp2_17183
3bJIf
458 27,9% 268 2.478E-08

Domains

Domains
Ami2
Unannotated
Unannotated
Representative sequence (used for alignment): op4C (362 AA)
Member sequence: 86e1X (345 AA)
1 362 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (op4C) rather than this protein.
PDB ID
op4C
Method AlphaFoldv2
Resolution 85.39
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50