Protein

Protein accession
f5V4 [EnVhog]
Representative
3OJQS
Source
EnVhog (cluster: phalp2_18068)
Protein name
f5V4
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTTIPDAIIDAAKASQAKWGIPASISIAQWAVESGYGRHMPAGSNNPFGIKARAGEPSVGARTREVLGGKTVYITDGFRKFDSIAEAFDKHGEFLATRTPYANARTKLPDPDAFADALTGVYATDPNYGAVLKGLMRSANLYQHNDASAVPAQSQAPAHPLLKLGSRGGSVHELQEKLGIAADGIFGPATEAAVKALQSRKGLSPDGIVGPRTWEALK
Physico-chemical properties
protein length: 218 AA
molecular weight: 22909,6 Da
isoelectric point: 9,07
hydropathy: -0,26
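The property values in this record use a decimal comma (22909,6 Da; 9,07; -0,26), while the E-values in the similar-clusters table use a decimal point. As a minimal sketch for reading such fields programmatically, the helper below accepts either separator and splits off a trailing unit. The name `parse_measurement` is hypothetical and not part of the source page.

```python
import re
from typing import Tuple

# Hypothetical helper (not from the source page): parses values such as
# "22909,6 Da", "9,07", "-0,26" or "2.514E-40" into (number, unit).
_VALUE = re.compile(r"\s*(-?\d+(?:[.,]\d+)?(?:[eE][-+]?\d+)?)\s*(\S*)\s*")

def parse_measurement(text: str) -> Tuple[float, str]:
    """Return the numeric value and the unit (empty string if absent)."""
    match = _VALUE.fullmatch(text)
    if match is None:
        raise ValueError(f"not a recognised measurement: {text!r}")
    number, unit = match.groups()
    # Normalise a decimal comma to a decimal point before conversion.
    return float(number.replace(",", ".")), unit

print(parse_measurement("22909,6 Da"))
print(parse_measurement("-0,26"))
```

The sketch assumes a comma is always a decimal separator and never a thousands separator, which holds for the values shown in this record.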
Representative Protein Details
Accession
3OJQS
Protein name
3OJQS
Sequence length
283 AA
Molecular weight
29312,08300 Da
Isoelectric point
8,96125
Sequence
MTAGTFPLDIVEAAQAGEAKWRVPASVCLAQWALESNWGAAMPPGSNNPFGIKAAAGQPCVACGTHEDVGGRMVAITAKFRAFGSIADAFDQHAQLLATSHYYEAARRVLPDADRFAEQLTGVYATDHLYDTKLRSIMKAHDLYRWDTAGEDVASAVVRAVPPPAPVARRALALGATGPCVVGLQTALTAARTPVAADGTFGPATDGAVRGFQAASGLLVDGKAGPRTWDALDRAGGKAVRPSPPPPGSPRPVPATPIAGPHLLPAAPGFWTRLVAALTRKAV
Other Proteins in cluster: phalp2_18068
Total (incl. this protein): 2; Avg length: 250,5; Avg pI: 9,02

Protein ID Length (AA) pI
3OJQS 283 8,96125
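The cluster averages can be checked against the two members reported in this record (f5V4: 218 AA, pI 9,07; 3OJQS: 283 AA, pI 8,96125). The short sketch below recomputes them; the variable names are illustrative only.

```python
from statistics import mean

# (accession, length in AA, isoelectric point), values copied from this record
members = [
    ("f5V4", 218, 9.07),
    ("3OJQS", 283, 8.96125),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

print(f"Avg length: {avg_length:.1f}")  # Avg length: 250.5
print(f"Avg pI: {avg_pi:.2f}")          # Avg pI: 9.02
```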
Similar Clusters (pHMM search)
#  Cluster                # Members  Identity (%)  Alignment Length  E-value
1  phalp2_37565 (3PfJM)   2          40,0%         260               2.514E-40
2  phalp2_36783 (6JiwL)   2          27,7%         238               7.270E-19
3  phalp2_31680 (3dQYZ)   3          29,7%         232               2.675E-17
4  phalp2_17668 (6SS3O)   22         30,0%         226               4.285E-15
5  phalp2_39880 (1c4km)   5          30,2%         205               8.331E-14
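To work with the similar-cluster hits programmatically, one option is to load the rows into a small record type and filter them by E-value. The sketch below uses the five rows listed above; the `Hit` type, the `significant_hits` function and the cutoff of 1e-15 are assumptions made for illustration and are not thresholds used by the source database.

```python
from typing import List, NamedTuple

class Hit(NamedTuple):
    cluster: str
    accession: str        # accession shown under the cluster ID
    members: int
    identity_pct: float
    alignment_length: int
    e_value: float

# Rows copied from the similar-clusters table of this record.
hits = [
    Hit("phalp2_37565", "3PfJM", 2, 40.0, 260, float("2.514E-40")),
    Hit("phalp2_36783", "6JiwL", 2, 27.7, 238, float("7.270E-19")),
    Hit("phalp2_31680", "3dQYZ", 3, 29.7, 232, float("2.675E-17")),
    Hit("phalp2_17668", "6SS3O", 22, 30.0, 226, float("4.285E-15")),
    Hit("phalp2_39880", "1c4km", 5, 30.2, 205, float("8.331E-14")),
]

E_VALUE_CUTOFF = 1e-15  # hypothetical cutoff, chosen only for this example

def significant_hits(rows: List[Hit], cutoff: float = E_VALUE_CUTOFF) -> List[Hit]:
    """Keep hits with an E-value below the cutoff, most significant first."""
    return sorted((h for h in rows if h.e_value < cutoff), key=lambda h: h.e_value)

for hit in significant_hits(hits):
    print(hit.cluster, hit.e_value)
```

With this cutoff, the first three clusters are retained and the last two are dropped.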

Domains

[Domain architecture diagram: tracks for GLUCO, PG_1, and Disordered region; axis 1 to 283 AA (representative)]
Representative sequence (used for alignment): 3OJQS (283 AA)
Member sequence: f5V4 (218 AA)
Domain positions follow the representative sequence; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01471, PF01832

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID f5V4
Method AlphaFoldv2
Resolution 88.73
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
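The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows. Two assumptions are made here that the page does not state: the value listed under "Resolution" for the AlphaFoldv2 models is treated as a mean pLDDT score, and scores falling exactly on a boundary (90, 70, 50), which the legend leaves unassigned, are placed in the lower band. The function name `confidence_band` is hypothetical.

```python
def confidence_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    Boundary values (exactly 90, 70 or 50) are not covered by the legend;
    assigning them to the lower band is an assumption of this sketch.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"

# Values listed under "Resolution" in this record, read as mean pLDDT (assumption).
for accession, score in [("f5V4", 88.73), ("3OJQS", 72.29)]:
    print(accession, confidence_band(score))
```

Under these assumptions both models fall in the "High" band.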

The structures below correspond to the cluster representative (3OJQS) rather than this protein.
PDB ID 3OJQS
Method AlphaFoldv2
Resolution 72.29
Chain position -