Protein

Protein accession
4RAgQ [EnVhog]
Representative
2WBFl
Source
EnVhog (cluster: phalp2_24236)
Protein name
4RAgQ
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MGNLQQEHNFKTDDVPGGLGIAQWIGARREQLIQRGNYLDLNMQLDYIVWELNNTETYAHTQLKAQSTVEGATIAFQNYYERCGDCRQSQRIQYAYDIFGRN
Physico-chemical properties
protein length: 102 AA
molecular weight: 11866,0 Da
isoelectric point: 5,21
hydropathy: -0,74
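The record does not say how these values were derived. As a minimal sketch, the length, molecular weight, and hydropathy can be recomputed from the sequence above, assuming the hydropathy is a GRAVY score on the Kyte-Doolittle scale and the molecular weight is the sum of average residue masses plus one water. The function names and the mass table are illustrative choices, not part of the database.

```python
# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (amino acid minus one water); an assumed table.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01524  # added once for the free N- and C-termini

# Sequence of 4RAgQ, copied from the record above.
SEQUENCE = (
    "MGNLQQEHNFKTDDVPGGLGIAQWIGARREQLIQRGNYLDLNMQLDYIVW"
    "ELNNTETYAHTQLKAQSTVEGATIAFQNYYERCGDCRQSQRIQYAYDIFGRN"
)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Sum of average residue masses plus one water molecule, in Da."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


if __name__ == "__main__":
    print(f"protein length: {len(SEQUENCE)} AA")
    print(f"molecular weight: {molecular_weight(SEQUENCE):.1f} Da")
    print(f"hydropathy (GRAVY): {gravy(SEQUENCE):.2f}")
```

Under these assumptions the results should land close to the values listed above; small differences in the molecular weight can come from the mass table used. The isoelectric point is left out because it depends on the chosen pKa set.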
Representative Protein Details
Accession
2WBFl
Protein name
2WBFl
Sequence length
120 AA
Molecular weight
13490,99370 Da
Isoelectric point
5,25376
Sequence
VVPKAIVADNEQAVWDFLIVRFTRNQTAGIMGNLQQEHGFQTSGDGLAQWTGGRKSNLMSMENPYSLSTQLSFMVSEMSNINLPDDVTAATRIFQNQFERCGDCNEARRIEYAYAILGRH
Other Proteins in cluster: phalp2_24236
Total (incl. this protein): 7; Avg length: 117,9 AA; Avg pI: 5,61

Protein ID Length (AA) pI
2WBFl 120 5,25376
1lgPI 98 4,80957
2ha9F 139 5,72848
3f42A 151 5,10411
5sQHj 100 5,31566
8ioLM 115 7,85852
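The cluster summary counts this protein (4RAgQ, 102 AA, pI 5,21) together with the six members listed in the table. A short sketch of how the averages can be reproduced, assuming they are plain arithmetic means; `parse_decimal` is a hypothetical helper for the decimal-comma notation used in this record.

```python
def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '5,25376' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as written in the record)
OTHER_MEMBERS = [
    ("2WBFl", 120, "5,25376"),
    ("1lgPI", 98, "4,80957"),
    ("2ha9F", 139, "5,72848"),
    ("3f42A", 151, "5,10411"),
    ("5sQHj", 100, "5,31566"),
    ("8ioLM", 115, "7,85852"),
]
THIS_PROTEIN = ("4RAgQ", 102, "5,21")

members = OTHER_MEMBERS + [THIS_PROTEIN]

# Arithmetic means over all seven cluster members.
avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(parse_decimal(pi) for _, _, pi in members) / len(members)

if __name__ == "__main__":
    print(f"Total: {len(members)}")
    print(f"Avg length: {avg_length:.1f} AA")
    print(f"Avg pI: {avg_pi:.2f}")
```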
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_17072 (2f2jb) 9 51,1% 131 3.631E-52
2 phalp2_834 (7WW8c) 6 34,6% 124 7.610E-15
3 phalp2_33233 (5o1H8) 8 28,2% 117 3.340E-13
4 phalp2_3546 (4AJc3) 7 31,8% 135 1.066E-11
5 phalp2_37575 (3TNHr) 1 30,1% 136 5.141E-11
6 phalp2_22748 (89IiP) 3 24,1% 112 2.748E-08
7 phalp2_16088 (4aMjC) 1 28,4% 130 1.317E-07
8 phalp2_4146 (1KaIV) 1 30,2% 96 1.954E-05
9 phalp2_5720 (4JYpO) 2 25,2% 123 1.954E-05
10 phalp2_29265 (hzvN) 2 29,5% 98 6.791E-05
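When only the strongest pHMM hits are of interest, the similar-cluster rows can be filtered by E-value and identity. The sketch below transcribes the ten rows listed above; the `Hit` type, the `filter_hits` function, and both thresholds are illustrative assumptions rather than cut-offs used by the database.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    rank: int
    cluster: str
    protein_id: str
    members: int
    identity_pct: float
    alignment_length: int
    evalue: float


# Rows transcribed from the similar-clusters listing in this record.
HITS = [
    Hit(1, "phalp2_17072", "2f2jb", 9, 51.1, 131, 3.631e-52),
    Hit(2, "phalp2_834", "7WW8c", 6, 34.6, 124, 7.610e-15),
    Hit(3, "phalp2_33233", "5o1H8", 8, 28.2, 117, 3.340e-13),
    Hit(4, "phalp2_3546", "4AJc3", 7, 31.8, 135, 1.066e-11),
    Hit(5, "phalp2_37575", "3TNHr", 1, 30.1, 136, 5.141e-11),
    Hit(6, "phalp2_22748", "89IiP", 3, 24.1, 112, 2.748e-08),
    Hit(7, "phalp2_16088", "4aMjC", 1, 28.4, 130, 1.317e-07),
    Hit(8, "phalp2_4146", "1KaIV", 1, 30.2, 96, 1.954e-05),
    Hit(9, "phalp2_5720", "4JYpO", 2, 25.2, 123, 1.954e-05),
    Hit(10, "phalp2_29265", "hzvN", 2, 29.5, 98, 6.791e-05),
]


def filter_hits(hits: List[Hit], max_evalue: float, min_identity: float) -> List[Hit]:
    """Keep hits at or below max_evalue and at or above min_identity."""
    return [h for h in hits if h.evalue <= max_evalue and h.identity_pct >= min_identity]


# Example thresholds (arbitrary, for illustration only).
strong_hits = filter_hits(HITS, max_evalue=1e-10, min_identity=30.0)

if __name__ == "__main__":
    for hit in strong_hits:
        print(hit.rank, hit.cluster, f"{hit.identity_pct}%", f"{hit.evalue:.3E}")
```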

Domains

Representative sequence (used for alignment): 2WBFl (120 AA)
Member sequence: 4RAgQ (102 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
[Figure: domain architecture on an axis from 1 to 120 AA (representative). Legend: EAD, CBD, Linker, Disordered, Unannotated.]
Pfam accessions: PF18013

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2WBFl) rather than this protein.
PDB ID: 2WBFl
Method: AlphaFold v2
Resolution: 94.96
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
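The confidence bands above can be expressed as a small lookup. This is a sketch under two assumptions: the legend does not say where scores of exactly 90, 70, or 50 belong, so they are placed in the lower band here, and the value 94.96 listed under Resolution is treated as a pLDDT-style score, which the record itself does not confirm. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band from the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band,
    because the legend only gives strict inequalities.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # 94.96 is the value shown for the representative model 2WBFl.
    print(plddt_band(94.96))
```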