Protein

Protein accession
4txPF [EnVhog]
Representative
6uecQ
Source
EnVhog (cluster: phalp2_4923)
Protein name
4txPF
Lysin probability
79%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MREIKRIILHCSATPPGMDIGADTIREWHVH
Physico‐chemical
properties
protein length:31 AA
molecular weight:3584,2 Da
isoelectric point:6,81
hydropathy:-0,25
Representative Protein Details
Accession
6uecQ
Protein name
6uecQ
Sequence length
36 AA
Molecular weight
4371,95900 Da
Isoelectric point
9,10205
Sequence
MRKINKIIVHCSATPEWQDVKTETIRDWHVNGNHWK
Other Proteins in cluster: phalp2_4923
Total (incl. this protein): 12 Avg length: 44,3 Avg pI: 8,12

Protein ID Length (AA) pI
6uecQ 36 9,10205
3yUIY 50 5,51852
4QD9n 47 9,22157
5O2R3 48 9,02024
5ZtxK 39 9,18386
6858X 59 7,82667
6eGkR 47 8,76946
6m4H5 31 7,99854
6s1oY 50 7,96199
7CrIU 51 6,81053
7VSlZ 42 9,18521
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3286
2K3pw
54 62,8% 35 2.937E-12
2 phalp2_39223
3yVOi
3 48,2% 29 4.540E-09
3 phalp2_36419
TLzK
3 42,8% 42 1.533E-07
4 phalp2_37115
1gJl9
12 38,7% 31 2.111E-07
5 phalp2_12823
7GXQq
1 46,6% 30 5.519E-07
6 phalp2_36262
3Agc
1 44,4% 36 5.519E-07
7 phalp2_5326
3xStJ
2 53,3% 30 1.363E-05
8 phalp2_4559
45WQ6
1 41,9% 31 2.591E-05
9 phalp2_18619
1vWE
2 56,5% 23 1.292E-04

Domains

Domains
Unannotated
Representative sequence (used for alignment): 6uecQ (36 AA)
Member sequence: 4txPF (31 AA)
1 36 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6uecQ) rather than this protein.
PDB ID
6uecQ
Method AlphaFoldv2
Resolution 94.84
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50