Protein

Protein accession
3oV6f [EnVhog]
Representative
4z8rc
Source
EnVhog (cluster: phalp2_7376)
Protein name
3oV6f
Lysin probability
99%
PhaLP type
endolysin
Probability: 96% (predicted by ML model)
Protein sequence
MAKSRQTVVNLVKSWDGKKESNGSHKSIIDLYNDFFEEICSGKFPRGIRMRYDWAWCACTWSALAAALRYESIMPMEISCYYLIEAAKKMGCWQENDAYVPSPGDGILYDWQDNGIGDNTGNPDHVGTVIEVHKESGXXXPVTWLLKKATTVMPLRRERCLLTENLSAVSSRQSTMTTQLPSLN
Physico‐chemical
properties
protein length:184 AA
molecular weight: Da
isoelectric point:6,30
hydropathy:
Representative Protein Details
Accession
4z8rc
Protein name
4z8rc
Sequence length
139 AA
Molecular weight
15712,62540 Da
Isoelectric point
5,12326
Sequence
MAKSRQAVVNLVESWDGKKESNGSHKSIIDLYNDFFEKICAGKFPRGIRMRYDWAWCACTWSALAAALRYESIMPMEISCYYLIEAAKKMGCWQENDAYVPSPGDAILYDWQDNGIGDNAGNPDHVGTVIEVHKESGYM
Other Proteins in cluster: phalp2_7376
Total (incl. this protein): 8 Avg length: 152,8 Avg pI: 6,41

Protein ID Length (AA) pI
4z8rc 139 5,12326
3fXWb 162 8,61157
3tk34 151 5,02374
4kp5j 153 6,56686
5UbjH 172 6,58351
7DGtc 152 4,89761
8tTa0 109 8,16126
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_10363
1FVFV
9 40,0% 125 4.111E-37
2 phalp2_31234
8eyJQ
2 42,2% 97 3.010E-27
3 phalp2_37240
3uYls
10 44,8% 87 4.125E-27
4 phalp2_19200
23OeP
1 38,4% 138 3.748E-26
5 phalp2_34785
3qFtk
3 33,5% 128 1.354E-22
6 phalp2_19272
7YZUV
4 29,3% 133 3.155E-21
7 phalp2_36066
6qvgL
4 31,6% 101 4.322E-21
8 phalp2_528
nLg4
1 32,4% 108 7.407E-17
9 phalp2_2996
1ditH
38 34,8% 109 3.940E-14
10 phalp2_21324
1KFzg
4 27,8% 147 7.196E-11

Domains

Domains
Unannotated
Representative sequence (used for alignment): 4z8rc (139 AA)
Member sequence: 3oV6f (184 AA)
1 139 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
3oV6f
Method AlphaFoldv2
Resolution 75.30
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4z8rc) rather than this protein.
PDB ID
4z8rc
Method AlphaFoldv2
Resolution 95.02
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50