Protein

Protein accession
5BfSV [EnVhog]
Representative
3SNy4
Source
EnVhog (cluster: phalp2_34331)
Protein name
5BfSV
Lysin probability
96%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MDILGIGDASEYDTFSSTDWQKAADMGCKFGLVRATTTTFDYKLNKPFSREDIRHLANSQAMTFVGIDRWSYCWFDPRPAITAWEQADFFISAVNHAGGPGNQDFRVALDIEPSDSINYDPAGLVRARAWLDIVSQAGLKPSIYTYPSFLEKWNKPDIAWMKKYELIIAHWSINVPRCPYPWYPGGFLAWQYTASINGDLYGFHSSVPGKTAPRICLAVM
Physico‐chemical
properties
protein length:220 AA
molecular weight:24869,9 Da
isoelectric point:5,77
hydropathy:-0,22
Representative Protein Details
Accession
3SNy4
Protein name
3SNy4
Sequence length
211 AA
Molecular weight
23798,71250 Da
Isoelectric point
5,46464
Sequence
MTATGIDISSYSGNWNAQKTRDAGFEFATIRAATWDLKPLVDRNFETNYLKAEKAGLWRWVYAWTNPRAPLTPQADLFTTVCKGAEPELGYLLDLEDYKTCRGWRGVMSAYRAQWLEPVAAAVGQTPAIYINKSYADSYFVRPTKISAGDLWLMNYPLCIASWGGIAPSIPWPWLPGQWAAWQCVNDKTSGPHFGLESQECTLYISQAEMY
Other Proteins in cluster: phalp2_34331
Total (incl. this protein): 4 Avg length: 218,0 Avg pI: 6,09

Protein ID Length (AA) pI
3SNy4 211 5,46464
40p33 211 5,46464
4JwAo 230 7,64237
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14419
4AB58
4 33,1% 214 6.996E-30
2 phalp2_4651
4zcNr
1 30,5% 200 2.162E-23
3 phalp2_26214
jZqY
82 30,8% 214 9.264E-20
4 phalp2_20360
2RVWF
69 26,9% 193 1.288E-17
5 phalp2_5615
4g2xP
40 30,1% 202 1.752E-17
6 phalp2_39912
1jXPT
300 32,0% 184 3.242E-17
7 phalp2_36157
6TiaL
25 30,8% 185 5.997E-17
8 phalp2_10644
2AeOK
60 26,9% 189 1.292E-15
9 phalp2_29268
hFI3
21 26,0% 192 5.976E-15
10 phalp2_16368
5DRpK
26 30,3% 188 1.269E-13

Domains

Domains
Representative sequence (used for alignment): 3SNy4 (211 AA)
Member sequence: 5BfSV (220 AA)
1 211 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3SNy4) rather than this protein.
PDB ID
3SNy4
Method AlphaFoldv2
Resolution 80.27
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50