Protein

Protein accession
5BY17 [EnVhog]
Representative
fs6P
Source
EnVhog (cluster: phalp2_21041)
Protein name
5BY17
Lysin probability
99%
PhaLP type
endolysin
Probability: 83% (predicted by ML model)
Protein sequence
MSSCAPEPITAPASAVLTFERPRVATTPAPPRFTGMGLADFISRYTGRAVILPGYTTAECVAVFSHYNAEAVLGDAYSAPGAQDLWLTNTWTAYDRVPATEPAKRGDVVVWSGSFGAYLGGGYGHVAIVLSDNGATLTTLSQNPNPTGVLELSKYGVLGYLRPHHLNP
Physico‐chemical
properties
protein length:168 AA
molecular weight:17742,8 Da
isoelectric point:6,03
hydropathy:0,02
Representative Protein Details
Accession
fs6P
Protein name
fs6P
Sequence length
163 AA
Molecular weight
16722,58930 Da
Isoelectric point
9,19740
Sequence
TTAAPRAGGAAIAGAPGVAGAFDRFAAKYTGRVVDFDGAFGGQCVDLTMLYASEVFGVRVNGNGNQWFANGARSGAFTQVSKDATPQKGDIACWGGYYGGIYGHVAIVITDSGGSLRVLTQNPGGFHQDTLSKQGLQGYLRPTKAPTVYAGGIDAVRGPMKVV
Other Proteins in cluster: phalp2_21041
Total (incl. this protein): 6 Avg length: 311,2 Avg pI: 7,78

Protein ID Length (AA) pI
fs6P 163 9,19740
7d2br 215 7,89778
A0A6G8R2C3 440 6,45716
A0A514TYX8 442 8,22431
A0AA96HIB7 439 8,87344
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24452
4BMZo
66 37,3% 134 2.209E-21
2 phalp2_17703
71dZW
23 35,1% 131 1.447E-20
3 phalp2_23555
7sVyK
88 35,3% 133 1.447E-20
4 phalp2_30026
2tAM9
32 37,3% 134 1.979E-20
5 phalp2_24091
8mc01
6 34,3% 131 3.309E-19
6 phalp2_39580
6iLgH
177 37,0% 135 4.524E-19
7 phalp2_3543
4zM0o
163 36,1% 105 6.712E-17
8 phalp2_10799
3WA57
47 38,2% 141 1.253E-16
9 phalp2_25278
8b22d
4 32,0% 128 1.344E-14
10 phalp2_3981
7wit1
2 29,8% 144 4.664E-14

Domains

Domains
Representative sequence (used for alignment): fs6P (163 AA)
Member sequence: 5BY17 (168 AA)
1 163 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05257

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
5BY17
Method AlphaFoldv2
Resolution 80.03
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (fs6P) rather than this protein.
PDB ID
fs6P
Method AlphaFoldv2
Resolution 82.99
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50