Protein

Protein accession
F4YCR3 [UniProt]
Representative
yB01
Source
UniProt (cluster: phalp2_26241)
Protein name
Structural protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MRLYVSALPILILSGPVSTAQVIHPPLKTNPIVPVANVIAKGEGNWDSVNRGRAGDTPGGIKSLTGKSFSSHTVGEVRSLQRSRIYAVGRYQLIPSTLSYAVQKAGVQASERFTPEVQNRLLQALLDHKRPSIGAYIRGEHNNLDLALRAMALEWSSVAWTSGRSYYGGSNRSHVTRDEAGVALRRARDLYSGSPMETSQ
Physico-chemical properties
protein length: 200 AA
molecular weight: 21753.4 Da
isoelectric point: 10.22
hydropathy: -0.30
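The length and hydropathy values above can be rechecked from the sequence itself. The following is a minimal sketch, assuming that "hydropathy" refers to the GRAVY score (mean Kyte-Doolittle hydropathy per residue); the page does not state which scale it uses, and the function name `gravy` is illustrative only.

```python
# Kyte-Doolittle hydropathy index per residue (assumed scale; the
# database page does not say which scale produced its value).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues of seq."""
    return sum(KD[aa] for aa in seq) / len(seq)


# Sequence of F4YCR3 as listed under "Protein sequence".
F4YCR3 = (
    "MRLYVSALPILILSGPVSTAQVIHPPLKTNPIVPVANVIAKGEGNWDSVN"
    "RGRAGDTPGGIKSLTGKSFSSHTVGEVRSLQRSRIYAVGRYQLIPSTLSY"
    "AVQKAGVQASERFTPEVQNRLLQALLDHKRPSIGAYIRGEHNNLDLALRA"
    "MALEWSSVAWTSGRSYYGGSNRSHVTRDEAGVALRRARDLYSGSPMETSQ"
)

print("length:", len(F4YCR3), "AA")
print("GRAVY :", round(gravy(F4YCR3), 2))
```

Molecular weight and isoelectric point depend on the residue masses and pKa set chosen, so they are left out of this sketch.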
Representative Protein Details
Accession
yB01
Protein name
yB01
Sequence length
271 AA
Molecular weight
30302.80060 Da
Isoelectric point
5.87047
Sequence
MKTSSKIARLMWYSIETGESWNDQMLTNVMRIKSILCLFATVPFLVACGVSKPIKSKVPVDEHVQKVTFTEHLKPLRDLISKGEGDYDAVNRGRAGDTPQGIVDLTGKKFENHSVGEVLSLQKTSVFAVGRYQFIPKTLRFAVSESNVNTKDKFTNKVQDQLFTVLVSHKRPVIGGYLLGEHDNVEGALDDLAREWASVEFRKGTSYYQGRGGNKAHISRADAKVVLEKIRNDIEDRDEETEVQHSEVSFREEELQDSGLDQGLPSVEGGD
Other Proteins in cluster: phalp2_26241
Total (incl. this protein): 9; Avg length: 215.2 AA; Avg pI: 8.79

Protein ID   Length (AA)  pI
yB01         271          5.87047
A0A482IE72   209          9.93692
A0A6G8R6P6   218          9.37488
A0A6G8R6U4   218          9.57067
A0AA95FNE2   203          6.96058
A0AA95FTU5   203          8.45343
A0AA95JNZ0   202          9.72823
A0AA96EPS4   213          9.04068
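The cluster summary can be reproduced from the table. This is a minimal sketch, assuming that the averages include F4YCR3 itself (200 AA, pI 10.22), which is not listed as a row but is implied by "incl. this protein"; the dictionary layout and the name `cluster_averages` are illustrative.

```python
# (length in AA, pI) per cluster member, copied from the table above,
# plus this protein (F4YCR3), assumed to be the ninth member.
members = {
    "yB01": (271, 5.87047),
    "A0A482IE72": (209, 9.93692),
    "A0A6G8R6P6": (218, 9.37488),
    "A0A6G8R6U4": (218, 9.57067),
    "A0AA95FNE2": (203, 6.96058),
    "A0AA95FTU5": (203, 8.45343),
    "A0AA95JNZ0": (202, 9.72823),
    "A0AA96EPS4": (213, 9.04068),
    "F4YCR3": (200, 10.22),
}


def cluster_averages(table):
    """Return (member count, mean length, mean pI) for the cluster."""
    n = len(table)
    avg_len = sum(length for length, _ in table.values()) / n
    avg_pi = sum(pi for _, pi in table.values()) / n
    return n, avg_len, avg_pi


n, avg_len, avg_pi = cluster_averages(members)
print(n, round(avg_len, 1), round(avg_pi, 3))
```

The recomputed mean length matches the listed 215.2 AA. The mean pI lands within 0.01 of the listed 8.79; the pI of F4YCR3 is only given to two decimals on this page, so an exact match on the last digit is not guaranteed.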
Similar Clusters (pHMM search)
#  Cluster (representative)  # Members  Identity (%)  Alignment Length  E-value
1  phalp2_25040 (13Zu8)      10         52.1          184               8.460E-77
2  phalp2_11689 (1R23y)      7          39.4          180               7.086E-32
3  phalp2_39645 (6MMi3)      2          32.4          188               6.804E-20
4  phalp2_25468 (34JbU)      19         29.3          191               8.937E-09
5  phalp2_5563 (3V43M)       4          23.6          182               5.163E-06
6  phalp2_35822 (4E3bu)      2          23.7          177               1.221E-05

Domains

Domains [InterPro]
Annotated regions: Disordered region, Unannotated
Representative sequence (used for alignment): yB01 (271 AA)
Member sequence: F4YCR3 (200 AA)
Axis: positions 1 to 271 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

       Name                                 Taxonomy ID  Lineage
Phage  Synechococcus phage S-CBS2 [NCBI]    753084       No lineage information
Host   Synechococcus sp. CB0204 [NCBI]      232353       Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus

Coding sequence (CDS)

CDS Source ID
CDS Source
GU936714 [NCBI]
CDS location
range 61200 -> 61802
strand +
CDS
ATGCGTCTTTATGTATCCGCACTGCCCATCCTCATCCTGAGCGGCCCCGTTTCAACCGCACAGGTCATCCATCCACCCCTTAAAACCAACCCAATCGTTCCAGTAGCCAACGTCATTGCCAAGGGCGAGGGCAACTGGGATTCCGTCAACCGTGGCCGAGCAGGCGACACCCCAGGCGGAATCAAATCTCTAACTGGCAAGTCCTTCTCCAGCCACACCGTCGGCGAAGTACGATCCCTCCAGCGCAGCAGGATCTATGCCGTCGGCCGCTACCAGCTCATTCCCAGCACGCTCTCCTACGCCGTCCAGAAGGCTGGCGTGCAGGCCAGTGAGCGATTCACCCCAGAGGTGCAGAACCGCCTCCTACAGGCCCTCCTGGATCACAAGCGCCCGTCGATCGGCGCTTACATCCGCGGGGAACATAACAACCTGGACCTGGCCCTCCGCGCCATGGCCCTCGAGTGGTCATCGGTCGCCTGGACCAGTGGCCGCAGCTACTACGGTGGCAGCAACCGCTCTCATGTTACGAGAGATGAAGCAGGGGTTGCACTCCGCAGAGCCAGGGATCTATACTCGGGGAGTCCAATGGAGACATCCCAATGA
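The CDS coordinates and the protein length can be cross-checked against each other. The sketch below assumes that the range is 1-based and inclusive and that the standard codon assignments apply; neither is stated on this page. The helper names `cds_length` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed to apply to this phage CDS).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


nt = cds_length(61200, 61802)
print(nt, "nt ->", nt // 3 - 1, "codons before the stop codon")

# First 24 and last 12 nucleotides of the CDS listed above.
print(translate("ATGCGTCTTTATGTATCCGCACTG"))
print(translate("ACATCCCAATGA"))
```

Under these assumptions the range spans 603 nt, that is 200 sense codons plus one stop codon, which agrees with the 200 AA protein length.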

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000207842b_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
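The confidence legend maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows; the legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so placing them in the lower band is an assumption, as is the function name `plddt_band`.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    Boundary values (exactly 90, 70, 50) are assigned to the lower
    band; this is an assumption, the legend leaves them undefined.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for s in (95.0, 80.0, 60.0, 30.0):
    print(s, plddt_band(s))
```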

The structures below correspond to the cluster representative (yB01) rather than this protein.
PDB ID
yB01
Method AlphaFold2
Resolution 81.00
Chain position -