Protein

Protein accession
M4T440 [UniProt]
Representative
2s6dm
Source
UniProt (cluster: phalp2_2034)
Protein name
Uncharacterized protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSVKPIRLIDLFRYYQKLGHQTAAINELEQEILKVAPEIFNRDQAWYETWKSAVPPKSGVWLITRQQISKISGHAEHLFDDAFMTDLNRLVRATGMTSLNQRRMLIAQTCHETARYRYMTEIGDRAYFSRMYDNRSDLGNGPNDGYRYRGCGVIQLTGRHNFTRFAKWMERNGMRDDRIMEGTDYVVTKYPFLCAVCWIEENNWAAICETGDVYAATRRLNGGYNGIDDRIHYYEKAKKFITA
Physico‐chemical
properties
protein length:243 AA
molecular weight:28429,0 Da
isoelectric point:8,88
hydropathy:-0,57
Representative Protein Details
Accession
2s6dm
Protein name
2s6dm
Sequence length
63 AA
Molecular weight
7197,02400 Da
Isoelectric point
5,63754
Sequence
QGANYVAQVYPFLSAICWIEDNNWASICEGTDVYRVTRVLNGGYNGIEDRLALYKRACEHIKQ
Other Proteins in cluster: phalp2_2034
Total (incl. this protein): 4 Avg length: 117,5 Avg pI: 7,58

Protein ID Length (AA) pI
2s6dm 63 5,63754
24fMy 86 8,81471
7oWsu 78 6,97331
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_40666
5hkt3
44 31,4% 54 4.209E-06
2 phalp2_30479
4YsHC
16 32,1% 56 1.099E-05
3 phalp2_22660
27jJG
2 41,5% 53 7.509E-05
4 phalp2_22500
1e7PJ
24 30,6% 49 3.728E-04

Domains

Domains [InterPro]
Unannotated
Representative sequence (used for alignment): 2s6dm (63 AA)
Member sequence: M4T440 (243 AA)
1 63 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Cyanophage KBS-S-2A
[NCBI]
889953 No lineage information
Host Synechococcus sp. WH 7803
[NCBI]
32051 Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
HQ634187 [NCBI]
CDS location
range 6623 -> 7354
strand +
CDS
ATGAGCGTGAAACCGATCCGACTGATTGACCTATTCCGGTACTACCAGAAGCTTGGTCATCAAACTGCTGCTATCAATGAGCTAGAGCAGGAGATTTTGAAGGTAGCACCGGAGATCTTCAACCGCGATCAAGCGTGGTACGAAACTTGGAAATCGGCAGTGCCACCCAAGTCTGGCGTGTGGTTGATCACCCGTCAACAGATCAGCAAAATATCGGGTCATGCTGAGCATCTATTTGATGATGCGTTTATGACTGACCTTAACCGACTGGTACGAGCCACTGGCATGACAAGTTTAAATCAGCGTCGGATGCTTATTGCTCAGACTTGTCATGAGACTGCAAGGTATCGGTACATGACTGAAATCGGTGACAGAGCGTATTTCAGCCGAATGTACGATAATCGCAGTGATCTTGGTAATGGTCCAAATGACGGATACCGGTATCGCGGTTGCGGCGTGATTCAACTTACTGGCCGCCATAATTTCACCCGCTTTGCCAAATGGATGGAACGTAATGGGATGCGTGATGATCGGATTATGGAAGGCACCGATTATGTAGTCACCAAATACCCATTTTTGTGTGCGGTGTGCTGGATCGAAGAGAATAACTGGGCCGCGATCTGTGAAACCGGTGATGTGTACGCTGCAACACGTCGATTAAATGGTGGATATAACGGCATCGACGACAGGATTCATTACTATGAAAAGGCTAAAAAATTCATCACTGCATGA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0002c18e96_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (2s6dm) rather than this protein.
PDB ID
2s6dm
Method AlphaFoldv2
Resolution 93.21
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50