Protein
- Protein accession: M4T440 [UniProt]
- Representative: 2s6dm
- Source: UniProt (cluster: phalp2_2034)
- Protein name: Uncharacterized protein
- Lysin probability: 100%
- PhaLP type: endolysin (probability: 99%, predicted by ML model)
- Protein sequence:
MSVKPIRLIDLFRYYQKLGHQTAAINELEQEILKVAPEIFNRDQAWYETWKSAVPPKSGVWLITRQQISKISGHAEHLFDDAFMTDLNRLVRATGMTSLNQRRMLIAQTCHETARYRYMTEIGDRAYFSRMYDNRSDLGNGPNDGYRYRGCGVIQLTGRHNFTRFAKWMERNGMRDDRIMEGTDYVVTKYPFLCAVCWIEENNWAAICETGDVYAATRRLNGGYNGIDDRIHYYEKAKKFITA
- Physico-chemical properties:
  - Protein length: 243 AA
  - Molecular weight: 28429.0 Da
  - Isoelectric point: 8.88
  - Hydropathy: -0.57
Representative Protein Details
- Accession: 2s6dm
- Protein name: 2s6dm
- Sequence length: 63 AA
- Molecular weight: 7197.02400 Da
- Isoelectric point: 5.63754
- Sequence:
QGANYVAQVYPFLSAICWIEDNNWASICEGTDVYRVTRVLNGGYNGIEDRLALYKRACEHIKQ
Other Proteins in cluster: phalp2_2034
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_40666 (5hkt3) | 44 | 31.4 | 54 | 4.209E-06 |
| 2 | phalp2_30479 (4YsHC) | 16 | 32.1 | 56 | 1.099E-05 |
| 3 | phalp2_22660 (27jJG) | 2 | 41.5 | 53 | 7.509E-05 |
| 4 | phalp2_22500 (1e7PJ) | 24 | 30.6 | 49 | 3.728E-04 |
Domains
Domains [InterPro]
[Domain architecture figure; axis from 1 to 63 AA (representative)]
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Cyanophage KBS-S-2A [NCBI] | 889953 | No lineage information |
| Host | Synechococcus sp. WH 7803 [NCBI] | 32051 | Cyanobacteria > Oscillatoriophycideae > Chroococcales > Synechococcus |
Coding sequence (CDS)
- CDS source ID: HQ634187 [NCBI]
- CDS location: range 6623 -> 7354, strand +
- CDS:
ATGAGCGTGAAACCGATCCGACTGATTGACCTATTCCGGTACTACCAGAAGCTTGGTCATCAAACTGCTGCTATCAATGAGCTAGAGCAGGAGATTTTGAAGGTAGCACCGGAGATCTTCAACCGCGATCAAGCGTGGTACGAAACTTGGAAATCGGCAGTGCCACCCAAGTCTGGCGTGTGGTTGATCACCCGTCAACAGATCAGCAAAATATCGGGTCATGCTGAGCATCTATTTGATGATGCGTTTATGACTGACCTTAACCGACTGGTACGAGCCACTGGCATGACAAGTTTAAATCAGCGTCGGATGCTTATTGCTCAGACTTGTCATGAGACTGCAAGGTATCGGTACATGACTGAAATCGGTGACAGAGCGTATTTCAGCCGAATGTACGATAATCGCAGTGATCTTGGTAATGGTCCAAATGACGGATACCGGTATCGCGGTTGCGGCGTGATTCAACTTACTGGCCGCCATAATTTCACCCGCTTTGCCAAATGGATGGAACGTAATGGGATGCGTGATGATCGGATTATGGAAGGCACCGATTATGTAGTCACCAAATACCCATTTTTGTGTGCGGTGTGCTGGATCGAAGAGAATAACTGGGCCGCGATCTGTGAAACCGGTGATGTGTACGCTGCAACACGTCGATTAAATGGTGGATATAACGGCATCGACGACAGGATTCATTACTATGAAAAGGCTAAAAAATTCATCACTGCATGA
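The CDS location and the reported protein length can be cross-checked with simple arithmetic. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention, not stated on this page) and that the CDS includes its stop codon, which matches the trailing TGA in the sequence above. The function name `protein_length_from_cds` is hypothetical and used only for illustration.

```python
def protein_length_from_cds(start: int, end: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS range.

    Assumes `start` and `end` are 1-based, inclusive coordinates.
    """
    nucleotides = end - start + 1
    if nucleotides <= 0 or nucleotides % 3 != 0:
        raise ValueError(f"CDS length {nucleotides} is not a positive multiple of 3")
    codons = nucleotides // 3
    # The stop codon is part of the CDS but does not encode an amino acid.
    return codons - 1 if includes_stop else codons


# Range 6623 -> 7354 spans 732 nt, i.e. 244 codons including the stop codon.
print(protein_length_from_cds(6623, 7354))  # 243
```

Under these assumptions the result agrees with the protein length of 243 AA listed in the physico-chemical properties.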
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
- PDB ID: upi0002c18e96_model
- Method: AlphaFold3 (non-commercial)
- Resolution: -
- Chain position: -
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
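The confidence bands in the legend can be applied to per-residue pLDDT scores with a small lookup. This is a minimal sketch, not the database's own code: the legend uses strict inequalities and does not say which band the boundary values 90, 70, and 50 fall into, so the sketch assigns each boundary to the higher band as an assumption. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band from the legend.

    Assumption: boundary values (90, 70, 50) are assigned to the higher band.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must lie between 0 and 100, got {score}")
    if score >= 90:
        return "Very high"
    if score >= 70:
        return "High"
    if score >= 50:
        return "Low"
    return "Very low"


# Example with made-up scores, for illustration only.
for value in (95.2, 81.0, 63.5, 41.7):
    print(value, plddt_band(value))
```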
The structures below correspond to the cluster representative (2s6dm) rather than this protein.