Protein

Protein accession
A0A3G8F2K6 [UniProt]
Representative
82DSg
Source
UniProt (cluster: phalp2_26536)
Protein name
Peptidase
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MGGDHEGGITMYVEDWTRYPNFTRAEFACRHTGECRMQIDFMDKLQKLRVLYGKSMRITSGYRHPSHPVEARKGSTTGEHTQGAAADIAVEGADAIRLLRLALEIGFSRIGVQQKGTGRFIHLGIGGRGLASPTIWSY
Physico-chemical properties
protein length: 138 AA
molecular weight: 15297.3 Da
isoelectric point: 9.30
hydropathy: -0.39
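The length and hydropathy values can be re-derived from the sequence itself. The following is a minimal sketch, assuming the reported hydropathy is a GRAVY score (the mean Kyte-Doolittle hydropathy per residue). The record does not name the scale it uses, and `gravy` is an illustrative helper, not part of any PhaLP tooling.

```python
# Kyte-Doolittle hydropathy index per residue.
# Assumption: the record's "hydropathy" is the GRAVY score built on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of A0A3G8F2K6 as listed in this record.
SEQ = (
    "MGGDHEGGITMYVEDWTRYPNFTRAEFACRHTGECRMQIDFMDKLQKLRVLYGKSMRITSGYRHPSHPVE"
    "ARKGSTTGEHTQGAAADIAVEGADAIRLLRLALEIGFSRIGVQQKGTGRFIHLGIGGRGLASPTIWSY"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


print(f"length: {len(SEQ)} AA, GRAVY: {gravy(SEQ):.2f}")
```

Under that assumption the sketch gives a length of 138 AA and a GRAVY of about -0.39, consistent with the listed values.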
Representative Protein Details
Accession
82DSg
Protein name
82DSg
Sequence length
70 AA
Molecular weight
8258.17650 Da
Isoelectric point
9.88521
Sequence
MNWDNYPNFSEAEFTCSHTGKCDMQASFMEKLQALRTAHGKAMTVTSGYRHETHRSRLRRTGRHTRWGWP
Other Proteins in cluster: phalp2_26536
Total (incl. this protein): 12   Avg length: 64.9 AA   Avg pI: 7.71

Protein ID   Length (AA)   pI
82DSg        70            9.88521
1BHNA        54            8.81955
26h7K        44            5.11451
2dA2O        72            9.13203
2dmN8        81            9.30164
342sJ        54            9.01805
3bLlf        77            6.56987
3hyRk        49            7.86742
6C87z        33            5.47976
6OsdM        55            5.46793
80c0H        52            6.54219
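The summary line counts 12 proteins, while the table lists only the 11 other members. A quick check, assuming the averages are plain arithmetic means and that this protein (138 AA, pI 9.30) is the twelfth member:

```python
# (length in AA, pI) for the 11 cluster members listed in the table.
others = [
    (70, 9.88521), (54, 8.81955), (44, 5.11451), (72, 9.13203),
    (81, 9.30164), (54, 9.01805), (77, 6.56987), (49, 7.86742),
    (33, 5.47976), (55, 5.46793), (52, 6.54219),
]

# The protein described by this record (assumed to be the 12th member).
this_protein = (138, 9.30)

members = others + [this_protein]
avg_length = sum(length for length, _ in members) / len(members)
avg_pi = sum(pi for _, pi in members) / len(members)

print(f"{len(members)} members, avg length {avg_length:.1f}, avg pI {avg_pi:.2f}")
# prints "12 members, avg length 64.9, avg pI 7.71"
```

The result matches the summary line only when this protein is included, which supports reading "incl. this protein" as applying to the averages as well as the total.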
Similar Clusters (pHMM search)
#    Cluster        Representative   # Members   Identity (%)   Alignment Length   E-value
1    phalp2_2374    7IJp2            341         40.3%          52                 2.071E-18
2    phalp2_7277    3ZpbC            27          53.3%          45                 8.689E-16
3    phalp2_39422   7D36k            12          35.4%          62                 1.309E-12
4    phalp2_25414   2AE54            4           42.3%          52                 3.402E-12
5    phalp2_38403   12owM            41          43.7%          48                 4.677E-12
6    phalp2_33162   7IbbY            1           37.5%          48                 2.940E-10
7    phalp2_40395   45OU4            4           40.0%          60                 2.940E-10
8    phalp2_12543   17tRy            38          34.0%          50                 7.647E-10
9    phalp2_34418   4soej            3           39.2%          51                 5.178E-09
10   phalp2_32684   816WS            5           37.2%          51                 1.730E-07

Domains

Domains [InterPro]
Unannotated
Representative sequence (used for alignment): 82DSg (70 AA)
Member sequence: A0A3G8F2K6 (138 AA)
Domain architecture figure (axis: positions 1 to 70 AA of the representative). Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

        Name                              Taxonomy ID   Lineage
Phage   Microcystis phage Me-ZS1 [NCBI]   2483660       No lineage information
Host    No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MK069556 [NCBI]
CDS location
range 37627 -> 38043
strand -
CDS
ATGGGCGGTGATCACGAAGGGGGCATCACCATGTACGTTGAAGACTGGACAAGATACCCAAACTTCACGCGTGCAGAGTTTGCCTGCCGGCATACGGGGGAATGCCGTATGCAGATCGACTTCATGGACAAACTGCAAAAACTCCGTGTTCTCTACGGCAAGTCCATGCGCATTACAAGCGGTTACCGTCACCCGTCGCACCCTGTCGAAGCACGTAAGGGTTCGACAACCGGCGAGCACACACAGGGCGCGGCTGCGGACATTGCGGTCGAAGGTGCTGACGCCATCCGGCTGCTCCGTTTGGCACTCGAAATCGGTTTCTCCCGAATTGGAGTGCAACAGAAAGGGACGGGGCGTTTCATCCATCTCGGCATTGGTGGTCGCGGCTTGGCCTCTCCGACAATCTGGAGTTACTGA
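As a consistency check, the CDS can be translated and compared with the protein sequence. The following is a minimal sketch, assuming the CDS is already given in coding orientation (it starts with ATG and ends with TGA even though it lies on the minus strand) and that the standard codon assignments apply. `translate` is an illustrative helper.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# CDS and protein sequence copied from this record.
CDS = (
    "ATGGGCGGTGATCACGAAGGGGGCATCACCATGTACGTTGAAGACTGGACAAGATACCCA"
    "AACTTCACGCGTGCAGAGTTTGCCTGCCGGCATACGGGGGAATGCCGTATGCAGATCGAC"
    "TTCATGGACAAACTGCAAAAACTCCGTGTTCTCTACGGCAAGTCCATGCGCATTACAAGC"
    "GGTTACCGTCACCCGTCGCACCCTGTCGAAGCACGTAAGGGTTCGACAACCGGCGAGCAC"
    "ACACAGGGCGCGGCTGCGGACATTGCGGTCGAAGGTGCTGACGCCATCCGGCTGCTCCGT"
    "TTGGCACTCGAAATCGGTTTCTCCCGAATTGGAGTGCAACAGAAAGGGACGGGGCGTTTC"
    "ATCCATCTCGGCATTGGTGGTCGCGGCTTGGCCTCTCCGACAATCTGGAGTTACTGA"
)
PROTEIN = (
    "MGGDHEGGITMYVEDWTRYPNFTRAEFACRHTGECRMQIDFMDKLQKLRVLYGKSMRITSGYRHPSHPVE"
    "ARKGSTTGEHTQGAAADIAVEGADAIRLLRLALEIGFSRIGVQQKGTGRFIHLGIGGRGLASPTIWSY"
)


def translate(cds: str) -> str:
    """Translate a coding-orientation DNA string codon by codon ('*' = stop)."""
    codons = (cds[i:i + 3] for i in range(0, len(cds) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# The inclusive range 37627 -> 38043 should span the whole CDS.
cds_length = 38043 - 37627 + 1
translated = translate(CDS)

print(cds_length == len(CDS))             # range length agrees with the sequence
print(translated.endswith("*"))           # ends on a stop codon
print(translated.rstrip("*") == PROTEIN)  # matches the listed protein
```

The range covers 417 nt, that is 138 codons plus one stop codon, which agrees with the listed protein length of 138 AA.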

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (82DSg) rather than this protein.
PDB ID
82DSg
Method AlphaFoldv2
Resolution 70.82
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
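The confidence bands map a per-residue pLDDT score to a label. A minimal sketch of that mapping follows. The legend uses strict inequalities, so assigning the exact boundary values 90, 70 and 50 to the lower band is an assumption made here, and `plddt_band` is a hypothetical helper.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence label used in the legend.

    Assumption: a score exactly on a boundary (90, 70, 50) falls into the
    lower band, because the legend only gives strict inequalities.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```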