Protein

Protein accession
A0A6M2ZHL8 [UniProt]
Representative
8DvQj
Source
UniProt (cluster: phalp2_32332)
Protein name
Transglycosylase domain-containing protein
Lysin probability
96%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MSKVISSKFLPQSSTGSTSIVSVSGKRITALSTDKKKINFEKFVDDVQEKVSKKKTTGGSNLKLINHELIMIKKTLSKEKTEDKKQNDVKRKLREKTGREKRESKLESGLKLGKIGKNIASSLTSNSIFDAIKNFIGNVLLGRFVIFLLENIDKVFEFLKFVGGVTSFIYDLGTKVFGGIVGLIEGAYDINETVKGEIERVGGENAGKEYDKFTGAFNKFINLAIILAASGVPLNLGGGKDPLKRIPKGTLNQAAKSVSTSTKPFAGFSRTPGTFGAGNQRSSIAEYLKRSRAEKLIERRYGNAAAMRYSNAYRNAIDAGRSPSQAVIRAKATLQSSIRKGTIIARPAGYGLRGGTQARGLTGGIMRRGVGRAGSRLQTRILGRSARLGINRVGMRTASMYSRYMNSMLGRIPVLGALLVGVNTYFEDADGDGEPDRKLDKALFKMGGAALGGFLGSFIPIPFLGTMLGLAVGEYVGDLFYTLIRGGGVNAVGDKLKKDIERLINGVELAKNWLLNGVGNIANDNRFPTNIGLFRQEQVEGMLGFKMGAWPALLNPLNFNLFQKLDILRQAFLVDNKKSLPNNKQQTGNNRNSNQSGNRRSVTAQTLGFTPAQVRNLKAITPGAVPTLAQWKAAVKGSIFEPVAEEVYNTCINEGINPAFMWGLSGAEQSHVPSLKNNPWNYGVNSQMTFPSLAEGVKVIARAIRDPNGYYVGEGRRTLAAIIERYTPHNVQGNNTPQHIKNIINIGIETNGDPSTVLLPLNYKPGNQVSPSTSLLPSIVQPAGTIPQSTVQNNGQNGRLKQSQLTRVGQLSSSPTGGPYWYGNGAYLKHEAARWFIKAKQAAQKEGIYFTINSAYRSYEHQQALVGRYAVVAAPGTSAHGWGTALDIEVNGGWTWLKENGPKYGWHWMQIPNDDVHFEYLGPPTIAPQQAQPAAIPLNPNTGMPPNTSNWGTGGVTPAGTRPITNNPTRQSSLKASLDRFFKAKPGSNHKLIVPELGLILVKGVGWLGGNQSQIWTRKPGGKNFELGKLIFGGTDEEVKNEIYKKLEQRGLLTVPKATLSPTPPVPGQRYGKNGQPVSFNPQINNTNKTQLTASLRQTPSYNTRTLVYVQPIETIIEKENQNTDPLSFLV
Physico‐chemical
properties
protein length:1131 AA
molecular weight:123078,2 Da
isoelectric point:10,04
hydropathy:-0,38
Representative Protein Details
Accession
8DvQj
Protein name
8DvQj
Sequence length
678 AA
Molecular weight
N/A Da
Isoelectric point
9,82319
Sequence
matisskkllpsaegkkqvflvplanvvpstpklegitpvekkagskttvigekitevtrlfryglllkerdrrrkirlnerkrreqretkleekklgkkgqqeklavkipgssifdrlerflgftlfgfifnnyskylprllvlgkaikpavegflnfsetilsatvtfidegykaydavgqwvediggenakalfdkfskelnvylntailigltgmrggaftprrgpqkpkpkppgrppgtgagagfasgfaxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxggwaggalgtaitgataffsgglglilapvivgglgvggailgdilfgmiydglagvlggtrgysqggiiesrqikleesvkkkknivqpspigfkaeekyrdyynkgnqktpysllrdahgelselggffgsllagtvallngqffepvllsyvgsafgrttkaelestgrivvnmvrnrvgiattttttspqtplpqtrrrnppsnylppngtngrmrqnqltkvgtlsgsptggpywygngaylehtaakyfikakeaarkegitltinsayrsyehqqalvglypvvaapgtsahglgialdievdagwywlkdngpkygwewsqipnddvhfefmgapqlisktrsttnlnniasakvlstntsyevesnnilmrqtfiqpvvg
Other Proteins in cluster: phalp2_32332
Total (incl. this protein): 6 Avg length: 898,0 Avg pI: 9,84

Protein ID Length (AA) pI
8DvQj 678 9,82319
28XnS 918 9,66376
4vzgm 798 9,66976
5FjwK 925 10,00119
8umPm 938 9,82674
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36465
19CpA
19 25,9% 760 9.431E-31
2 phalp2_3283
2IoOV
16 25,8% 844 1.657E-30

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Synechococcus phage S-SCSM1
[NCBI]
2588487 Kyanoviridae > Zhoulongquanvirus > Zhoulongquanvirus esscess
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MK867354 [NCBI]
CDS location
range 165317 -> 168712
strand +
CDS
ATGAGCAAAGTAATTTCCTCTAAATTTTTACCACAATCTTCTACAGGATCAACAAGTATTGTTTCTGTTAGTGGTAAAAGAATTACTGCGCTTTCTACAGATAAGAAAAAAATTAATTTTGAAAAATTTGTTGATGATGTTCAAGAAAAAGTATCTAAGAAAAAAACAACTGGTGGTTCTAATTTAAAATTGATTAATCATGAATTGATTATGATTAAAAAAACTCTTAGTAAAGAGAAAACAGAAGATAAAAAACAAAATGATGTCAAAAGAAAACTGAGGGAAAAAACAGGTAGAGAAAAACGAGAAAGTAAATTAGAAAGTGGATTAAAGTTAGGAAAGATTGGGAAAAATATTGCTAGTTCATTAACATCAAATAGTATATTTGATGCGATTAAAAACTTTATAGGTAATGTTCTTTTAGGTCGATTTGTTATATTTTTACTCGAAAATATTGATAAAGTATTTGAGTTTTTAAAATTTGTTGGTGGTGTTACCTCTTTCATATATGATCTTGGCACTAAAGTATTTGGTGGTATTGTTGGACTGATTGAAGGTGCTTATGATATTAATGAAACTGTAAAGGGTGAGATTGAACGTGTTGGTGGAGAAAATGCAGGTAAAGAATATGACAAGTTTACTGGTGCATTTAATAAGTTTATAAATCTTGCAATTATTCTTGCAGCATCTGGAGTTCCTTTAAATCTTGGTGGAGGAAAAGATCCTTTAAAGAGAATACCAAAAGGAACTCTTAATCAGGCAGCAAAATCAGTATCAACAAGCACTAAACCTTTTGCTGGTTTTTCTAGAACACCAGGAACATTTGGTGCAGGAAATCAAAGAAGTTCAATAGCAGAATATCTTAAGAGATCTAGAGCAGAAAAATTAATTGAAAGGAGATATGGTAACGCAGCTGCAATGCGTTATTCAAATGCATATAGAAATGCAATTGATGCTGGTAGATCTCCATCTCAAGCAGTAATAAGAGCAAAAGCAACACTACAAAGTTCAATAAGAAAAGGTACTATTATAGCAAGACCTGCAGGATATGGTCTTCGTGGAGGAACACAGGCACGTGGACTGACTGGTGGTATTATGAGACGAGGAGTAGGACGGGCAGGTAGTCGTTTACAGACTCGTATACTAGGTCGTAGTGCAAGATTAGGTATTAATCGAGTGGGAATGCGTACAGCAAGTATGTATTCTCGTTATATGAATAGTATGCTCGGAAGAATTCCCGTTCTCGGAGCACTACTAGTTGGTGTCAATACTTATTTTGAAGATGCTGATGGTGATGGTGAACCTGATAGAAAATTGGATAAAGCATTATTTAAGATGGGTGGTGCTGCCCTTGGTGGATTCTTGGGATCTTTTATTCCAATTCCATTTTTGGGAACAATGCTTGGATTGGCTGTTGGTGAATATGTTGGTGACTTATTTTATACTTTAATAAGAGGTGGTGGAGTAAATGCTGTTGGAGATAAACTTAAAAAGGATATTGAAAGACTTATTAATGGTGTAGAACTAGCAAAAAACTGGTTGCTTAATGGAGTAGGAAATATTGCAAATGATAATAGATTTCCTACTAATATCGGTCTGTTTAGACAGGAACAAGTTGAAGGAATGCTTGGATTTAAAATGGGTGCATGGCCAGCACTATTAAATCCACTTAACTTCAATTTATTTCAAAAACTTGATATCCTCAGACAAGCATTCTTAGTCGATAATAAAAAATCTTTACCTAACAATAAGCAACAAACTGGAAATAATAGAAATTCTAATCAATCAGGAAATAGAAGAAGTGTTACTGCACAAACTCTTGGATTTACTCCAGCGCAAGTTAGGAATCTAAAAGCTATTACTCCTGGAGCAGTTCCAACATTAGCACAGTGGAAAGCAGCAGTAAAGGGAAGTATTTTTGAACCCGTAGCAGAGGAAGTATATAATACATGCATTAATGAAGGAATAAACCCAGCATTTATGTGGGGATTATCTGGTGCTGAACAGTCTCACGTACCTTCCCTCAAAAATAATCCATGGAATTATGGTGTTAATAGTCAGATGACATTTCCTTCGTTGGCAGAGGGTGTGAAAGTTATTGCAAGAGCAATTAGAGATCCTAATGGATATTATGTTGGAGAAGGACGCAGGACACTCGCAGCTATTATTGAACGATATACACCACATAATGTCCAAGGTAATAATACACCTCAGCATATTAAGAATATTATTAATATTGGAATTGAAACAAACGGAGATCCATCTACAGTATTACTTCCTTTAAATTATAAACCAGGAAATCAAGTTAGTCCAAGCACCTCATTATTACCTTCTATCGTTCAACCCGCAGGAACTATTCCACAATCAACCGTCCAAAATAATGGACAAAATGGAAGGTTGAAACAAAGTCAATTGACTAGAGTTGGTCAATTATCTAGTTCACCAACTGGTGGTCCTTATTGGTATGGGAACGGTGCTTACTTGAAACATGAGGCAGCAAGATGGTTTATAAAAGCAAAACAAGCTGCTCAGAAAGAAGGTATATACTTTACAATTAATAGTGCATATAGAAGTTATGAGCATCAGCAAGCATTAGTTGGTAGGTATGCTGTAGTTGCTGCTCCTGGAACATCTGCTCATGGTTGGGGAACAGCACTTGATATTGAAGTAAATGGTGGATGGACTTGGTTGAAGGAGAATGGACCGAAGTATGGGTGGCATTGGATGCAAATTCCAAATGATGATGTTCACTTTGAATACTTAGGACCTCCTACAATTGCTCCACAACAAGCACAACCTGCAGCAATTCCTCTTAATCCTAATACTGGTATGCCTCCAAATACTAGTAATTGGGGAACTGGAGGAGTAACTCCTGCAGGAACACGACCAATAACAAATAATCCTACAAGACAAAGCTCTTTAAAAGCATCTCTAGATAGATTTTTTAAAGCTAAGCCTGGAAGTAATCATAAACTAATTGTTCCAGAATTGGGATTAATTTTAGTTAAAGGTGTGGGTTGGTTAGGAGGAAATCAAAGTCAAATATGGACAAGAAAACCAGGAGGAAAAAACTTTGAATTAGGTAAACTTATTTTTGGTGGTACTGACGAAGAAGTTAAAAATGAGATATACAAAAAATTAGAACAAAGGGGATTATTGACTGTACCAAAGGCAACACTTAGTCCTACACCACCAGTACCTGGTCAGAGATATGGTAAAAATGGTCAACCAGTATCATTTAACCCACAAATAAATAATACAAACAAAACACAACTGACAGCATCTTTAAGACAAACACCTTCTTATAACACAAGAACCCTGGTTTATGTTCAACCAATAGAGACAATTATTGAAAAAGAAAATCAGAATACCGATCCATTAAGTTTCTTGGTATAA

Gene Ontology

Description Category Evidence (source)
GO:0006508 proteolysis biological process None (UniProt)
GO:0008233 peptidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (8DvQj) rather than this protein.
PDB ID
8DvQj
Method AlphaFoldv2
Resolution 50.97
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50