Protein

Protein accession
M4ZR40 [UniProt]
Representative
5tFDA
Source
UniProt (cluster: phalp2_19846)
Protein name
Transglycosylase SLT domain-containing protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MNGTVREIKAQFIAVVSDFKKKINSAVDSLQEIGTQTEKSVDKANRGLDSLSKGLKSLNKTLANSGKADEFKELSSALDDVQKEFEQTGKVSKDTMKNLQKELSKSKGSLEELGKADAFDGMIREISRVEGSLDDLDKVDFRNLIGEINRSEQALSGLNSADLSGLQSEIDQTSFTEVTSGATRAAGSVGELNNESLSGLRSEADRTGASLAEVGAAGTEAGNEAEAGGIAAGLGFKGAIGAVVAFTSTVGSAVLGLVGFVKSGIDLQKTMNNVQIRTGATNAEMAKFRNNIMNVYKGGYGEGFSDIADAMVRIEQTTNLSGKALEDATKQALLFRDEFGYEVPESMKTVNVMMKQFGITSEQAFNLLAQGQQQGLDYADDMLDTFWEYSVYFKQLGYDANGMFDVLKAGADAGAFNLDKVGDAIKEFGIRVKDGSDTTNEAFNDLGLDAGEMQEMFAKGGDTAQKALQTVFDKLQKVNDKTVKNTAAVNLFGTMYEDLEENTIAAMNNVQKSGDMTADTMKKIDEIHFDNIGAAFTGLWRWFNGSFLIPLQSQAMPAINNFANNAKRTLDAITSGDNQKISDLLESWGLNDQQINHVFVALNKVKDAFAAVRALASGDEKTGVNLLQKLGLNENQINNIVNLFSTLKSYMSTAIGVFKQFASAIGGFFSGLWDLIGPYIMPALDAVVGFVQETLAKIIGFWNTDGQQIMQAAQNVFNFILSIIKFVMPVVLAIIQSVWGNIKGVISGALNIIMGLVRIFTGLFTGDFSKMWQGIKQLFAGAIQFVWNLVNLLFIGKIIGGIKSLVKTGLTFLKDFWTKIPTLFQTGVTKANKFISDMVVKILGFLKNLAINAVKAVWNMFTGIIKWFANIRTNAVKIFNAMKNTIQAIYNAIKNAVIASVKFMVSKVVGFFKSLYNTGKKIFTDTKNFFTSIWNKIKDSVVNAAKNMWNGARKKFTDMKNGISKIFTSVKDGIKKKFDDIVEFAKKLPGRIGSGIKKMASKVGDGIKSLANTMNKKLALGVNGVIGGINWVLDTLKVPKKVGRVKKWEPPQYAKGTGGHPGGPMIVGDGRKKELVQYPDGTTFLSPATDTLVPNAPAGTKVLSGKDTETLMKSIPMYKNGDGSGISDFLGFVGGKVKQGAQWVGSKVKQGADYVKDKAEELWGYVTDPKKLLKKGLEMTGFSMPEIPAIAKATPQILGKLVDGGVSYLKDKLSDFSFLGDSNAPGNVKSWIAQAVAITKSPTSWIEPLITIAMRESGGRTGSSTINKWDINWKRGTPSMGLMQTIKPTFDSFKMKGHGNIMNPIDNAIAAIRYIKHRYGTPMNTPGLRSLAAGGPYKGYENGGRVFGKQLAWLAENPGVAEWVIPEDGSANAYTHWANAGIANGFMSNDNGSPATRSGQAAPIANMDETNTLLTQAVKILREIADNDNQFVLPVEDVDRIQGKRVKAEAFKNGVV
Physico‐chemical
properties
protein length:1456 AA
molecular weight:157590,8 Da
isoelectric point:9,16
hydropathy:-0,15
Representative Protein Details
Accession
5tFDA
Protein name
5tFDA
Sequence length
1443 AA
Molecular weight
153776,85380 Da
Isoelectric point
6,04303
Sequence
MANDDSNNLGGKVFLDTTEFKAGVTDLNRQIRVIESGFKAAAAGMDDWGKSSEGLRLKINSLNQVTDLQKQKIANLTQQYKEVVAAKGEDSKQAQNLQVRINNETASLNKNLKELSNTSNALKNLGNDSKDTGQNVDKLNKSADETGNNFKELSGNLLKVTGIAAVGTMAVNAGKSVLGFSTDSQKAMNSFQAQTGASTSQMSQFKQQIDQIYADNFGESLDDIANSMAQVKQVTGESGDTLKNTTEDALLLRDTFGFEVPESIRAANSLIKNFGITGEQAYNLIAQGAQNGANKNDDLLDSLNEYSQEFKSLGFNAQDFTNILITGAENGSFSIDKMADAVKEFEIRSKDGSQTSIQGFQALGLNANQMFETFAKGGDGAKKAFQLVIDKLVAMKDPVAQNQAGVNLFGTQFEDLGIKGIASLGSISDKANITKDSLSQLNGVKYNDASSALEGLKRQLIESVGDPIGKEVTPKINNMINSLKKVDTSNIVNGFGWIVDNAGSIAVGIGTIVSAWAGFKVGTAINTAVQGVIEFNKATKDATLAQAALNLVMNANSVGVFIAAISAVVAGFILLWNTSSGFRNFFIGMWNAIKNATSSVVSAIITFFTVTIPTTFNNFIVFISQLPAKIGSFFMQLPSIIANLFTQAFTGIVNWGANVIAWVAMAIPNIINSIGTFFNELPGKIGYALDFAIGSFVKFGADAITWITTNVPIIIENIVNFFLTLPGKLWSVFLSVLNYIGQWGSNTINWIGTNVPKIVSSIINFFTSLPSRLWNIFLNAINNIKQWGSNIMNWAKATIPGIAHSIAEFFRGLPGDLLKIGEDLIKGLWNGISNMAKWVRDKIKDFASGVVKGFKDALGIHSPSKVMRKEVGVYVSSGVAQGIKDGIPGVNKSITDVANNIIKNKGIVTDAIKGIANNTATNVQVAGTTTIRNSVLTNTNEQNSTDTIDNSKKYGKELNENLGKGITDDQQKATIPVQNLVDTIGTKMSDLAISFAKNGQDSDTNLGTGITTNSAAATTPVNTLITNITTALKTFIQACIGHGQDTDMSLGTGITSNSTAVTGAVNNVISTVGNNLDTFATGAIQYGENTDVSLGTGVTSNAGNVTGSINNLISSITNIFSTFVNSCVSIGTGIVNAIGHGIQSSENNLVGIVHELTQKVIDAFTGPDGFDIQSPSKRLFEIGSYVIQGFINGLSSQDVLTFFKSKISSMLDVAGNVSQWLVAALAITDTPMNWLPGLEKLVQAESGGNPLAVNPQSVNGEHATGLLQTLPSTFISNAVKGLNNILNPIDNAVAAINYIKRIYGSIYNTPLFKNPGSYVGYWTGTDSTKPGLGYVNEKGWEVIDFSGGQMVQSHEDSINLLNRASSAINALNSAMSRLKASTTTTVNTSTNNTNSNNNVTIGDGDLYITLKTPDGKELARQVLPWVDIFQGKNLRKKKKGVTV
Other Proteins in cluster: phalp2_19846
Total (incl. this protein): 3 Avg length: 1440,7 Avg pI: 6,81

Protein ID Length (AA) pI
5tFDA 1443 6,04303
7kzHn 1423 5,23256
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24823
7mvK1
8 27,0% 1365 5.518E-133
2 phalp2_1271
CG1p
1 23,1% 1467 1.147E-58
3 phalp2_15998
7RoFJ
265 20,6% 1442 4.168E-57
4 phalp2_22967
3lUBD
83 22,8% 1557 2.832E-33
5 phalp2_6330
7c3Ed
8 23,4% 1158 3.290E-29
6 phalp2_3959
7qClx
89 20,6% 1464 2.259E-28
7 phalp2_7794
7rQ6d
55 22,2% 1420 2.615E-24
8 phalp2_22110
6cZ09
37 20,8% 1177 5.970E-24
9 phalp2_32251
7vWBY
4 22,7% 1178 1.464E-21
10 phalp2_18752
YoZh
2 23,2% 1086 1.281E-17

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage PM1
[NCBI]
547228 Pemunavirus > Pemunavirus PM1
Host Bacillus subtilis subsp. natto
[NCBI]
86029 Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus > Bacillus subtilis group

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
AB711120 [NCBI]
CDS location
range 35339 -> 39709
strand +
CDS
ATGAATGGAACGGTAAGGGAGATCAAGGCGCAGTTTATCGCGGTCGTATCCGATTTTAAGAAAAAAATCAATTCAGCCGTAGACAGCTTACAAGAAATTGGAACACAAACGGAAAAATCCGTCGATAAAGCGAACCGGGGTCTTGATAGCTTATCCAAAGGGCTGAAAAGCCTGAATAAAACCCTTGCTAATTCCGGAAAAGCAGACGAATTCAAAGAATTGAGTTCTGCCCTTGATGATGTACAAAAGGAATTTGAGCAAACAGGAAAAGTCAGCAAAGATACCATGAAAAACCTTCAGAAAGAGTTGTCTAAATCAAAAGGCTCGCTAGAAGAGTTGGGGAAAGCTGACGCCTTCGATGGCATGATACGTGAAATATCGCGTGTAGAAGGTTCTCTAGACGATTTGGACAAAGTTGACTTCCGTAACCTCATCGGAGAGATAAACCGCTCAGAACAGGCGTTAAGCGGCTTAAATTCAGCCGATCTTTCAGGACTTCAAAGCGAAATCGATCAGACAAGTTTTACTGAAGTGACAAGCGGCGCGACTAGGGCCGCCGGAAGTGTCGGGGAATTGAACAACGAATCATTAAGCGGCTTACGTTCCGAAGCGGATCGGACAGGCGCAAGCTTGGCGGAGGTAGGTGCAGCCGGGACAGAAGCCGGAAACGAAGCTGAAGCGGGAGGAATAGCGGCAGGCCTTGGATTCAAAGGTGCGATAGGTGCCGTTGTCGCTTTTACTTCCACTGTAGGCAGCGCTGTCCTCGGCCTTGTCGGTTTTGTTAAAAGTGGTATAGACCTCCAGAAGACTATGAACAATGTTCAGATCAGAACAGGCGCCACGAATGCGGAAATGGCTAAATTCCGCAATAACATTATGAACGTATACAAAGGCGGATACGGAGAAGGCTTCTCTGATATCGCCGATGCCATGGTTCGTATTGAACAGACGACAAATCTTTCAGGGAAAGCACTAGAGGACGCGACGAAACAAGCGCTTTTATTCCGTGATGAATTCGGATACGAAGTACCTGAAAGCATGAAAACCGTTAATGTCATGATGAAACAGTTCGGAATTACCTCCGAACAGGCGTTCAATTTACTAGCGCAAGGGCAGCAGCAGGGTCTTGATTACGCCGACGACATGCTGGATACATTTTGGGAATACTCTGTTTATTTCAAACAATTAGGGTATGACGCTAACGGCATGTTCGATGTCCTGAAGGCGGGAGCGGACGCCGGGGCTTTTAACCTCGATAAAGTCGGTGACGCCATTAAAGAATTCGGTATCAGGGTTAAAGATGGATCAGATACTACGAACGAAGCTTTCAACGATTTAGGTCTTGACGCCGGAGAAATGCAAGAGATGTTTGCGAAGGGTGGAGACACCGCACAAAAAGCACTACAAACCGTATTCGATAAGCTGCAAAAAGTAAATGACAAAACCGTCAAAAACACGGCTGCCGTTAACCTATTCGGTACGATGTATGAAGACCTCGAAGAAAACACGATTGCAGCTATGAATAACGTGCAAAAGTCGGGCGACATGACTGCCGACACGATGAAAAAGATTGATGAAATTCACTTCGATAACATCGGAGCGGCTTTCACAGGTCTATGGCGTTGGTTCAATGGTTCTTTCCTTATCCCGCTGCAATCACAAGCGATGCCAGCTATTAATAACTTCGCCAACAATGCGAAAAGAACACTGGATGCCATCACTTCGGGTGATAATCAGAAGATATCTGATTTGCTGGAAAGTTGGGGTCTGAATGATCAGCAAATAAACCATGTGTTCGTTGCTCTAAACAAAGTCAAAGACGCTTTTGCGGCTGTACGTGCCCTCGCTTCAGGTGATGAAAAGACAGGCGTAAACCTATTGCAGAAGTTGGGACTGAACGAAAATCAGATTAATAACATAGTTAATTTGTTCAGTACGCTCAAATCCTATATGTCTACCGCTATTGGGGTATTCAAACAATTCGCTTCAGCGATAGGCGGGTTTTTCTCAGGATTATGGGATTTAATAGGCCCGTATATCATGCCAGCACTTGATGCGGTTGTCGGATTCGTGCAGGAAACCCTAGCGAAAATCATCGGATTTTGGAACACAGACGGTCAGCAGATCATGCAAGCGGCGCAAAATGTCTTTAATTTTATATTATCGATTATTAAATTTGTCATGCCCGTTGTCCTAGCCATTATCCAAAGCGTTTGGGGCAATATCAAAGGTGTTATATCCGGCGCACTTAATATCATCATGGGGCTTGTTCGCATTTTCACCGGGTTATTCACCGGAGACTTTTCGAAAATGTGGCAGGGTATCAAACAATTGTTCGCCGGAGCGATCCAATTCGTTTGGAACCTCGTAAACTTGCTCTTTATCGGTAAGATCATTGGAGGTATTAAGTCTCTAGTAAAAACAGGATTGACATTCCTTAAAGATTTTTGGACTAAGATTCCGACTTTATTCCAAACGGGAGTCACGAAAGCGAATAAATTCATTTCTGATATGGTCGTAAAAATACTCGGATTCTTGAAAAATCTCGCGATAAATGCTGTCAAGGCTGTTTGGAATATGTTCACGGGTATCATCAAATGGTTCGCCAACATTAGAACAAACGCCGTTAAAATCTTTAATGCCATGAAAAATACAATACAAGCAATATATAACGCAATTAAAAACGCGGTCATTGCATCAGTTAAATTCATGGTATCTAAAGTAGTCGGTTTTTTTAAATCCTTATATAACACTGGCAAGAAAATTTTTACAGATACAAAGAATTTCTTTACAAGTATATGGAACAAAATTAAGGATTCCGTCGTGAATGCTGCGAAAAACATGTGGAACGGTGCTAGGAAAAAATTCACCGACATGAAAAATGGTATTTCTAAAATTTTCACATCGGTAAAAGATGGAATCAAGAAAAAATTTGATGATATTGTCGAATTCGCGAAGAAATTGCCGGGACGAATTGGTTCAGGTATCAAGAAAATGGCTAGTAAGGTCGGCGACGGCATTAAGAGCCTAGCGAATACCATGAATAAGAAACTTGCGTTAGGTGTTAATGGCGTCATCGGCGGGATCAACTGGGTACTAGACACGCTCAAAGTACCTAAGAAAGTCGGCCGCGTTAAGAAGTGGGAACCGCCGCAGTATGCGAAAGGTACCGGAGGACACCCCGGCGGACCTATGATCGTCGGAGACGGACGTAAAAAAGAGCTTGTTCAATATCCTGACGGAACGACATTCCTAAGCCCGGCGACAGATACACTTGTACCGAATGCCCCAGCAGGAACAAAGGTTCTATCAGGTAAAGATACAGAAACACTTATGAAGAGCATCCCGATGTACAAAAATGGCGATGGTTCAGGCATTTCCGATTTCCTTGGATTCGTTGGCGGCAAGGTCAAACAGGGCGCGCAATGGGTCGGAAGCAAAGTGAAACAAGGCGCGGATTACGTCAAAGATAAGGCGGAGGAATTATGGGGTTATGTCACAGACCCTAAAAAGCTACTGAAGAAAGGTTTAGAAATGACCGGGTTCAGCATGCCGGAGATTCCAGCCATAGCAAAAGCAACGCCTCAAATCCTTGGGAAATTAGTCGATGGCGGGGTTTCCTACTTGAAAGACAAGTTGTCAGACTTCAGTTTCTTGGGCGACAGCAACGCGCCGGGGAACGTCAAGTCTTGGATTGCACAAGCGGTTGCCATCACAAAAAGCCCTACAAGCTGGATAGAACCACTTATCACGATCGCTATGAGAGAATCTGGCGGACGTACAGGGTCATCCACTATTAACAAATGGGATATCAACTGGAAGCGCGGTACACCTTCTATGGGTCTTATGCAGACGATCAAACCGACGTTCGACTCTTTCAAAATGAAAGGTCACGGAAATATCATGAATCCGATTGATAACGCGATTGCTGCCATTCGATACATTAAACACCGATACGGAACGCCAATGAATACGCCGGGACTTCGTTCGCTTGCAGCGGGCGGCCCTTACAAGGGTTATGAGAACGGCGGCAGAGTATTTGGGAAACAATTGGCATGGCTTGCTGAAAATCCGGGCGTCGCTGAATGGGTCATACCTGAAGACGGCAGCGCAAACGCTTATACTCATTGGGCTAACGCTGGAATCGCTAACGGATTCATGAGTAATGATAATGGATCGCCAGCGACAAGAAGCGGACAGGCCGCGCCTATCGCCAATATGGACGAAACAAATACGCTTTTAACGCAAGCGGTAAAAATATTAAGAGAAATAGCGGATAACGATAATCAATTTGTACTTCCAGTAGAGGATGTGGACAGAATACAAGGAAAACGTGTAAAAGCTGAAGCTTTCAAGAATGGGGTTGTGTAA

Gene Ontology

Description Category Evidence (source)
GO:0016020 membrane cellular component None (UniProt)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0002c11b3a_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (5tFDA) rather than this protein.
PDB ID
5tFDA
Method AlphaFoldv2
Resolution 52.53
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50