Protein
- Protein accession
- M4ZR40 [UniProt]
- Representative
- 5tFDA
- Source
- UniProt (cluster: phalp2_19846)
- Protein name
- Transglycosylase SLT domain-containing protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MNGTVREIKAQFIAVVSDFKKKINSAVDSLQEIGTQTEKSVDKANRGLDSLSKGLKSLNKTLANSGKADEFKELSSALDDVQKEFEQTGKVSKDTMKNLQKELSKSKGSLEELGKADAFDGMIREISRVEGSLDDLDKVDFRNLIGEINRSEQALSGLNSADLSGLQSEIDQTSFTEVTSGATRAAGSVGELNNESLSGLRSEADRTGASLAEVGAAGTEAGNEAEAGGIAAGLGFKGAIGAVVAFTSTVGSAVLGLVGFVKSGIDLQKTMNNVQIRTGATNAEMAKFRNNIMNVYKGGYGEGFSDIADAMVRIEQTTNLSGKALEDATKQALLFRDEFGYEVPESMKTVNVMMKQFGITSEQAFNLLAQGQQQGLDYADDMLDTFWEYSVYFKQLGYDANGMFDVLKAGADAGAFNLDKVGDAIKEFGIRVKDGSDTTNEAFNDLGLDAGEMQEMFAKGGDTAQKALQTVFDKLQKVNDKTVKNTAAVNLFGTMYEDLEENTIAAMNNVQKSGDMTADTMKKIDEIHFDNIGAAFTGLWRWFNGSFLIPLQSQAMPAINNFANNAKRTLDAITSGDNQKISDLLESWGLNDQQINHVFVALNKVKDAFAAVRALASGDEKTGVNLLQKLGLNENQINNIVNLFSTLKSYMSTAIGVFKQFASAIGGFFSGLWDLIGPYIMPALDAVVGFVQETLAKIIGFWNTDGQQIMQAAQNVFNFILSIIKFVMPVVLAIIQSVWGNIKGVISGALNIIMGLVRIFTGLFTGDFSKMWQGIKQLFAGAIQFVWNLVNLLFIGKIIGGIKSLVKTGLTFLKDFWTKIPTLFQTGVTKANKFISDMVVKILGFLKNLAINAVKAVWNMFTGIIKWFANIRTNAVKIFNAMKNTIQAIYNAIKNAVIASVKFMVSKVVGFFKSLYNTGKKIFTDTKNFFTSIWNKIKDSVVNAAKNMWNGARKKFTDMKNGISKIFTSVKDGIKKKFDDIVEFAKKLPGRIGSGIKKMASKVGDGIKSLANTMNKKLALGVNGVIGGINWVLDTLKVPKKVGRVKKWEPPQYAKGTGGHPGGPMIVGDGRKKELVQYPDGTTFLSPATDTLVPNAPAGTKVLSGKDTETLMKSIPMYKNGDGSGISDFLGFVGGKVKQGAQWVGSKVKQGADYVKDKAEELWGYVTDPKKLLKKGLEMTGFSMPEIPAIAKATPQILGKLVDGGVSYLKDKLSDFSFLGDSNAPGNVKSWIAQAVAITKSPTSWIEPLITIAMRESGGRTGSSTINKWDINWKRGTPSMGLMQTIKPTFDSFKMKGHGNIMNPIDNAIAAIRYIKHRYGTPMNTPGLRSLAAGGPYKGYENGGRVFGKQLAWLAENPGVAEWVIPEDGSANAYTHWANAGIANGFMSNDNGSPATRSGQAAPIANMDETNTLLTQAVKILREIADNDNQFVLPVEDVDRIQGKRVKAEAFKNGVV
- Physico‐chemical
properties -
protein length: 1456 AA molecular weight: 157590,8 Da isoelectric point: 9,16 hydropathy: -0,15
Representative Protein Details
- Accession
- 5tFDA
- Protein name
- 5tFDA
- Sequence length
- 1443 AA
- Molecular weight
- 153776,85380 Da
- Isoelectric point
- 6,04303
- Sequence
-
MANDDSNNLGGKVFLDTTEFKAGVTDLNRQIRVIESGFKAAAAGMDDWGKSSEGLRLKINSLNQVTDLQKQKIANLTQQYKEVVAAKGEDSKQAQNLQVRINNETASLNKNLKELSNTSNALKNLGNDSKDTGQNVDKLNKSADETGNNFKELSGNLLKVTGIAAVGTMAVNAGKSVLGFSTDSQKAMNSFQAQTGASTSQMSQFKQQIDQIYADNFGESLDDIANSMAQVKQVTGESGDTLKNTTEDALLLRDTFGFEVPESIRAANSLIKNFGITGEQAYNLIAQGAQNGANKNDDLLDSLNEYSQEFKSLGFNAQDFTNILITGAENGSFSIDKMADAVKEFEIRSKDGSQTSIQGFQALGLNANQMFETFAKGGDGAKKAFQLVIDKLVAMKDPVAQNQAGVNLFGTQFEDLGIKGIASLGSISDKANITKDSLSQLNGVKYNDASSALEGLKRQLIESVGDPIGKEVTPKINNMINSLKKVDTSNIVNGFGWIVDNAGSIAVGIGTIVSAWAGFKVGTAINTAVQGVIEFNKATKDATLAQAALNLVMNANSVGVFIAAISAVVAGFILLWNTSSGFRNFFIGMWNAIKNATSSVVSAIITFFTVTIPTTFNNFIVFISQLPAKIGSFFMQLPSIIANLFTQAFTGIVNWGANVIAWVAMAIPNIINSIGTFFNELPGKIGYALDFAIGSFVKFGADAITWITTNVPIIIENIVNFFLTLPGKLWSVFLSVLNYIGQWGSNTINWIGTNVPKIVSSIINFFTSLPSRLWNIFLNAINNIKQWGSNIMNWAKATIPGIAHSIAEFFRGLPGDLLKIGEDLIKGLWNGISNMAKWVRDKIKDFASGVVKGFKDALGIHSPSKVMRKEVGVYVSSGVAQGIKDGIPGVNKSITDVANNIIKNKGIVTDAIKGIANNTATNVQVAGTTTIRNSVLTNTNEQNSTDTIDNSKKYGKELNENLGKGITDDQQKATIPVQNLVDTIGTKMSDLAISFAKNGQDSDTNLGTGITTNSAAATTPVNTLITNITTALKTFIQACIGHGQDTDMSLGTGITSNSTAVTGAVNNVISTVGNNLDTFATGAIQYGENTDVSLGTGVTSNAGNVTGSINNLISSITNIFSTFVNSCVSIGTGIVNAIGHGIQSSENNLVGIVHELTQKVIDAFTGPDGFDIQSPSKRLFEIGSYVIQGFINGLSSQDVLTFFKSKISSMLDVAGNVSQWLVAALAITDTPMNWLPGLEKLVQAESGGNPLAVNPQSVNGEHATGLLQTLPSTFISNAVKGLNNILNPIDNAVAAINYIKRIYGSIYNTPLFKNPGSYVGYWTGTDSTKPGLGYVNEKGWEVIDFSGGQMVQSHEDSINLLNRASSAINALNSAMSRLKASTTTTVNTSTNNTNSNNNVTIGDGDLYITLKTPDGKELARQVLPWVDIFQGKNLRKKKKGVTV
Other Proteins in cluster: phalp2_19846
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_24823
7mvK1
|
8 | 27,0% | 1365 | 5.518E-133 |
| 2 |
phalp2_1271
CG1p
|
1 | 23,1% | 1467 | 1.147E-58 |
| 3 |
phalp2_15998
7RoFJ
|
265 | 20,6% | 1442 | 4.168E-57 |
| 4 |
phalp2_22967
3lUBD
|
83 | 22,8% | 1557 | 2.832E-33 |
| 5 |
phalp2_6330
7c3Ed
|
8 | 23,4% | 1158 | 3.290E-29 |
| 6 |
phalp2_3959
7qClx
|
89 | 20,6% | 1464 | 2.259E-28 |
| 7 |
phalp2_7794
7rQ6d
|
55 | 22,2% | 1420 | 2.615E-24 |
| 8 |
phalp2_22110
6cZ09
|
37 | 20,8% | 1177 | 5.970E-24 |
| 9 |
phalp2_32251
7vWBY
|
4 | 22,7% | 1178 | 1.464E-21 |
| 10 |
phalp2_18752
YoZh
|
2 | 23,2% | 1086 | 1.281E-17 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Bacillus phage PM1 [NCBI] |
547228 | Pemunavirus > Pemunavirus PM1 |
| Host |
Bacillus subtilis subsp. natto [NCBI] |
86029 | Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus > Bacillus subtilis group |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
AB711120
[NCBI]
CDS location
range 35339 -> 39709
strand +
strand +
CDS
ATGAATGGAACGGTAAGGGAGATCAAGGCGCAGTTTATCGCGGTCGTATCCGATTTTAAGAAAAAAATCAATTCAGCCGTAGACAGCTTACAAGAAATTGGAACACAAACGGAAAAATCCGTCGATAAAGCGAACCGGGGTCTTGATAGCTTATCCAAAGGGCTGAAAAGCCTGAATAAAACCCTTGCTAATTCCGGAAAAGCAGACGAATTCAAAGAATTGAGTTCTGCCCTTGATGATGTACAAAAGGAATTTGAGCAAACAGGAAAAGTCAGCAAAGATACCATGAAAAACCTTCAGAAAGAGTTGTCTAAATCAAAAGGCTCGCTAGAAGAGTTGGGGAAAGCTGACGCCTTCGATGGCATGATACGTGAAATATCGCGTGTAGAAGGTTCTCTAGACGATTTGGACAAAGTTGACTTCCGTAACCTCATCGGAGAGATAAACCGCTCAGAACAGGCGTTAAGCGGCTTAAATTCAGCCGATCTTTCAGGACTTCAAAGCGAAATCGATCAGACAAGTTTTACTGAAGTGACAAGCGGCGCGACTAGGGCCGCCGGAAGTGTCGGGGAATTGAACAACGAATCATTAAGCGGCTTACGTTCCGAAGCGGATCGGACAGGCGCAAGCTTGGCGGAGGTAGGTGCAGCCGGGACAGAAGCCGGAAACGAAGCTGAAGCGGGAGGAATAGCGGCAGGCCTTGGATTCAAAGGTGCGATAGGTGCCGTTGTCGCTTTTACTTCCACTGTAGGCAGCGCTGTCCTCGGCCTTGTCGGTTTTGTTAAAAGTGGTATAGACCTCCAGAAGACTATGAACAATGTTCAGATCAGAACAGGCGCCACGAATGCGGAAATGGCTAAATTCCGCAATAACATTATGAACGTATACAAAGGCGGATACGGAGAAGGCTTCTCTGATATCGCCGATGCCATGGTTCGTATTGAACAGACGACAAATCTTTCAGGGAAAGCACTAGAGGACGCGACGAAACAAGCGCTTTTATTCCGTGATGAATTCGGATACGAAGTACCTGAAAGCATGAAAACCGTTAATGTCATGATGAAACAGTTCGGAATTACCTCCGAACAGGCGTTCAATTTACTAGCGCAAGGGCAGCAGCAGGGTCTTGATTACGCCGACGACATGCTGGATACATTTTGGGAATACTCTGTTTATTTCAAACAATTAGGGTATGACGCTAACGGCATGTTCGATGTCCTGAAGGCGGGAGCGGACGCCGGGGCTTTTAACCTCGATAAAGTCGGTGACGCCATTAAAGAATTCGGTATCAGGGTTAAAGATGGATCAGATACTACGAACGAAGCTTTCAACGATTTAGGTCTTGACGCCGGAGAAATGCAAGAGATGTTTGCGAAGGGTGGAGACACCGCACAAAAAGCACTACAAACCGTATTCGATAAGCTGCAAAAAGTAAATGACAAAACCGTCAAAAACACGGCTGCCGTTAACCTATTCGGTACGATGTATGAAGACCTCGAAGAAAACACGATTGCAGCTATGAATAACGTGCAAAAGTCGGGCGACATGACTGCCGACACGATGAAAAAGATTGATGAAATTCACTTCGATAACATCGGAGCGGCTTTCACAGGTCTATGGCGTTGGTTCAATGGTTCTTTCCTTATCCCGCTGCAATCACAAGCGATGCCAGCTATTAATAACTTCGCCAACAATGCGAAAAGAACACTGGATGCCATCACTTCGGGTGATAATCAGAAGATATCTGATTTGCTGGAAAGTTGGGGTCTGAATGATCAGCAAATAAACCATGTGTTCGTTGCTCTAAACAAAGTCAAAGACGCTTTTGCGGCTGTACGTGCCCTCGCTTCAGGTGATGAAAAGACAGGCGTAAACCTATTGCAGAAGTTGGGACTGAACGAAAATCAGATTAATAACATAGTTAATTTGTTCAGTACGCTCAAATCCTATATGTCTACCGCTATTGGGGTATTCAAACAATTCGCTTCAGCGATAGGCGGGTTTTTCTCAGGATTATGGGATTTAATAGGCCCGTATATCATGCCAGCACTTGATGCGGTTGTCGGATTCGTGCAGGAAACCCTAGCGAAAATCATCGGATTTTGGAACACAGACGGTCAGCAGATCATGCAAGCGGCGCAAAATGTCTTTAATTTTATATTATCGATTATTAAATTTGTCATGCCCGTTGTCCTAGCCATTATCCAAAGCGTTTGGGGCAATATCAAAGGTGTTATATCCGGCGCACTTAATATCATCATGGGGCTTGTTCGCATTTTCACCGGGTTATTCACCGGAGACTTTTCGAAAATGTGGCAGGGTATCAAACAATTGTTCGCCGGAGCGATCCAATTCGTTTGGAACCTCGTAAACTTGCTCTTTATCGGTAAGATCATTGGAGGTATTAAGTCTCTAGTAAAAACAGGATTGACATTCCTTAAAGATTTTTGGACTAAGATTCCGACTTTATTCCAAACGGGAGTCACGAAAGCGAATAAATTCATTTCTGATATGGTCGTAAAAATACTCGGATTCTTGAAAAATCTCGCGATAAATGCTGTCAAGGCTGTTTGGAATATGTTCACGGGTATCATCAAATGGTTCGCCAACATTAGAACAAACGCCGTTAAAATCTTTAATGCCATGAAAAATACAATACAAGCAATATATAACGCAATTAAAAACGCGGTCATTGCATCAGTTAAATTCATGGTATCTAAAGTAGTCGGTTTTTTTAAATCCTTATATAACACTGGCAAGAAAATTTTTACAGATACAAAGAATTTCTTTACAAGTATATGGAACAAAATTAAGGATTCCGTCGTGAATGCTGCGAAAAACATGTGGAACGGTGCTAGGAAAAAATTCACCGACATGAAAAATGGTATTTCTAAAATTTTCACATCGGTAAAAGATGGAATCAAGAAAAAATTTGATGATATTGTCGAATTCGCGAAGAAATTGCCGGGACGAATTGGTTCAGGTATCAAGAAAATGGCTAGTAAGGTCGGCGACGGCATTAAGAGCCTAGCGAATACCATGAATAAGAAACTTGCGTTAGGTGTTAATGGCGTCATCGGCGGGATCAACTGGGTACTAGACACGCTCAAAGTACCTAAGAAAGTCGGCCGCGTTAAGAAGTGGGAACCGCCGCAGTATGCGAAAGGTACCGGAGGACACCCCGGCGGACCTATGATCGTCGGAGACGGACGTAAAAAAGAGCTTGTTCAATATCCTGACGGAACGACATTCCTAAGCCCGGCGACAGATACACTTGTACCGAATGCCCCAGCAGGAACAAAGGTTCTATCAGGTAAAGATACAGAAACACTTATGAAGAGCATCCCGATGTACAAAAATGGCGATGGTTCAGGCATTTCCGATTTCCTTGGATTCGTTGGCGGCAAGGTCAAACAGGGCGCGCAATGGGTCGGAAGCAAAGTGAAACAAGGCGCGGATTACGTCAAAGATAAGGCGGAGGAATTATGGGGTTATGTCACAGACCCTAAAAAGCTACTGAAGAAAGGTTTAGAAATGACCGGGTTCAGCATGCCGGAGATTCCAGCCATAGCAAAAGCAACGCCTCAAATCCTTGGGAAATTAGTCGATGGCGGGGTTTCCTACTTGAAAGACAAGTTGTCAGACTTCAGTTTCTTGGGCGACAGCAACGCGCCGGGGAACGTCAAGTCTTGGATTGCACAAGCGGTTGCCATCACAAAAAGCCCTACAAGCTGGATAGAACCACTTATCACGATCGCTATGAGAGAATCTGGCGGACGTACAGGGTCATCCACTATTAACAAATGGGATATCAACTGGAAGCGCGGTACACCTTCTATGGGTCTTATGCAGACGATCAAACCGACGTTCGACTCTTTCAAAATGAAAGGTCACGGAAATATCATGAATCCGATTGATAACGCGATTGCTGCCATTCGATACATTAAACACCGATACGGAACGCCAATGAATACGCCGGGACTTCGTTCGCTTGCAGCGGGCGGCCCTTACAAGGGTTATGAGAACGGCGGCAGAGTATTTGGGAAACAATTGGCATGGCTTGCTGAAAATCCGGGCGTCGCTGAATGGGTCATACCTGAAGACGGCAGCGCAAACGCTTATACTCATTGGGCTAACGCTGGAATCGCTAACGGATTCATGAGTAATGATAATGGATCGCCAGCGACAAGAAGCGGACAGGCCGCGCCTATCGCCAATATGGACGAAACAAATACGCTTTTAACGCAAGCGGTAAAAATATTAAGAGAAATAGCGGATAACGATAATCAATTTGTACTTCCAGTAGAGGATGTGGACAGAATACAAGGAAAACGTGTAAAAGCTGAAGCTTTCAAGAATGGGGTTGTGTAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0016020 | membrane | cellular component | None (UniProt) |
| GO:0098003 | viral tail assembly | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi0002c11b3a_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(5tFDA)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50