Protein
- Protein accession
- A0A077KC92 [UniProt]
- Representative
- 7mH6m
- Source
- UniProt (cluster: phalp2_10047)
- Protein name
- Putative transglycosylase
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MIIQELAYKVKIQAEEFLAGKKKVAQSVAELDQDVKKPLESVGKRFDGLTDGVTAFGNAGKNAFNQVQMGAAKFLGVALTLEGARRLFTSTTRNLVDLGNTSSFLDMGAKSLDGFNRAAAATGASEQSMTSMLMRLKNAQNWMAMPMGAPDASTIAIQQLQGMTGVDIMGEKDPGKMLLRSATALRKLNKAQAEVMWGQMGGAPDMFGLVYSGKLPAMQKEFEKRSNATDPAIKRANEVNETLEKLRQTVDNLGNDFVLAFGDDVNRLLKEFGDWVSTHKDDILGFFKDASSLAKQFADSVGGTTNALILLTAAWYKSPAGMIIGGAMTANSNIESAQEEAKRRGVGVGDILAERIKNKATEASNGESGLDWVKRKWGELWGSEPDQHAQSAKKSMLMNSVALTESGGNPNAVSSAGAAGAYQLMPGTARDLGLTPEERFDPEKSRAAASIHISRLLKHYNGNVTYALMAYNAGQKRIDDYLAGTGKPLTDETLNYPGKVLSYYQQQTAMASRPSGMPGGIDNSQQSHISIQKVEVNSNPQTIDQLTNAIAEQATRSRMNVSFSSGNQ
- Physico‐chemical
properties -
protein length: 568 AA molecular weight: 61108,5 Da isoelectric point: 8,34 hydropathy: -0,38
Representative Protein Details
- Accession
- 7mH6m
- Protein name
- 7mH6m
- Sequence length
- 441 AA
- Molecular weight
- 47860,05140 Da
- Isoelectric point
- 9,13770
- Sequence
-
MRLNQAKNQLTNPIPGSSPDYDLINFNRATGSDLFGAKNNDEMLSRLAAAFRRISLAQAENWGQRLGYSPAAVNLFRSPDFDKRHKALEQRSNVSEASIATARQLQRIMADLDQSTQNVKNSLLAAFGPGMVKELDAFSHWINAHGNDITGFFQGISAGAEKVTEALGGTDRALKDVVALYGASKIAKVAGSHSVIGKAGWAGALAYIGEPVIDKGLNALFGNYDAFQAARTAKTWGDFGHALIGETGGAHWEKGKWIDPRYQATPALTSAVARTESQGNPNAVSRAGAAGLMQLMPNTARDLGLTPAERFDPEKAYAAGQIHLSRLLRHYNGDTQLALMAYNAGQGRIDNYLAGKGQPLKQETLDYPQKVLENYHQIVQQASAPPAAIAAQTTDNRQTHSTHINTVNVMTHPQTVNQLQQSIEEQAKRHRMNTTLNNGMY
Other Proteins in cluster: phalp2_10047
| Total (incl. this protein): 2 | Avg length: 504,5 | Avg pI: 8,74 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 7mH6m | 441 | 9,13770 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_19637
4kMk0
|
16 | 36,5% | 501 | 7.274E-126 |
| 2 |
phalp2_1100
7t6JV
|
60 | 31,9% | 410 | 9.023E-59 |
| 3 |
phalp2_29671
8nRZg
|
6 | 25,5% | 568 | 3.842E-46 |
| 4 |
phalp2_24804
75WI3
|
128 | 25,0% | 528 | 1.763E-30 |
| 5 |
phalp2_7783
7lfjV
|
1 | 24,4% | 422 | 7.569E-23 |
| 6 |
phalp2_34328
3S3iK
|
17 | 25,6% | 506 | 6.870E-15 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Edwardsiella phage GF-2 [NCBI] |
1537091 | Gofduovirus > Gofduovirus GF2 |
| Host |
Edwardsiella tarda [NCBI] |
636 | Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Edwardsiella > |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
AP014629
[NCBI]
CDS location
range 11856 -> 13562
strand +
strand +
CDS
ATGATCATCCAAGAGTTAGCCTATAAGGTAAAAATCCAGGCAGAAGAGTTTCTCGCTGGGAAAAAGAAAGTAGCCCAAAGCGTCGCCGAATTAGATCAGGACGTAAAGAAGCCACTGGAAAGCGTCGGTAAGCGATTCGATGGCCTCACTGATGGTGTTACCGCCTTTGGGAATGCCGGTAAAAACGCATTCAATCAAGTGCAGATGGGGGCTGCAAAGTTCCTCGGAGTGGCCTTAACACTTGAGGGTGCGCGGCGGCTGTTTACGTCTACGACACGGAACCTGGTTGATCTGGGCAACACTTCATCGTTCCTCGACATGGGGGCGAAAAGCCTGGACGGTTTCAACCGGGCGGCCGCCGCCACCGGCGCTTCGGAACAGTCGATGACATCCATGCTGATGCGCTTGAAGAACGCGCAGAACTGGATGGCTATGCCGATGGGTGCGCCTGACGCTTCTACGATAGCCATACAGCAATTGCAGGGAATGACCGGCGTTGACATCATGGGGGAAAAAGACCCCGGCAAAATGCTTCTCAGGTCAGCTACGGCGCTGCGCAAATTAAACAAAGCGCAAGCCGAGGTGATGTGGGGACAGATGGGCGGCGCTCCTGACATGTTTGGGTTGGTTTATTCCGGTAAGCTGCCCGCCATGCAAAAGGAGTTTGAAAAACGATCCAACGCTACAGACCCAGCCATCAAGCGAGCAAATGAGGTTAACGAAACCCTGGAGAAGCTGCGCCAGACAGTAGATAACCTGGGTAACGATTTCGTCTTGGCATTCGGGGATGATGTTAACAGACTATTGAAAGAGTTCGGTGATTGGGTTTCGACTCATAAGGATGACATCCTCGGATTTTTCAAAGATGCATCCTCTTTGGCTAAGCAATTTGCTGACTCGGTTGGTGGAACCACTAACGCACTAATTCTTCTAACTGCTGCGTGGTACAAATCCCCGGCAGGAATGATTATTGGTGGAGCGATGACAGCGAATTCAAATATTGAAAGCGCCCAGGAGGAGGCAAAAAGGCGAGGAGTTGGAGTTGGGGACATCCTGGCGGAGCGTATAAAAAATAAGGCCACCGAAGCATCTAATGGGGAAAGCGGCCTTGATTGGGTAAAAAGGAAGTGGGGTGAGTTGTGGGGTTCTGAGCCAGATCAGCATGCCCAATCAGCAAAAAAAAGCATGTTAATGAACTCGGTCGCATTAACTGAAAGCGGAGGGAATCCGAACGCTGTCTCTTCTGCTGGTGCTGCTGGCGCTTATCAACTCATGCCGGGAACCGCGAGAGACTTAGGGCTAACGCCTGAAGAAAGGTTCGATCCTGAAAAATCAAGGGCGGCTGCCTCCATACACATAAGCAGATTGCTCAAGCATTATAATGGAAACGTCACATACGCATTGATGGCCTATAATGCTGGACAAAAAAGGATCGATGACTATCTTGCTGGAACTGGCAAGCCATTGACTGATGAAACGCTGAATTACCCAGGGAAAGTGCTGAGTTATTATCAGCAGCAAACAGCTATGGCTTCTCGCCCATCAGGAATGCCAGGAGGAATCGATAACAGTCAGCAAAGTCATATCAGCATTCAGAAAGTCGAGGTTAACAGCAACCCGCAGACTATTGATCAACTGACCAATGCTATTGCGGAACAGGCTACGCGTAGTAGAATGAATGTATCATTCTCAAGCGGGAACCAGTAA
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(7mH6m)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50