Protein
- Protein accession
- M4SKY3 [UniProt]
- Representative
- 1UBAd
- Source
- UniProt (cluster: phalp2_31094)
- Protein name
- Phage tail lysozyme domain-containing protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MATTPKARLYKMITPPTVKGGITVKVGGKTVAGPMDGMTTMIKATNSIGATTNSIAIIVQKMNESFATQMQMQIQQQQELADAREEGVQRLIDQRKDEQDDINRKKDKQDDLDAENKQEGQGGTSLSYKAGAIVGAVSNAFGFFEGIARFLGNVFKTIVSYAILKWIANPENTKKVKKMIEGVANIGKFLLKVAGWIVSMGLGGLGAFMENPLSFKGIFGIVKFITALGLFFAPAKMAKLGLKAVMSLFKGGKLFKIIGSMMKNLMKVFKGIVAFCAARPRAALILGGAILATWGLKALLDKDEEAAGDELEKDDKEQKNNDKKNWLGSLFGGNKDDKKEETPEKGSEESYEQWRRNYDASDTTINGLKEGDEGFEDALKKSWLGEQQNQIEGNSARITGKEAEGLEFDDGTEGKPEMAKGGWISGPQSGYPVSLDGGKSTAFIGHGTEYVAAPKAASGGAFVVPFDTPATRRDPSLTGRRMNEAGKMGFGLPGFSMGGLLDFIAKGEGGYNSMNQGTMFGRIVGSTHDAKSKLGKNLTDMTLREVMGLQRSRKLFAAGRYQIIPTTMRWIVDKMKLPAGSTFNSSLQDKMGEGLIKHKRPYAWNYIKGKHDDERGAMLALAREWASLPHPDTGKSVYGFGNKALHSVDEVRKALNDARGTQPEKEQSGLQKLWQGIKDRFSNKDQVQADPKQTSVNSGTTIDNNAVTEEVSTGSGDGNVNVTTGDVAPIEAGSGQNPTDMKPDPPIFIDNKYEPPANDYFRTRYGMMAEANTEPVEMF
- Physico‐chemical
properties -
protein length: 779 AA molecular weight: 84614,3 Da isoelectric point: 8,83 hydropathy: -0,49
Representative Protein Details
- Accession
- 1UBAd
- Protein name
- 1UBAd
- Sequence length
- 896 AA
- Molecular weight
- 94870,03770 Da
- Isoelectric point
- 9,39564
- Sequence
-
MAQARLYKMITPPKFKGGGITVKVGDKIVTQPAAGYAKSIAATNSLGASVNSIAIMVEEMKDSFAQYTFKNMELRESMMKQREDYIKDEKKRIKDAKRAAIRQAGLVKDRASEKVQEKKTNKEEDKQTIGAAKKSVGFFEGIAGLIKQIFKSLLIYTVLDWMSKPENQGKLKKIFKALRSMFEAFVKIADFLVTFGLDGLVEFLENPLSFKGIFGLLKFITVLGAIFAPVALAKFGLAAGGAIMKLVKGGGLKKMLMGLFRGIGGMVKGLIAFVKGMGLGGMLALGAGAIVVGTAVAAVNANQDGTAVIEDPDDPNKSQADEIRESGGMTGAPISAEMLGFARGGPLPQFAAGGWIHGPQSGYPVSLDGGRSTAFIGHGTEYVATKADGGGLGKAFVVPFDTPATRGNPGLTNTRLAEASRSGFGLPMPFSKGGEIPQMFLGGMIDGAKNLLGLNKTMDPVLFGLAKKGVGMSVNMLGGNPDYWKKPESQRSIEDMAREVASKSQVLKRGGQDKVHVTKEPMGSGTSKGSKNAATGPQKIDIAKVFQNMGGADLMGSYISDMGEAANKAKGGFVKGCTWCNKKRMAAGGLLDFIASGEGGYNSMNQGTRGGRIVGSTHNASEILGKNLTDMTVGEVMSQQSSGKLFAAGRYQIIPDTMKYIVKEMNIDKEAKYDKSLQDKLGVGLIKYKRPYAWQYIQKQHNDENGALLELAREWASLPDPATGESVYGNGNKALHSVAEVKAALNGARGGAAMVSDDPNLNLASAQTPGQGSDSPAGTQADPEVAQGPKKIDPMSIFNNLGGKELMQSYIDDINNDGKGTKISDAALAKKEAQAERQNGTQVSTQQLPDQPPPPVEPPPPVVAGNSGGYKNPASDFLMPRMGWLSDISTPPKSLT
Other Proteins in cluster: phalp2_31094
| Total (incl. this protein): 39 | Avg length: 782,9 | Avg pI: 8,31 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 1UBAd | 896 | 9,39564 |
| 111oK | 778 | 9,15008 |
| 1RoKn | 707 | 8,37491 |
| 1ST55 | 779 | 8,91064 |
| 1SVZq | 811 | 9,01218 |
| 1TI3y | 690 | 9,07323 |
| 1TPQ6 | 725 | 7,09023 |
| 1UE7M | 813 | 4,80627 |
| 1UPbz | 900 | 9,23634 |
| 1UU7H | 815 | 8,82825 |
| 1UwuH | 708 | 8,90336 |
| 1VIrs | 747 | 6,26766 |
| 1Voun | 698 | 6,56322 |
| 1W0Zu | 757 | 8,73813 |
| 1WaoB | 756 | 8,77320 |
| 1Wcls | 752 | 6,65445 |
| 1WvRQ | 757 | 8,63601 |
| 1g0W8 | 771 | 6,53293 |
| 1kupv | 819 | 9,26851 |
| 22qIL | 902 | 9,08677 |
| 2N8h | 786 | 9,18863 |
| 2SWIW | 826 | 8,81033 |
| 2UhL0 | 667 | 8,98987 |
| 36boT | 754 | 5,71194 |
| 3L9EL | 779 | 8,91064 |
| 3M2KE | 994 | 9,21532 |
| 3aPhS | 829 | 9,01598 |
| 3xf6A | 726 | 8,14321 |
| 7TjBq | 779 | 8,91064 |
| 7oLZ2 | 726 | 7,09035 |
| 89gsf | 904 | 9,15665 |
| 8AQSB | 779 | 8,91064 |
| 8CBFI | 904 | 9,15665 |
| 8EJa0 | 707 | 8,37491 |
| 8LUZE | 779 | 8,82741 |
| 8u3LF | 781 | 8,88086 |
| 8uvnJ | 708 | 8,54072 |
| poOK | 745 | 6,11908 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_38449
1fLhQ
|
12 | 30,0% | 1223 | 7.083E-142 |
| 2 |
phalp2_1947
4upJo
|
19 | 26,1% | 861 | 3.617E-53 |
| 3 |
phalp2_40218
2qjSX
|
23 | 28,2% | 702 | 4.494E-45 |
| 4 |
phalp2_6778
81ulx
|
3 | 21,8% | 796 | 5.899E-23 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Cyanophage P-RSM6 [NCBI] |
929832 | Kyanoviridae > |
| Host |
Prochlorococcus marinus str. NATL2A [NCBI] |
59920 | Cyanobacteria > Prochlorales > Prochlorococcaceae > Prochlorococcus > |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
HQ634193
[NCBI]
CDS location
range 178339 -> 180678
strand -
strand -
CDS
ATGGCAACAACACCAAAAGCACGATTATATAAGATGATAACTCCACCAACGGTGAAGGGTGGAATTACTGTGAAAGTTGGTGGTAAGACAGTTGCTGGTCCTATGGATGGTATGACCACCATGATAAAGGCAACTAATAGTATAGGTGCCACTACTAATAGCATTGCTATCATTGTACAGAAGATGAATGAATCTTTTGCTACACAGATGCAAATGCAGATACAGCAACAACAAGAATTAGCAGATGCAAGAGAAGAAGGAGTACAAAGATTAATAGATCAGAGAAAGGATGAGCAAGACGATATAAACAGAAAAAAAGATAAACAAGATGATTTAGATGCAGAGAATAAGCAGGAAGGTCAAGGTGGAACATCACTTTCATATAAAGCAGGTGCCATTGTCGGTGCTGTTTCTAATGCATTTGGTTTCTTCGAAGGTATTGCAAGGTTCTTAGGAAATGTCTTTAAAACAATTGTTAGTTATGCAATACTGAAATGGATTGCTAATCCTGAGAACACCAAGAAAGTCAAGAAGATGATAGAGGGTGTTGCTAATATTGGTAAGTTCTTACTTAAAGTTGCAGGTTGGATAGTCAGCATGGGACTGGGTGGACTGGGTGCTTTCATGGAAAACCCCTTGAGTTTTAAAGGTATATTTGGTATAGTAAAATTCATAACTGCATTGGGTTTATTCTTCGCACCTGCAAAGATGGCGAAGTTGGGACTGAAGGCAGTCATGTCTCTCTTTAAGGGTGGTAAACTCTTTAAGATAATTGGTAGTATGATGAAGAACCTGATGAAGGTTTTTAAAGGTATAGTAGCGTTCTGTGCTGCTAGACCTAGAGCAGCTCTCATCTTAGGAGGTGCTATACTTGCTACATGGGGTCTAAAAGCACTACTTGATAAGGATGAAGAAGCAGCAGGAGATGAATTAGAAAAGGATGACAAAGAACAAAAAAATAATGATAAAAAGAATTGGCTTGGTAGTTTATTTGGTGGGAATAAAGATGACAAAAAGGAAGAAACACCAGAGAAAGGATCAGAAGAATCTTATGAGCAATGGAGACGAAATTATGATGCTTCCGATACAACTATAAATGGTTTAAAAGAAGGTGATGAAGGTTTTGAGGATGCATTAAAGAAAAGTTGGTTGGGGGAGCAACAGAATCAAATAGAAGGCAACTCCGCAAGAATTACTGGTAAAGAAGCAGAAGGATTAGAATTTGATGATGGAACAGAAGGAAAGCCAGAAATGGCAAAAGGTGGATGGATATCAGGTCCTCAATCTGGTTATCCCGTCAGTCTTGATGGTGGCAAATCAACTGCCTTCATTGGGCATGGAACAGAGTATGTTGCTGCTCCCAAAGCAGCAAGCGGGGGTGCTTTCGTAGTACCGTTTGATACTCCCGCAACTAGAAGAGATCCAAGTCTGACTGGTAGAAGGATGAATGAAGCAGGTAAAATGGGATTTGGACTACCTGGATTTTCTATGGGTGGACTCTTAGACTTTATTGCTAAGGGTGAAGGTGGATATAACTCCATGAACCAAGGTACTATGTTTGGTCGTATTGTTGGTAGTACTCATGATGCAAAATCAAAACTTGGTAAGAACCTGACTGACATGACTCTTCGTGAAGTTATGGGACTTCAGAGAAGTCGTAAATTATTTGCTGCAGGTCGTTATCAGATCATACCTACTACTATGAGGTGGATCGTTGATAAAATGAAACTCCCTGCAGGTTCTACGTTTAATTCATCATTGCAAGATAAGATGGGTGAAGGTCTTATAAAACATAAGAGACCATATGCTTGGAACTATATTAAAGGTAAGCATGATGATGAGCGTGGTGCAATGCTTGCACTAGCTAGAGAGTGGGCTTCTTTACCACATCCTGATACAGGTAAATCTGTTTATGGTTTTGGTAACAAGGCACTTCATAGTGTTGATGAAGTTAGAAAAGCACTTAATGATGCAAGAGGTACTCAACCAGAGAAGGAACAAAGTGGACTACAGAAGTTATGGCAGGGAATAAAGGATAGATTCAGTAATAAAGATCAAGTTCAGGCTGACCCTAAACAGACATCAGTCAATAGTGGTACCACTATAGATAATAATGCTGTGACAGAAGAAGTCTCTACTGGTAGTGGAGACGGGAATGTTAACGTTACTACGGGTGATGTGGCTCCAATCGAAGCTGGTAGTGGACAAAATCCAACAGATATGAAACCTGATCCACCCATATTCATTGACAATAAGTATGAACCACCAGCTAATGATTATTTCCGTACTAGATATGGAATGATGGCAGAAGCAAACACCGAACCAGTTGAGATGTTCTAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0016020 | membrane | cellular component | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi0002c18054_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(1UBAd)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50