Protein
- Protein accession
- A0A6J5MRY3 [UniProt]
- Representative
- 4xSpq
- Source
- UniProt (cluster: phalp2_31613)
- Protein name
- Putative lysin
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MARVPVVEGNSVRTEALPSAQINTAAPAEAFGMGQARQLQQLGQGASDLGVGLFRAQQEANQSRVDDALNQLRETELDLAYNPEQGYTNLKGVQALQRPDNKPLSDEYFDKMRTQSDQIESGLSNDMQKQAFRQRATDRLTQFRGNLMNYEGQENRNYELSVASGTISTASREIAAFYNDPERIAGAVGSIQAAAVKDGRLRGLSAEQIENNSRKLVSGAHLGAIEQAIAVQDPLYAEQYLRTFKQQLDPSDLLKARAQVDELASVYIGTAKASQAFSGYTAAENPSDLDRVTAITMQSESGGKRFGADGKMLESPAGAKGEMQVMDKTNLDPGYGVKPARDDSPDERARVGREYLGSMVREYDGNLAHAWAAYNAGPGALNKALKEAKEEGNPASWLDKMPRETQAYVAKNVKAYSEGGGRESPPTFEQLQARLDADPDLATRPGALKKAREELSRKYELFQKGQKETRSNAYAQAMEHIEGGGKYDTIPREIRNAVDPSKWDDLRKYEKTVMGGGREHSDLATYQLLAGDPGRVRGMSDSDFYAMRQQLSESDFKKFSDMRGKSSTDTKTPGALDLPMVNSVVDSRLQSMGLDPKAKNGSALARVGAIRKTINDEVLLRQEAAGRKFDDAEITKTVDDLFLRTRGFKSRTWLEAFTGQEGEVRKATLFGATVGDIPKDLQKTLEADFKSQGIDEPTDQQMLEAFFMGDLRRK
- Physico‐chemical
properties -
protein length: 714 AA molecular weight: 78549,6 Da isoelectric point: 5,51 hydropathy: -0,73
Representative Protein Details
- Accession
- 4xSpq
- Protein name
- 4xSpq
- Sequence length
- 398 AA
- Molecular weight
- 43479,39010 Da
- Isoelectric point
- 5,47203
- Sequence
-
VTVPRVPTYDNFQQMPAQFRPVEMQAAMPRADPGAQAASFGQAAQRAGAVAMDMELEALKQANQLRVDDALNKALEAEMRLAYDKDAGYTNQRGISALERKSGKPLADEYDEEFGKAIESIGAGLGNDYQRQVFGQAIAKRRAQFRAGAMKHEADEFRTYTLSVREGTIATRMQQIGLNYNNPEVIDEAITSIRAATYDAAKLQGMSAEWADAQARKMASNAHKTAIAAALEKNDITYADRYLKRYGKDMEADDLLQATGLITKEMDLRVGTSAATEVMGRWAPKIVPGDMDRLTNIVMGIESAGRRYDASGKLIEGPATKYGTAKGEMQVLDGTNRDPGYGVKPAADDSPEERARVGRDYLAAMVREYDGDVSKALAAYNWGPGNLDKAVKERGPQL
Other Proteins in cluster: phalp2_31613
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_5490
3g8aN
|
6 | 66,4% | 274 | 1.203E-105 |
| 2 |
phalp2_34622
5lqUi
|
2 | 26,9% | 423 | 6.681E-36 |
| 3 |
phalp2_11362
7yTtN
|
5 | 19,8% | 419 | 3.200E-24 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
uncultured Caudovirales phage [NCBI] |
2100421 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
LR796526
[NCBI]
CDS location
range 14059 -> 16203
strand -
strand -
CDS
ATGGCCCGCGTACCCGTGGTAGAAGGTAACAGCGTCCGCACGGAGGCACTGCCCTCCGCGCAGATCAACACCGCCGCTCCGGCCGAGGCGTTCGGCATGGGGCAGGCCCGCCAGCTGCAGCAGTTGGGGCAGGGCGCCAGCGACTTGGGTGTGGGCTTGTTCCGCGCACAGCAGGAGGCGAACCAGTCCCGCGTGGACGACGCCCTCAACCAGCTGCGCGAGACGGAGCTCGACCTCGCCTACAACCCTGAGCAGGGTTACACCAACCTCAAGGGTGTGCAGGCGTTGCAGCGCCCGGACAACAAGCCGCTGTCGGACGAATACTTCGACAAGATGCGCACGCAGTCCGACCAGATCGAGTCGGGCCTGTCCAACGACATGCAGAAACAGGCGTTCCGCCAACGGGCCACCGACCGGCTGACCCAGTTCCGCGGCAACCTGATGAACTACGAGGGACAGGAGAACCGCAACTACGAGCTGTCGGTCGCCTCCGGCACGATCTCCACCGCCTCACGCGAGATCGCCGCGTTCTACAACGACCCGGAGCGTATTGCCGGTGCGGTGGGTTCCATCCAAGCGGCTGCCGTCAAGGACGGGCGCTTGCGTGGGCTGTCCGCCGAGCAGATCGAGAACAACTCCCGCAAGCTGGTGAGCGGCGCGCACCTGGGCGCCATCGAGCAAGCCATCGCCGTGCAAGACCCGCTCTACGCCGAGCAGTACCTGCGCACGTTCAAGCAGCAGCTCGACCCGTCGGACTTGCTCAAGGCCCGCGCCCAAGTGGACGAGCTGGCCTCGGTCTACATCGGCACCGCCAAGGCGAGCCAGGCATTCAGCGGCTACACCGCCGCCGAGAACCCCAGCGACCTCGACCGGGTGACGGCGATCACCATGCAGAGCGAATCGGGCGGCAAGCGCTTCGGCGCCGACGGCAAGATGCTGGAGTCCCCCGCCGGTGCGAAGGGCGAGATGCAGGTGATGGACAAGACCAACCTCGATCCGGGGTATGGCGTCAAGCCTGCGCGCGACGACTCGCCGGACGAGCGTGCCCGCGTGGGCCGTGAGTACCTGGGCTCAATGGTGCGCGAGTACGACGGCAACCTGGCCCACGCATGGGCGGCCTACAACGCAGGCCCCGGCGCATTGAACAAGGCGCTTAAGGAGGCCAAGGAGGAAGGCAACCCGGCGTCGTGGCTGGACAAGATGCCGCGCGAGACGCAAGCCTACGTCGCCAAGAACGTCAAGGCGTATAGCGAAGGGGGTGGACGGGAATCCCCACCGACCTTCGAGCAGCTGCAGGCCCGCCTCGACGCCGACCCGGATCTGGCCACACGCCCCGGCGCGCTTAAGAAAGCCCGCGAGGAGCTGAGCCGCAAGTACGAGCTGTTCCAGAAGGGCCAGAAGGAGACCCGCTCGAACGCCTACGCGCAGGCGATGGAGCACATCGAGGGCGGTGGCAAGTACGACACGATCCCCCGCGAGATCCGCAACGCGGTGGACCCGTCCAAGTGGGATGACCTTCGCAAGTACGAGAAGACGGTCATGGGTGGGGGCCGGGAGCATTCCGACCTGGCCACGTACCAGTTGCTCGCTGGTGATCCGGGTCGGGTGCGCGGCATGAGCGACTCGGACTTCTACGCCATGCGCCAGCAGCTGTCGGAATCGGACTTCAAGAAGTTCTCCGACATGCGCGGCAAGTCCTCGACCGACACCAAAACACCCGGCGCGCTGGACTTGCCGATGGTCAACAGCGTGGTGGACAGCCGCCTGCAGAGCATGGGCCTGGACCCGAAAGCCAAGAACGGATCAGCACTTGCCAGAGTCGGCGCCATCCGCAAGACGATCAACGACGAGGTGCTCCTGCGACAGGAGGCCGCGGGGCGCAAGTTCGACGACGCGGAGATCACCAAGACGGTGGACGATCTGTTCCTGCGCACGCGGGGCTTCAAGTCCCGGACGTGGCTGGAGGCGTTCACCGGGCAGGAAGGGGAGGTGCGCAAAGCCACGTTGTTCGGCGCTACTGTCGGTGACATACCCAAAGACCTCCAAAAAACGCTGGAAGCTGACTTCAAGTCCCAAGGGATTGACGAGCCAACCGATCAGCAGATGCTGGAAGCGTTCTTTATGGGAGACCTCCGCCGCAAGTAA
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(4xSpq)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50