Protein
- Protein accession
- M1ID49 [UniProt]
- Representative
- 3IYDg
- Source
- UniProt (cluster: phalp2_10774)
- Protein name
- Lysozyme
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MARLETKYIVIHCSQTRPSQKIGAKDIDRWHRERGWLKIGYAKVIKRDGTVEQGRADDELQAHVKEYNHVSTSVCVVGGAVEENWKEPEDNFTGEQWESLKKVLEELVIKFPEARIVGHYELDERKTCPNFNVREYLLNEDIKGYKFADSTVTDGDIQEMKDAGEL
- Physico‐chemical
properties -
protein length: 166 AA molecular weight: 19184,3 Da isoelectric point: 5,33 hydropathy: -0,76
Representative Protein Details
- Accession
- 3IYDg
- Protein name
- 3IYDg
- Sequence length
- 133 AA
- Molecular weight
- N/A Da
- Isoelectric point
- 4,42590
- Sequence
-
MRNGLTEFIEKXXXXGEVQQGRADDAVQAHVRGYNHTSYGLCLVGGALEEDWTKPDNNFTAEQYESLYKVLGELVLKYPEARIVGHYELDEKKTCPNMNVREYLLNEDIPNYKFQDGLTDEADLAELEDDHEQ
Other Proteins in cluster: phalp2_10774
| Total (incl. this protein): 14 | Avg length: 161,0 | Avg pI: 4,88 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 3IYDg | 133 | 4,42590 |
| 1yYts | 172 | 4,85419 |
| 4R9Sk | 167 | 5,00231 |
| 8363j | 172 | 4,95826 |
| 8A9FW | 167 | 4,96457 |
| 8CwLf | 166 | 5,33095 |
| 8zx3c | 167 | 4,96798 |
| Dhwc | 96 | 4,19025 |
| A0A4Y1NU36 | 166 | 4,90835 |
| A0A7S5Y9C4 | 173 | 4,72937 |
| A0A7S5YA06 | 173 | 4,72937 |
| A0A8S0I6Y9 | 164 | 4,83662 |
| A0AAX4G2N4 | 172 | 5,07603 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_3887
6RreM
|
7260 | 45,7% | 94 | 4.010E-26 |
| 2 |
phalp2_25112
1plEg
|
298 | 47,1% | 89 | 3.757E-22 |
| 3 |
phalp2_11894
2jrH4
|
329 | 46,8% | 94 | 2.489E-21 |
| 4 |
phalp2_24952
kjdf
|
441 | 40,7% | 103 | 1.004E-15 |
| 5 |
phalp2_1572
7VT79
|
79 | 36,9% | 92 | 5.964E-14 |
| 6 |
phalp2_7367
4xnnO
|
5 | 33,0% | 100 | 1.529E-13 |
| 7 |
phalp2_25435
2NLUL
|
9 | 38,5% | 83 | 7.344E-13 |
| 8 |
phalp2_11611
1k5JO
|
2203 | 32,2% | 96 | 2.823E-10 |
| 9 |
phalp2_7876
8Bm2S
|
19 | 33,5% | 128 | 1.198E-08 |
| 10 |
phalp2_12432
kMJA
|
24 | 33,3% | 90 | 1.976E-07 |
Domains
Domains [InterPro]
1
133 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend:
EAD
CBD
Linker
Disordered
Unannotated
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Pelagibacter phage HTVC011P [NCBI] |
1283078 | Autographiviridae > Stopavirus > Stopavirus HTVC011P |
| Host |
Candidatus Pelagibacter ubique HTCC1062 [NCBI] |
335992 | Proteobacteria > Alphaproteobacteria > Pelagibacterales > Pelagibacteraceae > Candidatus Pelagibacter > |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
KC465900
[NCBI]
CDS location
range 9643 -> 10143
strand +
strand +
CDS
ATGGCAAGACTAGAAACTAAATACATTGTGATACATTGTTCACAAACTAGACCTTCACAAAAGATAGGTGCTAAAGATATAGATAGATGGCACAGAGAAAGAGGTTGGTTAAAAATAGGTTATGCCAAAGTTATTAAAAGAGATGGTACTGTTGAGCAAGGCAGGGCAGATGATGAACTACAAGCACATGTTAAAGAATATAATCATGTATCAACTTCAGTTTGTGTAGTTGGTGGTGCAGTAGAAGAAAACTGGAAAGAACCAGAAGATAATTTTACAGGAGAACAATGGGAAAGTTTAAAGAAAGTTTTAGAAGAACTAGTAATTAAATTTCCTGAAGCAAGAATTGTAGGACATTATGAACTTGATGAAAGAAAAACATGTCCTAACTTTAATGTCAGAGAATATTTATTAAACGAAGATATTAAAGGTTACAAATTCGCAGACAGCACAGTCACTGATGGCGACATACAGGAAATGAAAGATGCAGGAGAACTCTAG
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0001897 | symbiont-mediated cytolysis of host cell | biological process | None (UniProt) |
| GO:0008270 | zinc ion binding | molecular function | None (UniProt) |
| GO:0008745 | N-acetylmuramoyl-L-alanine amidase activity | molecular function | None (UniProt) |
| GO:0009253 | peptidoglycan catabolic process | biological process | None (UniProt) |
| GO:0042742 | defense response to bacterium | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi0002b2a3b8_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(3IYDg)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50