Protein
- Protein accession
- A0A9E8M2V6 [UniProt]
- Representative
- 42vZA
- Source
- UniProt (cluster: phalp2_3109)
- Protein name
- Lysin A, protease C39 domain
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MKMAAGDVVLKFDHVIIPQETYYWCGPATMQVLLSIRGIKVTEKYMADQLGTTENGTDTILYLTRELNERLGDIYRTVQVPGAGNLDQFREHVFHSIDAGYGVGGNIMVPPANYPRPQRGERAQYSGGWIYHYWSIVGKNERLDQMAIADSGFPDFYYWVTSEQALSMISGKGYTWAANAKVADDFLGSLSDADQRRVLAAAIQVSEPHRGA
- Physico‐chemical
properties -
protein length: 212 AA molecular weight: 23648,4 Da isoelectric point: 5,51 hydropathy: -0,29
Representative Protein Details
- Accession
- 42vZA
- Protein name
- 42vZA
- Sequence length
- 549 AA
- Molecular weight
- 60894,25120 Da
- Isoelectric point
- 4,38703
- Sequence
-
MTDPVSVLADAMQWSVTYERYAELYPALSQSLRESECTTYDRIAMYCAQTGHESAGLYYMEEQDWNGDNYAYLEGRCDDLGNCSPGDGAKYHGRGPIQVTGKYNYTECSQWAFDEGLAPSPTFFVDIPEELAQDDNGFHGVTWYWTTQRPMNDAADAHDIYLATYYINGGYNGIEDREQRYNHCMSMGDLLLVLLQPKDERPLVGERVLSYNPDIIPQETGYWCGPASAQMCLDMRGIYESEQTLANEMGTDDGGTDYVALIEQSLDPRLPEANYSSHDAPHDPPSAPEKEALWDALKRSIDNGYGVVMNWVAPPANYPIGIKGSPNPSYGGGTVFHYVSAAGYDDNPSQRAVWIVDSGFQPWHYWISFDQCCTLIPPKAFCYANLPHTGGGEPMSDETVWVYEQLCGPINPATGYGSGWPQLGQNEHGQNLYYVDALGRTLAMLEGEPFTDIQAQPTAAPRNPLDYSALSLDQLAGPVTADGERHGWPQLGNRSVTDTQAHINSLLSEDSGEPPLQKAHQEQSSLVGNRLSSRLRERGQRQEPDSPAT
Other Proteins in cluster: phalp2_3109
| Total (incl. this protein): 6 | Avg length: 307,5 | Avg pI: 5,09 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 42vZA | 549 | 4,38703 |
| 6IggY | 454 | 4,34332 |
| A0A386KFM4 | 210 | 5,51591 |
| A0A2D1GCZ8 | 210 | 5,49988 |
| A0A0K0N611 | 210 | 5,30805 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_28047
725I0
|
68 | 47,1% | 443 | 5.708E-126 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Gordonia phage Dalilpop [NCBI] |
2998886 | Zierdtviridae > Gruunavirus > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
OP867020
[NCBI]
CDS location
range 38988 -> 39626
strand +
strand +
CDS
GTGAAGATGGCCGCTGGCGACGTCGTACTGAAGTTCGACCACGTGATCATTCCGCAGGAGACGTACTACTGGTGCGGGCCTGCGACGATGCAGGTGTTGCTGTCGATCCGCGGCATCAAGGTCACCGAGAAGTACATGGCCGACCAGCTCGGGACCACCGAGAACGGCACCGATACCATCCTCTACCTCACCCGCGAGCTGAACGAGCGCCTCGGAGACATCTACCGGACCGTGCAGGTCCCGGGAGCCGGGAACCTCGACCAGTTCCGCGAGCACGTCTTCCATTCGATTGACGCGGGGTACGGCGTCGGCGGCAACATCATGGTGCCCCCGGCCAACTACCCGCGGCCACAGCGCGGAGAGCGCGCGCAGTACAGCGGCGGGTGGATCTACCACTACTGGTCGATCGTCGGGAAGAACGAACGCCTCGACCAGATGGCCATCGCCGACAGCGGGTTCCCCGACTTCTACTACTGGGTCACCAGCGAACAGGCGCTCTCGATGATCTCCGGTAAGGGGTACACCTGGGCCGCAAACGCCAAGGTCGCCGACGACTTCCTCGGGTCCCTGTCCGACGCCGACCAGCGTCGCGTGCTCGCCGCCGCGATCCAGGTCTCCGAGCCCCACCGAGGAGCATGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0001897 | symbiont-mediated cytolysis of host cell | biological process | None (UniProt) |
| GO:0006508 | proteolysis | biological process | None (UniProt) |
| GO:0008233 | peptidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(42vZA)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50