Protein
- Protein accession
- A0A2D1GCZ8 [UniProt]
- Representative
- 42vZA
- Source
- UniProt (cluster: phalp2_3109)
- Protein name
- Lysin A, protease C39 domain
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MAAGDVVLKFDHVIIPQETYYWCGPATMQVLLSIRGIKVTEKYMADQLGTTENGTDTILYLTRELNERLGDIYRTVQVPGAGNLDQFRKHVFHSIDAGYGVGGNIMVPPANYPRPQRGERAQYGGGWIYHYWSIVGKNERLDQMAIADSGFPDYFYWVTSEQALSMISGKGYTWAANAKVADDFLGALSEADQLRVLAAAIQVGEPHRGA
- Physico‐chemical
properties -
protein length: 210 AA molecular weight: 23283,0 Da isoelectric point: 5,50 hydropathy: -0,23
Representative Protein Details
- Accession
- 42vZA
- Protein name
- 42vZA
- Sequence length
- 549 AA
- Molecular weight
- 60894,25120 Da
- Isoelectric point
- 4,38703
- Sequence
-
MTDPVSVLADAMQWSVTYERYAELYPALSQSLRESECTTYDRIAMYCAQTGHESAGLYYMEEQDWNGDNYAYLEGRCDDLGNCSPGDGAKYHGRGPIQVTGKYNYTECSQWAFDEGLAPSPTFFVDIPEELAQDDNGFHGVTWYWTTQRPMNDAADAHDIYLATYYINGGYNGIEDREQRYNHCMSMGDLLLVLLQPKDERPLVGERVLSYNPDIIPQETGYWCGPASAQMCLDMRGIYESEQTLANEMGTDDGGTDYVALIEQSLDPRLPEANYSSHDAPHDPPSAPEKEALWDALKRSIDNGYGVVMNWVAPPANYPIGIKGSPNPSYGGGTVFHYVSAAGYDDNPSQRAVWIVDSGFQPWHYWISFDQCCTLIPPKAFCYANLPHTGGGEPMSDETVWVYEQLCGPINPATGYGSGWPQLGQNEHGQNLYYVDALGRTLAMLEGEPFTDIQAQPTAAPRNPLDYSALSLDQLAGPVTADGERHGWPQLGNRSVTDTQAHINSLLSEDSGEPPLQKAHQEQSSLVGNRLSSRLRERGQRQEPDSPAT
Other Proteins in cluster: phalp2_3109
| Total (incl. this protein): 6 | Avg length: 307,5 | Avg pI: 5,09 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 42vZA | 549 | 4,38703 |
| 6IggY | 454 | 4,34332 |
| A0A386KFM4 | 210 | 5,51591 |
| A0A0K0N611 | 210 | 5,30805 |
| A0A9E8M2V6 | 212 | 5,50670 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_28047
725I0
|
68 | 47,1% | 443 | 5.708E-126 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Gordonia phage Kabluna [NCBI] |
2041511 | Zierdtviridae > Kablunavirus > Kablunavirus kabluna |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MF919510
[NCBI]
CDS location
range 36789 -> 37421
strand +
strand +
CDS
ATGGCTGCTGGCGACGTCGTGCTGAAGTTCGACCATGTGATCATCCCGCAGGAGACGTATTACTGGTGCGGACCCGCGACGATGCAGGTTCTCCTGTCGATCCGGGGGATCAAGGTCACCGAGAAGTACATGGCCGACCAGCTCGGGACGACCGAGAACGGCACCGACACCATCCTCTACCTCACCCGCGAGCTGAACGAGCGCCTCGGCGACATCTATCGCACCGTGCAGGTCCCGGGAGCCGGGAACCTCGACCAGTTCCGCAAGCACGTCTTCCACTCGATCGACGCTGGGTACGGGGTCGGCGGCAACATCATGGTGCCCCCGGCCAACTACCCCCGGCCGCAGCGCGGAGAGCGCGCACAGTACGGCGGGGGATGGATCTACCACTACTGGTCCATCGTCGGGAAGAACGAGCGGCTCGACCAGATGGCGATCGCCGACAGCGGGTTCCCGGACTACTTCTACTGGGTCACCAGCGAGCAGGCGCTCTCGATGATCTCCGGCAAGGGATACACCTGGGCCGCGAACGCCAAGGTCGCCGACGACTTCCTCGGCGCACTGTCCGAAGCCGACCAGCTCCGCGTGCTCGCAGCCGCGATCCAGGTCGGCGAACCCCACCGAGGAGCATGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0001897 | symbiont-mediated cytolysis of host cell | biological process | None (UniProt) |
| GO:0006508 | proteolysis | biological process | None (UniProt) |
| GO:0008233 | peptidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi000c0c1786_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(42vZA)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50