Protein

Protein accession
A0A2D1GP54 [UniProt]
Representative
7tMqT
Source
UniProt (cluster: phalp2_20967)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MATKQMTAAQFLAALKAEGVPVRERKGWRTHNRNRKGKWGPANGVMIHHTASGTEGIESFVSNGNSALPGPLCHGLIEKSGTVVLVGWGRANHAGGGDPDVLAAVRAERWPVPKTNEHDGSNGSVDGNAHFIGYECVNRGNGKDPWPALQLEAIARASAAVCRWHGWTVDSVIRHLDWSDWKTDPRGVDWTKMRARIATILKGKPNATPFESWADGDGDSEPTRPTQPGKPDQPADPDRFPGSGGFGIGAKGPHVTRLGQMLVKRGGGRFYSVGPGPNWGEADRNATEAFQRAQGWTGSDADGIPGPTTWQLLVTGKGKSIPAAAPAKKPVVSLANVIAASRRDPAAPQGKASHPADVRLVEAALLKLGFLGKTYASDGAYGTTTVAAYNAFRRSIGLKGADASGDPGRQSLGTLGSRSGLFTVK
Physico‐chemical
properties
protein length:425 AA
molecular weight:44798,8 Da
isoelectric point:9,80
hydropathy:-0,49
Representative Protein Details
Accession
7tMqT
Protein name
7tMqT
Sequence length
488 AA
Molecular weight
51959,21730 Da
Isoelectric point
9,66724
Sequence
LHSLAEFLVRGQLRIVRIYGPHFPLDSFIHERPAGRLVYLEVHLSQVSDTISIAASQVGYQAERAPGERPSGHQKYSGQVPGLEWSNFQPWCAVWVAWVALKAGVASLYPRTASCSNGVQWFRDRGRFSNYPAIGAQVFYGPGGGTHTGIVYAYDANSIYTYEGNTSLSNDANGNKVMRRERRRRDAYVYGYGLPQFTEGVTTSDPALTGQAGFTYAASASGPAGNSGSTKPTSPPRTVTVRAGQTITAIAAAAGISVAALLSLNPGVKDHPDQIHPGDSITLPAVPDGGLEPQPTPEPPTTTPPPVSPPASNSGNAFPGAGQFGPGANNSSVTRLGELLVQRGAGRFYSEGPGPRWGEADRQAVKAFQMAQGWTGSSADGIPGPQTWALLVTHQGRDIPAATSTKPSAAPAFPGANRFRPGAVNQHVTKLGEQLVRKGYSRYYRQGPGPRWTEADRRAVQAFQHAQGWSGSAADGYPGPLTWSRLMR
Other Proteins in cluster: phalp2_20967
Total (incl. this protein): 4 Avg length: 427,3 Avg pI: 9,68

Protein ID Length (AA) pI
7tMqT 488 9,66724
A0A1V0E659 355 9,36089
A0AAU7GXA4 441 9,88141
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_13865
7ucym
2 38,7% 341 1.068E-48
2 phalp2_20894
6Udpv
9 32,3% 328 6.201E-20
3 phalp2_18419
68khh
16 25,5% 356 1.495E-05

Domains

Domains [InterPro]
Disordered region
Unannotated
LysM
Unannotated
Unannotated
Representative sequence (used for alignment): 7tMqT (488 AA)
Member sequence: A0A2D1GP54 (425 AA)
1 488 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01476

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptomyces phage Abt2graduatex2
[NCBI]
2041353 Woodruffvirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MF975638 [NCBI]
CDS location
range 26138 -> 27415
strand +
CDS
ATGGCCACGAAGCAGATGACCGCCGCGCAGTTCCTCGCGGCGTTGAAGGCCGAGGGCGTCCCGGTCCGCGAGCGCAAGGGCTGGCGGACCCACAACCGCAACCGCAAGGGGAAGTGGGGTCCCGCCAATGGCGTGATGATCCACCACACAGCTTCGGGCACGGAAGGGATCGAGTCGTTCGTTTCGAACGGCAACAGCGCGCTGCCGGGCCCGCTGTGCCACGGGCTGATCGAGAAATCCGGCACGGTGGTGCTGGTCGGGTGGGGCCGCGCCAACCACGCGGGCGGCGGTGACCCTGACGTCCTGGCAGCTGTCCGCGCCGAGCGCTGGCCGGTACCGAAGACCAACGAGCACGACGGCTCGAACGGATCGGTGGACGGTAACGCCCACTTCATCGGGTACGAGTGCGTGAACCGTGGGAACGGGAAGGACCCGTGGCCCGCGCTTCAGCTCGAAGCCATCGCGCGCGCCTCGGCAGCGGTCTGCCGCTGGCACGGCTGGACGGTGGACAGCGTGATCCGTCACCTGGACTGGTCCGACTGGAAGACGGACCCGCGCGGCGTGGACTGGACGAAGATGCGCGCCCGGATCGCCACCATCCTGAAGGGGAAGCCGAACGCGACCCCGTTCGAGTCGTGGGCGGACGGTGACGGGGACAGCGAGCCGACCCGCCCGACGCAGCCCGGCAAGCCGGACCAGCCGGCGGACCCTGACCGCTTCCCCGGTTCGGGCGGCTTCGGCATCGGCGCGAAGGGCCCGCACGTGACGCGCCTCGGGCAGATGCTCGTGAAGCGCGGCGGTGGCCGCTTCTACAGCGTCGGCCCCGGGCCCAACTGGGGTGAGGCGGACCGGAACGCCACCGAGGCTTTCCAACGGGCGCAGGGCTGGACCGGCAGCGACGCGGACGGCATCCCCGGCCCCACCACGTGGCAGCTGCTGGTGACCGGCAAGGGCAAGTCCATCCCCGCAGCCGCGCCCGCGAAAAAGCCGGTGGTGTCGCTGGCCAACGTCATCGCGGCATCCCGCCGCGACCCTGCCGCCCCGCAGGGGAAGGCGTCCCACCCGGCGGACGTGCGGCTGGTGGAAGCAGCCCTGCTGAAGCTCGGCTTCCTGGGGAAGACGTACGCCAGCGATGGCGCGTACGGCACGACCACGGTGGCCGCGTACAACGCGTTCCGCCGCAGCATCGGGCTGAAGGGTGCGGACGCCTCGGGCGATCCGGGCCGGCAGTCCCTCGGGACTCTGGGCAGCCGCTCGGGTCTCTTCACCGTCAAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000c0c40af_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7tMqT) rather than this protein.
PDB ID
7tMqT
Method AlphaFoldv2
Resolution 77.52
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50