Protein

Protein accession: A0AAE7WML6 [UniProt]
Representative: 4dLEG
Source: UniProt (cluster: phalp2_18118)
Protein name: SAR endolysin transglycosylase
Lysin probability: 100%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MTRRPSLARTLFWALYLYLVPAWTAGAQDVRTYVPAGAKTYAPLLVEEQRARWPAMPQPWTLAGLVEQESCISLVHSKCWNPRAELKTSREYGFGFGQITVAYQSNGAVRFDKFAELRQSHASLKGWTWENRYDPHYQLRAIVEMNLDLWRRIPPAATTDDHLSFMLSSYNGGVGGLLQDRRLCSNTRGCDQDRWFGNVELTSLKSKLPQPQYGGRSWYDINRGHVRNVMTVRRAKYKPFWGK
Physico-chemical properties
protein length: 243 AA
molecular weight: 28035.6 Da
isoelectric point: 9.79
hydropathy: -0.53
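
The hydropathy value above is presumably a grand average of hydropathy (GRAVY): the mean per-residue hydropathy over the whole sequence. The page does not say which scale it uses, so the sketch below assumes the Kyte-Doolittle scale; the function name `gravy` and the dictionary name are illustrative only.

```python
# Kyte-Doolittle hydropathy values per residue.
# ASSUMPTION: the page does not state which hydropathy scale it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X or B).
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


if __name__ == "__main__":
    # First 20 residues of the protein sequence listed above.
    print(round(gravy("MTRRPSLARTLFWALYLYLV"), 2))
```

Passing the full 243-residue sequence gives a value to compare with the reported hydropathy. A mismatch would suggest the page uses a different scale.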
Representative Protein Details
Accession: 4dLEG
Protein name: 4dLEG
Sequence length: 314 AA
Molecular weight: 35859.42290 Da
Isoelectric point: 9.26631
Sequence
VVRLKRFIIVLIIAFSCLFIIGILKEVYGNTYKQGSKGIEVGQIQSRLKALGYPIGYPIEVDNSFGAKTEAIVRQFQRDKGVPADGAVGPATWMLLFGKPQEIPLPKRAQEYLPSLKGEIKDIWPDMPMKSTTGGQVEQETCPSLNSSECWNPHAELKTSREYGFGLGQITIAYDSQGKERFNNFKWAVSLDQKLKSWKYEDRYNAEYQLRALIRFDQLYWRGVTWATDDLNHWAFTLASYNGGEGGIRNDRQLCKSQGGDYNNWFGNSGVSKYSWKSKIKISGYGDSFFNINRSYVTNILMMRRQKYIPSLEN
Other Proteins in cluster: phalp2_18118
Total (incl. this protein): 5; Avg length: 294.0 AA; Avg pI: 9.46

Protein ID  Length (AA)  pI
4dLEG       314          9.26631
1DjMa       318          8.39896
1jWFT       290          9.59407
3g31d       305          10.26106
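
The cluster averages can be checked from the table if the fifth member is this protein (243 AA, pI 9.79), which the table does not list. A minimal sketch under that assumption; the names `MEMBERS` and `cluster_averages` are illustrative.

```python
from statistics import mean

# (length in AA, pI) per cluster member.
# ASSUMPTION: the fifth member is this entry's own protein, A0AAE7WML6,
# with the length and pI reported in its physico-chemical properties.
MEMBERS = {
    "4dLEG": (314, 9.26631),
    "1DjMa": (318, 8.39896),
    "1jWFT": (290, 9.59407),
    "3g31d": (305, 10.26106),
    "A0AAE7WML6": (243, 9.79),
}


def cluster_averages(members):
    """Return (average length, average pI) over all cluster members."""
    avg_length = mean(length for length, _ in members.values())
    avg_pi = mean(pi for _, pi in members.values())
    return avg_length, avg_pi


if __name__ == "__main__":
    avg_length, avg_pi = cluster_averages(MEMBERS)
    print(f"{avg_length:.1f} {avg_pi:.2f}")  # prints "294.0 9.46"
```

Both results match the reported averages, which supports reading "incl. this protein" as meaning the member protein is counted even though it has no row of its own.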
Similar Clusters (pHMM search)
#  Cluster              # Members  Identity (%)  Alignment Length  E-value
1  phalp2_6881 (2g1Nw)  30         48.8          209               5.438E-92
2  phalp2_5482 (3dIDs)  497        50.0          208               1.241E-81
3  phalp2_8655 (459Vq)  3          38.1          215               1.419E-47
4  phalp2_414 (7u6ty)   1517       28.2          198               1.166E-15
5  phalp2_6350 (7rL94)  1          26.1          195               4.287E-10

Domains

Domains [InterPro]
Domain: PG_1
Pfam accessions: PF01471
Representative sequence (used for alignment): 4dLEG (314 AA)
Member sequence: A0AAE7WML6 (243 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis (1 to 314 AA).
Legend categories: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

       Name                                  Taxonomy ID  Lineage
Phage  Stenotrophomonas phage Sonora [NCBI]  2859660      Mesyanzhinovviridae >
Host   No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MZ326860 [NCBI]
CDS location: range 10537 -> 11268, strand +
CDS
ATGACGCGCCGGCCCTCGCTCGCGCGTACGCTCTTCTGGGCCCTGTACCTGTACCTCGTTCCGGCGTGGACAGCGGGGGCTCAGGACGTGCGTACCTACGTGCCGGCCGGCGCGAAGACCTACGCTCCGCTGCTCGTCGAAGAGCAGCGGGCGCGTTGGCCCGCGATGCCGCAGCCGTGGACCCTCGCCGGGCTCGTCGAGCAGGAGAGTTGTATCTCTCTGGTCCACTCCAAGTGCTGGAACCCGCGGGCGGAGCTGAAGACCTCCCGCGAGTACGGCTTCGGCTTCGGCCAGATCACCGTTGCGTACCAAAGCAACGGCGCGGTCCGGTTCGACAAGTTTGCTGAGCTGCGCCAGTCCCACGCCTCCCTCAAGGGCTGGACTTGGGAGAACCGCTACGATCCGCATTACCAGCTTCGCGCTATCGTGGAGATGAACCTCGATCTGTGGCGGCGCATTCCGCCGGCCGCGACGACGGACGACCACCTGAGCTTCATGCTTTCGAGTTACAATGGTGGCGTCGGAGGTCTGTTACAGGATCGCCGGCTCTGTTCGAACACCCGGGGCTGCGACCAAGATCGTTGGTTCGGGAACGTTGAGCTGACCAGCCTCAAATCCAAGCTCCCGCAGCCGCAATACGGCGGTCGGAGTTGGTACGACATCAACCGCGGCCACGTCCGGAACGTCATGACGGTCCGCCGGGCCAAATATAAGCCGTTCTGGGGGAAGTGA
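
The CDS range and the protein length can be cross-checked: an inclusive range of 10537 to 11268 spans 732 nt, or 244 codons, which is 243 residues plus a stop codon. The sketch below does that arithmetic and translates the first and last codons of the CDS with the standard genetic code. Using the standard code is an assumption here, as the record does not state a translation table. The helper names are illustrative.

```python
# Standard genetic code in the usual TCAG ordering.
# ASSUMPTION: the record does not name the translation table it used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; stop codons become '*'."""
    cds = cds.strip().upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


if __name__ == "__main__":
    n_nt = cds_length(10537, 11268)
    print(n_nt, n_nt // 3 - 1)               # prints "732 243"
    print(translate("ATGACGCGCCGGCCCTCG"))   # first 6 codons: prints "MTRRPS"
    print(translate("GGGAAGTGA"))            # last 3 codons: prints "GK*"
```

The translated ends agree with the start (MTRRPS) and end (GK) of the protein sequence listed above.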

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4dLEG) rather than this protein.
PDB ID: 4dLEG
Method: AlphaFold v2
Resolution: 89.83
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
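
The confidence bands above can be expressed as a small lookup. This is a sketch only: the function name `plddt_band` is illustrative, and because the legend uses strict inequalities, putting scores of exactly 90, 70, or 50 in the lower band is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    # ASSUMPTION: boundary values (exactly 90, 70, 50) fall in the lower band;
    # the legend leaves them unassigned.
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    print(plddt_band(89.83))  # prints "High"
```

If the 89.83 listed under Resolution is a mean pLDDT for the AlphaFold model, which the page does not state, it would fall in the "High" band.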