Protein

Protein accession
A0AAE9G509 [UniProt]
Representative
rlev
Source
UniProt (cluster: phalp2_39793)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTSTHVRTNKEGLEIIGNAEGCMRNPYVCPTGYLTVGIGSRIYSDEPAVRREGLTDQEIADRWVKNIQEAEKCVNDWFHGKDMNTNQFSAMTSMVFNHGCTRLRKNKDGSPTKIYTAARKQNWTEMCNSITDWDMGGKYRGLTIRKQKEKALCLKPVQ
Physico‐chemical
properties
protein length:158 AA
molecular weight:18013,3 Da
isoelectric point:9,05
hydropathy:-0,75
Representative Protein Details
Accession
rlev
Protein name
rlev
Sequence length
193 AA
Molecular weight
21637,45950 Da
Isoelectric point
9,01482
Sequence
MSKLKKASAFGVCLVSVIVGLVYDSEDRSSGIIISENGARETGDEEGCRTNPYQCAAKEWTFGIGAATTGGANVIIGKTYTNEEIADQYAKDLRKVSKCIIDYYPYNEMNQNQIDALGSLIFNIGCQGSRFYLDRESGRFKKTQLYKAAIDKDFIRMCNTFPNYSRVNGKVHKSILKRRLRERDLCLSPVNKV
Other Proteins in cluster: phalp2_39793
Total (incl. this protein): 61 Avg length: 177,1 Avg pI: 9,12

Protein ID Length (AA) pI
rlev 193 9,01482
1cdHO 192 8,91767
6ZgpK 191 9,09670
6hF28 191 8,75095
70ISZ 178 9,54411
7OzRW 193 9,01482
7P9yb 193 9,28572
7qUw2 191 8,87338
KE3o 191 9,09670
Kbhe 191 8,87338
MYQp 191 8,87338
Ohn7 191 8,87338
VPf9 191 9,20049
oGj9 191 8,99548
A0A0M3LPP5 189 9,07085
A0A6G8RAF3 168 9,16574
A0A5B9N2W3 168 8,77300
Q1I0Z1 183 9,63739
P51728 186 9,36095
H6UK32 112 8,39851
A0A0M3LPQ1 188 9,35141
A0A4D6DXT9 158 8,75102
A0A6G8RBS1 168 9,05151
A0A0M3LPW1 188 9,35141
A0A0M3LQ49 188 9,35141
A0A140XG79 168 9,18141
A0A286KIF6 168 9,18141
A0A2Z2U6T4 168 9,18141
A0A481W402 168 9,18141
A0A5P8PNH8 168 9,18141
A0A6G8RA53 168 9,18141
A0A6G8RAM6 168 9,18141
A0A6G8RAU6 168 9,18141
A0A6G8RB12 168 9,18141
A0A6G8RBD5 168 9,18141
A0A6G8RBM6 168 9,18141
A0A6G8RBP2 168 9,18141
A0A6G8RC58 168 9,18141
A0A6G8RCI1 168 9,18141
A0A6G8RCL3 168 9,18141
A0A7S6KX68 158 9,18141
A0A7T7Z8J8 168 8,14940
Q19UR6 188 9,35141
Q19UX7 188 9,35141
R9QCI4 188 9,35141
A0A7G5CF25 181 8,98188
A0AAE9KC83 181 8,98188
A0A6M4BCE8 158 9,16574
A0A8A4VCL2 187 9,19585
A0A8S5NCB6 191 9,28611
A0A9E7U3Q6 191 9,54353
A0AA96SGW8 168 8,73097
A0AAE9GAQ7 168 9,05151
A0AAE9X0A6 183 9,54688
A0AAJ6IHV3 135 9,23434
A0AAU8GFD9 191 9,54353
A0AAU8GFP1 187 9,19585
A0AAU8GJH6 187 9,19585
A0AAX6NM08 187 9,01966
Q94MY6 179 9,08355
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24393
4fhn1
517 34,2% 190 3.857E-71
2 phalp2_24123
7oopP
467 29,5% 176 4.515E-29
3 phalp2_23830
1lpLy
242 30,0% 163 4.385E-26
4 phalp2_28113
7zmZV
50 32,0% 156 1.207E-23
5 phalp2_14481
4Lj5N
23 30,7% 153 5.734E-23
6 phalp2_33221
5jbqV
447 31,8% 157 3.717E-22
7 phalp2_17145
2TusA
116 31,4% 159 2.405E-21
8 phalp2_2632
6RhYr
14867 30,1% 156 4.481E-21
9 phalp2_40213
2kZ5j
4390 25,5% 172 6.115E-21
10 phalp2_3550
4CbTI
96 27,9% 168 8.346E-21

Domains

Domains [InterPro]
Representative sequence (used for alignment): rlev (193 AA)
Member sequence: A0AAE9G509 (158 AA)
1 193 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage vB_EcoM_PiR
[NCBI]
2924893 Rosemountvirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
OM313461 [NCBI]
CDS location
range 24999 -> 25475
strand +
CDS
ATGACAAGTACGCATGTACGTACTAATAAAGAAGGTCTTGAAATAATCGGTAATGCAGAAGGTTGTATGCGCAACCCGTATGTATGCCCTACTGGTTATTTAACAGTTGGCATTGGTAGCCGCATCTACAGCGATGAGCCCGCTGTACGGCGCGAAGGCCTGACAGACCAGGAGATTGCTGACCGCTGGGTCAAGAATATACAAGAAGCAGAAAAGTGTGTTAACGACTGGTTCCACGGTAAAGATATGAACACTAACCAGTTCAGCGCTATGACTTCAATGGTATTCAACCATGGTTGCACTAGACTCCGTAAGAACAAGGATGGTTCACCTACCAAAATATATACTGCTGCCCGTAAACAAAACTGGACTGAAATGTGTAACAGCATAACTGACTGGGATATGGGTGGTAAGTACAGGGGCTTGACAATACGTAAGCAAAAGGAAAAAGCGTTATGTCTAAAACCGGTGCAATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (rlev) rather than this protein.
PDB ID
rlev
Method AlphaFoldv2
Resolution 92.85
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50