Protein

Protein accession
A0AAE9VL59 [UniProt]
Representative
4aXVp
Source
UniProt (cluster: phalp2_40411)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSMIWQELIRTYAKTSIEAPALKLVTVAQWALESKYGSSALATEHNNFAGLKYRERVNRGRESHPLATPVDYIASDGEDTYCKFANFDDFIAGYWAFVKNGSMYDGWEAYGLDPIGYIGHLHRAGYAGDQRYVVRVAAMVPAVTKQLEDLGLASAFSGINAPPLTRLAILIGHNEVAKGAFSPHLSVSEWDYNQRVAREMQAAASEFSLEPRVFFRGRNRRGYATEIAIAYAAIDAWNPAAIMELHFNAGGGTGTETLYWHSSRNSKKLADAVREALLGELDLLDRGSRARRSGDRGSTSLRASAHPTILTEPFFGDNDSDCDRMLEVGETGLARAYLIGARDYFMNLT
Physico-chemical properties
protein length: 349 AA
molecular weight: 38583,9 Da
isoelectric point: 6,02
hydropathy: -0,28
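
The hydropathy value is presumably a GRAVY score, that is, the mean Kyte-Doolittle hydropathy per residue. The page does not name the scale it uses, so that choice is an assumption. A minimal Python sketch of how such a score could be recomputed from a sequence (the function name `gravy` is hypothetical):

```python
# Kyte-Doolittle hydropathy scale. ASSUMPTION: the page does not state
# which scale produces its "hydropathy" value.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy per residue."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X or B).
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    prefix = "MSMIWQELIR"  # first ten residues of the protein sequence above
    print(len(prefix), round(gravy(prefix), 2))
```

Passing the full 349-residue sequence instead of the short prefix would give a value to compare against the one reported here; agreement would only be expected if the page really uses this scale.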
Representative Protein Details
Accession
4aXVp
Protein name
4aXVp
Sequence length
239 AA
Molecular weight
25333,24010 Da
Isoelectric point
7,71302
Sequence
MNPDEKEKVSTVRWHVANKWGDSASQKAVLELCGGLLGYGPGDSKPSSPAVTPKGSLPRVAVVVGHNSKATGADAPDPIGDEFGFNNLVADKMVELASEYGIEAKRFNRSYTGSYSGEIRSAYAAVDNWNPVASIELHFNDSAPEANGTETLHSGSPKSKALAKCVQDAMLSSLGLRDRGLKEMAKTDRGGLSVHAGKAPGILVEPFFCHNSKDFKSAVSLGIDGFARMYLSGLAKYAK
Other Proteins in cluster: phalp2_40411
Total (incl. this protein): 16; Avg length: 233,4 AA; Avg pI: 7,80

Protein ID Length (AA) pI
4aXVp 239 7,71302
1Kvzv 195 7,01191
1KwVr 208 8,67746
260TM 230 9,78380
2LcMF 230 9,78348
3EPR 226 9,57731
3Izf 230 6,13090
47T5v 248 5,84375
4HGwk 190 5,76475
4IfrQ 207 8,67707
4PvWN 184 6,85242
4aFPB 245 8,98846
7TgCO 259 4,80297
88LyV 249 9,44083
8jYkG 245 9,76279
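
The table lists 15 members while the total is 16, so the summary averages appear to include this protein (349 AA, pI 6,02) as well. That reading is an assumption based on the "incl. this protein" wording. A short Python sketch that parses the decimal-comma values and recomputes both averages (`parse_decimal` and `cluster_averages` are hypothetical helper names):

```python
def parse_decimal(text: str) -> float:
    """Convert a decimal-comma number such as '7,71302' to a float."""
    return float(text.replace(",", "."))


# Rows copied from the cluster table: protein ID, length (AA), pI.
ROWS = """\
4aXVp 239 7,71302
1Kvzv 195 7,01191
1KwVr 208 8,67746
260TM 230 9,78380
2LcMF 230 9,78348
3EPR 226 9,57731
3Izf 230 6,13090
47T5v 248 5,84375
4HGwk 190 5,76475
4IfrQ 207 8,67707
4PvWN 184 6,85242
4aFPB 245 8,98846
7TgCO 259 4,80297
88LyV 249 9,44083
8jYkG 245 9,76279
""".splitlines()

# ASSUMPTION: the 16th member is this protein (A0AAE9VL59), which is
# counted in the total but not listed in the table.
THIS_PROTEIN = (349, parse_decimal("6,02"))


def cluster_averages(rows, extra):
    """Return (member count, mean length, mean pI) including `extra`."""
    lengths = [extra[0]]
    pis = [extra[1]]
    for row in rows:
        _protein_id, length, pi = row.split()
        lengths.append(int(length))
        pis.append(parse_decimal(pi))
    n = len(lengths)
    return n, sum(lengths) / n, sum(pis) / n


n, avg_len, avg_pi = cluster_averages(ROWS, THIS_PROTEIN)
print(n, round(avg_len, 1), round(avg_pi, 2))
```

Under this assumption the recomputed values agree with the summary line (16 members, average length 233,4 and average pI 7,80).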
Similar Clusters (pHMM search)
#    Cluster                 # Members   Identity (%)   Alignment Length   E-value
1    phalp2_11016 (4Prl3)    67          43,0%          179                2.276E-47
2    phalp2_34206 (2TNL2)    24          43,1%          176                7.805E-44
3    phalp2_21852 (4t88M)    271         34,5%          188                1.868E-37
4    phalp2_6329 (79LaP)     54          37,9%          166                1.868E-37
5    phalp2_31934 (7CsRL)    1           41,7%          158                2.552E-37
6    phalp2_21590 (2ns10)    22          36,9%          184                1.214E-36
7    phalp2_7941 (wIlS)      251         32,7%          165                2.744E-35
8    phalp2_30755 (CL9)      4           25,5%          243                3.319E-34
9    phalp2_1230 (jumL)      15          29,3%          194                7.592E-26
10   phalp2_36343 (kBxJ)     714         29,5%          200                1.409E-25

Domains

Domains [InterPro]
[Domain architecture diagram; labels: Unannotated, Ami3; axis: 1 to 239 AA (representative)]
Representative sequence (used for alignment): 4aXVp (239 AA)
Member sequence: A0AAE9VL59 (349 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01520

Taxonomy

        Name                                         Taxonomy ID   Lineage
Phage   Thiohalocapsa phage LS06-2018-MD04 [NCBI]    3003842       No lineage information
Host    No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
OP947166 [NCBI]
CDS location
range 21040 -> 22089
strand +
CDS
ATGTCCATGATTTGGCAAGAACTGATTCGGACCTACGCGAAGACATCCATCGAGGCCCCGGCATTGAAGCTCGTCACGGTAGCGCAATGGGCGCTGGAATCCAAGTATGGATCGAGCGCCTTAGCAACAGAGCACAACAACTTTGCGGGCCTGAAGTATCGCGAGCGGGTTAATCGCGGCCGCGAAAGTCACCCATTAGCGACCCCCGTCGATTACATCGCGAGCGATGGCGAAGATACGTATTGCAAATTCGCCAACTTCGACGATTTCATTGCCGGCTATTGGGCATTCGTCAAGAATGGTTCCATGTACGACGGCTGGGAGGCATACGGATTAGACCCGATCGGCTACATCGGGCATCTGCACCGGGCCGGTTACGCGGGCGACCAGCGCTATGTTGTAAGGGTCGCGGCAATGGTCCCTGCCGTCACCAAGCAGCTCGAGGACTTGGGCCTGGCCTCGGCCTTCAGCGGCATCAATGCGCCGCCGTTGACGCGTTTAGCGATCCTGATTGGTCACAACGAGGTTGCCAAAGGGGCGTTTTCGCCACACCTATCGGTGTCCGAATGGGACTACAACCAGCGGGTAGCGCGCGAGATGCAGGCGGCCGCATCGGAGTTCAGCCTGGAGCCGCGCGTGTTCTTCAGAGGTCGCAACAGGCGCGGCTATGCCACCGAGATTGCCATCGCCTATGCCGCCATCGACGCCTGGAATCCGGCAGCGATCATGGAGCTTCATTTCAACGCGGGCGGCGGTACGGGGACCGAAACCCTGTACTGGCATTCGTCGCGCAATAGCAAGAAACTTGCGGATGCGGTCAGGGAGGCCCTCCTGGGCGAGCTTGACCTGCTCGACCGCGGCAGCCGCGCGCGCCGTTCCGGAGACCGCGGCTCGACATCATTGCGGGCCTCCGCCCACCCCACGATCCTCACTGAGCCGTTCTTTGGCGACAACGATTCCGATTGCGACCGCATGTTGGAGGTAGGCGAGACCGGTCTCGCGCGCGCCTATCTGATCGGTGCGCGCGACTATTTCATGAATCTTACTTAA
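
The CDS range and the protein length can be cross-checked: 22089 - 21040 + 1 = 1050 nucleotides, or 350 codons, which is 349 residues plus one stop codon. The Python sketch below performs that arithmetic and translates a CDS. It assumes the standard genetic code, which the page does not state explicitly, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G).
# ASSUMPTION: the page does not say which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS and drop a single trailing stop codon, if present."""
    cds = cds.strip().upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


def expected_protein_length(start: int, end: int) -> int:
    """Residue count for an inclusive CDS range that ends in a stop codon."""
    nucleotides = end - start + 1
    return nucleotides // 3 - 1


if __name__ == "__main__":
    print(expected_protein_length(21040, 22089))    # 349
    print(translate("ATGTCCATGATTTGGCAAGAACTGATT"))  # MSMIWQELI
```

The second call translates the first 27 nucleotides of the CDS above and reproduces the first nine residues of the protein sequence.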

Gene Ontology

GO ID        Description                                    Category             Evidence (source)
GO:0004040   amidase activity                               molecular function   None (UniProt)
GO:0008745   N-acetylmuramoyl-L-alanine amidase activity    molecular function   None (UniProt)
GO:0009253   peptidoglycan catabolic process                biological process   None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
A0AAE9VL59
Method SMR
Resolution
Chain position
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
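
The confidence bands in this legend can be expressed as a small lookup. The following Python sketch is illustrative only: the function name `plddt_band` is hypothetical, and because the legend does not say which band the exact boundary values 90, 70 and 50 belong to, the sketch assigns them to the lower band.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band of the legend.

    ASSUMPTION: scores exactly equal to 90, 70 or 50 fall into the lower
    band; the legend leaves these boundary cases unspecified.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.0, 80.0, 60.0, 30.0):
        print(value, plddt_band(value))
```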

The structures below correspond to the cluster representative (4aXVp) rather than this protein.
PDB ID
4aXVp
Method AlphaFoldv2
Resolution 92.12
Chain position -