Protein

Protein accession
A0A649V4E1 [UniProt]
Representative
7lGFm
Source
UniProt (cluster: phalp2_22258)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTLFGIDVSNHQGEFDFARAKSEGIAFATHKITEGTGYRDPYWPRAKAEMARHFPGRWGGYVFCRTNTDPAAEARLLWQHAGGDVPLQIDYEDTTNGGSLADLNRRIDAYRQVGFTTLLPIYLPRWYWQSRMRGVDLSDLPVPIWNSHYVNGRNTPAKLYPGDTHAGWADMGGKDVQILQFSETASVAGQAIDVNAFRGSGGEFNRMFGGTTMAKADDELGKLFPSRSKYRTTDKPIDTLAGFVLNIDARIHEEYIEREALKGTQWCIDLVKREAAEGDEGAKAVLAQIEGKK
Physico‐chemical
properties
protein length:293 AA
molecular weight:32684,3 Da
isoelectric point:6,01
hydropathy:-0,51
Representative Protein Details
Accession
7lGFm
Protein name
7lGFm
Sequence length
316 AA
Molecular weight
34940,88960 Da
Isoelectric point
4,68589
Sequence
MTEFGIDVSNHQNTFDFAVAAAEGMSFATHKICEGTWLDPLWPRAREQMATHFQFWGGYIYCRLDTAPALEADAALAYLGDTDVPIQIDYEDLKGAPSIEDLLARVDALTARGFQLLPIYLPRWYWRDHMGSPDLIELPVPIWNSSYVNGTGTPAQLYPGHDHPGWKPMGGKDIAILQFSATASIGGQLIDVNAIRGGRDQLARLFWKELDMQLTDMITNKDGNEVTVAELLASIDLHASWVVDQLAGPDSRRQPGPDLDPTGWPQLDGKSIVDSVAAAHDKIDTVTTVLGQIQDKIQTVLEHLDTDPSTGRHRKQ
Other Proteins in cluster: phalp2_22258
Total (incl. this protein): 11 Avg length: 304,8 Avg pI: 5,35

Protein ID Length (AA) pI
7lGFm 316 4,68589
2YtSN 315 6,18575
4Fx1i 330 4,52071
6FCgS 293 5,98420
79IU0 305 4,93376
7hbtr 305 5,53529
7k0Ct 305 5,67017
7lGsb 316 4,70498
7wJph 282 4,61921
A0A7G8LLF4 293 6,01109
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14099
7owJl
20 61,7% 212 9.852E-103
2 phalp2_20034
1pbFu
55 43,2% 208 8.591E-75
3 phalp2_36898
7vYe1
51 39,6% 212 3.017E-55
4 phalp2_25713
4FbaX
14 38,5% 210 2.672E-54
5 phalp2_25256
7Xuqq
13 33,5% 283 2.819E-47
6 phalp2_8437
Ypuv
26 36,6% 213 2.170E-45
7 phalp2_32493
1fUkE
14 36,3% 209 4.209E-43
8 phalp2_4989
6SNir
5 33,7% 222 3.201E-41
9 phalp2_14866
3EeU
5 26,2% 320 1.102E-40
10 phalp2_18920
24XdO
30 32,3% 229 2.785E-40

Domains

Domains [InterPro]
GH25
Unannotated
Representative sequence (used for alignment): 7lGFm (316 AA)
Member sequence: A0A649V4E1 (293 AA)
1 316 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Gordonia phage Wocket
[NCBI]
2656543 Jujuvirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MN585963 [NCBI]
CDS location
range 19228 -> 20109
strand +
CDS
ATGACGCTGTTTGGCATCGACGTCTCGAACCACCAGGGCGAGTTCGACTTCGCGCGCGCGAAGTCCGAAGGGATCGCGTTCGCCACGCACAAGATCACCGAGGGCACCGGATACCGCGATCCGTACTGGCCGCGCGCGAAAGCCGAGATGGCCCGACACTTTCCCGGCCGGTGGGGCGGATACGTGTTCTGCCGCACCAACACCGACCCGGCGGCCGAGGCGCGGCTGCTGTGGCAGCACGCAGGCGGTGACGTGCCCCTGCAGATCGACTACGAGGACACCACCAACGGCGGCAGCCTCGCCGACCTGAACCGCCGGATCGACGCCTACCGCCAGGTCGGGTTCACCACACTGCTGCCGATCTACCTGCCGCGCTGGTACTGGCAATCCCGGATGCGCGGCGTCGACCTATCGGATCTGCCGGTCCCGATCTGGAACAGCCATTACGTCAACGGCCGGAACACTCCCGCCAAGCTCTACCCCGGAGACACACACGCCGGGTGGGCCGACATGGGCGGCAAGGACGTGCAGATCCTGCAGTTCTCAGAGACCGCGTCGGTGGCAGGCCAAGCCATCGACGTCAACGCCTTCCGAGGTTCGGGAGGCGAATTCAACCGCATGTTTGGAGGCACCACCATGGCCAAAGCCGACGACGAACTGGGCAAGTTGTTCCCGTCACGCTCGAAGTACCGCACGACCGACAAGCCGATCGACACCCTCGCCGGGTTCGTGCTGAATATCGACGCCCGGATCCACGAGGAGTACATCGAGCGCGAGGCGCTCAAGGGCACCCAGTGGTGCATCGACCTCGTCAAGCGCGAGGCCGCCGAAGGTGACGAAGGCGCCAAAGCCGTTCTCGCACAGATCGAAGGGAAGAAGTGA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016052 carbohydrate catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7lGFm) rather than this protein.
PDB ID
7lGFm
Method AlphaFoldv2
Resolution 93.56
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50