Protein

Protein accession
A0A166YGS6 [UniProt]
Representative
1jszc
Source
UniProt (cluster: phalp2_8504)
Protein name
Lysin A
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSQFKADRDMLSPNDSGVRAAPPSLVIVHTNEGDPNGRADDLAAYLQKPSSQASYTLIVDRTGRIVRSNDDEFVPWAAGSPANERGLHLCFVGRAVQSTAEWLAQPRQLDAAARAVADWCRRYNIPATWLTGDRMRAGARGIGGHDTTVFAWHATDHTDPGAGFPRAQFVALVNKHLTGTSQQEDDDMAFTDEDRRKLDYIYGQLRPWPQLGTNDKGEPLTLIDAVAELKANDR
Physico‐chemical
properties
protein length:234 AA
molecular weight:25771,0 Da
isoelectric point:5,49
hydropathy:-0,58
Representative Protein Details
Accession
1jszc
Protein name
1jszc
Sequence length
283 AA
Molecular weight
31684,44460 Da
Isoelectric point
5,39023
Sequence
MSNAYKCDHDILTSNDDGRRQGLIKRIYIHTFEGSDLDAVAMARYQLSPAAGGSYHMVIDLDGMTARENDDEYIPWSAGWTANRDGIHISLAGRAAFSREFWLARTKQLDKLADVVAAYCKAYGVPAIYRNAEDLKAGRWGISTHADAAIAWRETDHTDPGKGFPMDVLVDKVNALLHPTPVPPPPPAPVEPGVKYPSLIDGRPLRYSEFIRFIDEKVTRIYDDMILDPKDMNVAALYTGIQGEAYPSYVDPSKSFTLDQYVRLIDFKIDAIRKELDKQSVQS
Other Proteins in cluster: phalp2_8504
Total (incl. this protein): 7 Avg length: 273,9 Avg pI: 5,91

Protein ID Length (AA) pI
1jszc 283 5,39023
725uH 288 5,24825
726z1 274 7,10393
7cxRV 282 5,73360
7hEOZ 274 7,10342
7ubsg 282 5,27315
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_32230
7h6JD
19 50,2% 215 1.008E-79
2 phalp2_18569
7cxQw
45 43,7% 176 4.787E-53
3 phalp2_24741
6EXtT
52 35,7% 190 8.299E-35
4 phalp2_1243
oSGY
19 36,0% 183 8.543E-33
5 phalp2_31178
7Vvn5
47 37,0% 181 2.193E-30
6 phalp2_558
HtN7
28 34,6% 182 1.390E-27
7 phalp2_16608
eqCG
19 32,3% 204 8.284E-23
8 phalp2_28942
5F55F
4 30,3% 201 1.088E-17
9 phalp2_23125
4E9CJ
94 28,9% 176 2.675E-17
10 phalp2_7436
4Ksn7
7 28,4% 190 3.609E-17

Domains

Domains [InterPro]
Ami2
Unannotated
Representative sequence (used for alignment): 1jszc (283 AA)
Member sequence: A0A166YGS6 (234 AA)
1 283 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Gordonia phage McGonagall
[NCBI]
1838072 Mcgonagallvirus > Mcgonagallvirus macgonagall
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KU998255 [NCBI]
CDS location
range 3248 -> 3952
strand +
CDS
GTGAGCCAGTTCAAGGCAGACCGAGACATGTTGTCTCCCAACGATTCCGGCGTACGCGCCGCGCCGCCGTCGCTGGTCATCGTCCACACAAACGAGGGTGACCCGAACGGCCGCGCCGACGACCTGGCCGCGTACTTGCAGAAGCCGAGCAGCCAAGCGTCGTACACGCTCATCGTGGACCGCACCGGCCGCATCGTCCGCAGCAACGACGACGAGTTCGTTCCGTGGGCTGCTGGCAGCCCGGCGAACGAGCGCGGTCTGCACCTGTGTTTCGTCGGCCGCGCCGTTCAGTCCACCGCGGAATGGCTCGCGCAGCCCCGGCAGCTCGACGCCGCCGCGCGCGCCGTCGCCGACTGGTGCCGCCGGTACAACATCCCGGCGACCTGGCTCACCGGCGACCGTATGCGCGCCGGTGCGCGCGGCATCGGCGGCCACGACACCACCGTGTTCGCCTGGCACGCCACCGACCACACAGACCCCGGCGCAGGGTTCCCGCGCGCACAGTTCGTGGCGCTGGTGAACAAGCACCTGACCGGCACATCACAGCAGGAGGACGATGACATGGCATTCACCGACGAGGACCGCCGGAAGCTCGACTACATCTATGGGCAGCTTCGGCCGTGGCCGCAGCTCGGCACCAACGACAAGGGCGAGCCGCTGACGCTGATCGACGCCGTCGCCGAGCTCAAGGCCAACGACCGATGA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000203147c_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (1jszc) rather than this protein.
PDB ID
1jszc
Method AlphaFoldv2
Resolution 90.06
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50