Protein

Protein accession
Q7Y5M4 [UniProt]
Representative
54ivU
Source
UniProt (cluster: phalp2_34581)
Protein name
Gp46
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIMLNKYFKRNEFACRCGCGTSTVDAELLQVVTDVREYFGLPVVITSGHRCSDHNRRVGGAASSMHMTGKAADIKVKGKDASAIASYLEHKYPDKYGIGRYNSFTHIDVRDGKARWRG
Physico-chemical properties
protein length: 118 AA
molecular weight: 13161,9 Da
isoelectric point: 9,41
hydropathy: -0,44
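
The length, molecular weight, and hydropathy listed above can be approximated from the sequence alone. The following is a minimal sketch, not the database's own pipeline: it assumes average residue masses and the Kyte-Doolittle hydropathy scale (the page does not say which scales were used), and the names `molecular_weight` and `gravy` are hypothetical. Note that the page prints decimal commas, while the code uses decimal points.

```python
# Average residue masses in Da (assumed scale; one water is added for the free termini).
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "Q": 128.1307, "E": 129.1155, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.01528

# Kyte-Doolittle hydropathy values (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of Q7Y5M4 as listed on this page.
SEQ = (
    "MIMLNKYFKRNEFACRCGCGTSTVDAELLQVVTDVREYFGLPVVITSGHRCSDHNRRVGG"
    "AASSMHMTGKAADIKVKGKDASAIASYLEHKYPDKYGIGRYNSFTHIDVRDGKARWRG"
)


def molecular_weight(seq: str) -> float:
    """Sum of average residue masses plus one water molecule."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


def gravy(seq: str) -> float:
    """Grand average of hydropathy: mean Kyte-Doolittle value per residue."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


print(len(SEQ), "AA")
print(round(molecular_weight(SEQ), 1), "Da")
print(round(gravy(SEQ), 2))
```

Under these assumptions the results should land close to the listed values. The isoelectric point is left out because it depends on the chosen pKa set, which the page does not specify.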
Representative Protein Details
Accession
54ivU
Protein name
54ivU
Sequence length
202 AA
Molecular weight
21059,00140 Da
Isoelectric point
8,25474
Sequence
MSKTSVPKAATVVATPAATPAAAAEAAATATVEVVAPVAEAAAPVVTAVGAVAVVAPAPVAEVATPASPVVPVVPAAPVTPPAITKKVAYTPIHFKRSEFACPCCGVAEVSDELLQVLDDVREHFNSPVTVTSGYRCEEHNLKVDGKPGSKHKDGIAADIQVSRIPPRTVQKYLLSKYPEQYGLGCYRTFTHVDVRPTKARW
Other Proteins in cluster: phalp2_34581
Total (incl. this protein): 71; Avg length: 115,9 AA; Avg pI: 9,18

Protein ID Length (AA) pI
54ivU 202 8,25474
A0A1Q1PUZ7 114 9,54043
I7FWL0 115 9,48448
K4FBR6 114 9,50710
A0A0U3A7A5 117 9,48338
A0A140XFU9 116 9,50749
A0A5J6T8C3 121 9,07929
A0A0K1LK41 114 9,27663
A0A2H4YEB0 114 9,34619
Q6UGC4 114 9,42420
A0A2I4Q1Q8 117 9,36186
A0A088FWP5 116 9,65719
A0A2D0VKR9 117 9,38822
A0A2H4N091 114 9,47713
A0A2K9VAY0 114 9,47758
A0A2I7QNI0 117 8,69461
A0A2I7QRN6 113 8,79202
A0A2I7QT07 113 9,03126
A0A2I7QYT6 113 9,09831
A0A2I7R2I7 115 8,70705
A0A2I7R2R3 115 7,72404
A0A2I7R7C4 113 9,09831
A0A2I7RG18 113 9,09831
A0A2I7RHX2 115 8,70705
A0A2I7RYZ6 115 8,70705
A0A2I7S323 115 8,70705
A0A2I7S631 113 9,03126
A0A2I7S6T9 113 9,03126
A0A2I7S6Z9 113 9,03126
A0A385IPL2 114 9,42420
A0A481XSR8 114 9,59878
A0A482MSH2 114 9,20262
A0A482MSV6 114 9,42388
A0A4P6DBM4 117 9,44347
A0A7H0XC60 114 9,50710
A0A7S9XGC3 115 9,32608
A0A7U0J6B3 115 9,32608
A0A8E7L3B0 114 9,42388
A0A976SPP0 114 9,44966
A0A9E7LH95 116 9,35431
A0A9E7LPL1 114 9,18257
A0A9E7LPP8 114 9,32253
A0A9E7M7M5 114 9,32253
A0A9E7SB36 114 9,32253
A0A9P0VE97 115 8,70699
A0A9P0VI80 115 7,72404
A0A9P0YAZ9 115 8,70705
A0A9Y0IH56 114 9,34613
A0A9Y0S4C3 114 9,34613
A0AAE6TVD0 114 9,51104
A0AAE7XRV4 123 9,02482
A0AAE7XS02 123 9,02482
A0AAE7XSY1 114 9,44934
A0AAE7XTB5 123 9,02482
A0AAE7XTF0 114 9,42388
A0AAE7Y1T4 113 9,09831
A0AAE7Y249 113 9,09831
A0AAE7Y477 113 9,09831
A0AAE8C7X1 115 8,70647
A0AAE8C830 114 9,42388
A0AAE8Y372 114 9,57621
A0AAE8Y5R9 114 9,42388
A0AAE9C1D5 114 9,32260
A0AAE9KYB9 115 9,18212
A0AAU6NT29 115 9,18212
A0AAU8BBP0 94 9,20565
A0AAV1MEA3 116 9,44347
A0AAV1MID0 115 9,18212
A0AAX4LXA0 114 9,44928
Q6UGH3 116 9,41440
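
Because the listing above uses decimal commas, the rows need a small conversion step before any numeric work. The following is a minimal sketch, assuming whitespace-separated columns as shown; `parse_row` and `summarize` are hypothetical names, and only three rows are included for brevity, so the printed means differ from the cluster-wide averages.

```python
from statistics import mean

# Three rows copied from the cluster listing (ID, length in AA, pI).
ROWS = """\
54ivU 202 8,25474
A0A1Q1PUZ7 114 9,54043
Q6UGH3 116 9,41440
"""


def parse_row(line: str) -> tuple[str, int, float]:
    """Split one row and convert the decimal-comma pI to a float."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


def summarize(rows: list[tuple[str, int, float]]) -> tuple[int, float, float]:
    """Return (member count, mean length, mean pI) for the parsed rows."""
    lengths = [row[1] for row in rows]
    pis = [row[2] for row in rows]
    return len(rows), mean(lengths), mean(pis)


rows = [parse_row(line) for line in ROWS.splitlines() if line.strip()]
count, avg_len, avg_pi = summarize(rows)
print(count, round(avg_len, 1), round(avg_pi, 2))
```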
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_18632 (8g8s) 5 36,5% 134 1.623E-23
2 phalp2_26705 (2Q6OF) 1 32,6% 150 1.621E-14
3 phalp2_39724 (8AJC5) 1 19,8% 166 4.721E-08

Domains

Domains [InterPro]
Disordered region
PET_M15
Representative sequence (used for alignment): 54ivU (202 AA)
Member sequence: Q7Y5M4 (118 AA)
Domain positions follow the representative sequence (1 to 202 AA); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF08291

Taxonomy

Phage
Name: Enterobacteria phage SP6 (Bacteriophage SP6) [NCBI]
Taxonomy ID: 2907955
Lineage: Autographiviridae > Zindervirus > Zindervirus SP6
Host
Name: Salmonella enterica subsp. enterica serovar Typhimurium [NCBI]
Taxonomy ID: 90371
Lineage: Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Salmonella > Salmonella enterica

Coding sequence (CDS)

CDS Source ID
CDS Source
AY288927 [NCBI]
CDS location
range 38976 -> 39332
strand +
CDS
GTGATAATGTTGAACAAATACTTCAAGCGTAACGAGTTCGCTTGCCGTTGTGGGTGCGGTACATCCACTGTTGACGCGGAACTGTTGCAGGTTGTCACAGATGTCCGTGAATACTTCGGGTTACCTGTAGTTATTACATCGGGTCATCGGTGCAGTGACCATAACCGCCGCGTAGGTGGTGCTGCATCTTCCATGCACATGACTGGCAAGGCTGCTGATATTAAAGTGAAAGGGAAGGACGCGAGTGCTATCGCATCCTACTTGGAACACAAGTACCCTGATAAATATGGTATCGGTCGATACAACTCCTTCACTCACATTGACGTGCGTGATGGTAAGGCTCGCTGGCGTGGATAA
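
As a consistency check, the CDS can be translated and compared with the protein sequence listed for this entry. The following is a minimal sketch under two assumptions: the standard codon assignments apply, and the initiator codon (GTG here) is read as methionine. `translate_cds` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Standard codon table, built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS and protein sequence as listed on this page.
CDS = (
    "GTGATAATGTTGAACAAATACTTCAAGCGTAACGAGTTCGCTTGCCGTTGTGGGTGCGGT"
    "ACATCCACTGTTGACGCGGAACTGTTGCAGGTTGTCACAGATGTCCGTGAATACTTCGGG"
    "TTACCTGTAGTTATTACATCGGGTCATCGGTGCAGTGACCATAACCGCCGCGTAGGTGGT"
    "GCTGCATCTTCCATGCACATGACTGGCAAGGCTGCTGATATTAAAGTGAAAGGGAAGGAC"
    "GCGAGTGCTATCGCATCCTACTTGGAACACAAGTACCCTGATAAATATGGTATCGGTCGA"
    "TACAACTCCTTCACTCACATTGACGTGCGTGATGGTAAGGCTCGCTGGCGTGGATAA"
)
PROTEIN = (
    "MIMLNKYFKRNEFACRCGCGTSTVDAELLQVVTDVREYFGLPVVITSGHRCSDHNRRVGG"
    "AASSMHMTGKAADIKVKGKDASAIASYLEHKYPDKYGIGRYNSFTHIDVRDGKARWRG"
)


def translate_cds(cds: str) -> str:
    """Translate a CDS, reading the first codon as Met and dropping one final stop."""
    usable = len(cds) - len(cds) % 3  # ignore any incomplete trailing codon
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # assumption: alternative start codons initiate with Met
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# The listed range 38976 -> 39332 is inclusive, hence the +1.
print(len(CDS) == 39332 - 38976 + 1)
print(translate_cds(CDS) == PROTEIN)
```

The CDS length also covers the stop codon, so a CDS of 3 × (118 + 1) nucleotides is what a 118 AA protein would be expected to need.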

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi00001a8d92_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
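
The confidence bands above amount to a threshold lookup on a per-residue pLDDT score. The following is a minimal sketch with a hypothetical function name, `plddt_band`. The legend leaves the exact boundary values (90, 70, 50) unassigned, so placing them in the lower band is an assumption made here.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:  # assumption: exactly 90 falls in this band
        return "High"
    if plddt > 50:  # assumption: exactly 70 falls in this band
        return "Low"
    return "Very low"  # assumption: exactly 50 falls in this band


for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```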

The structures below correspond to the cluster representative (54ivU) rather than this protein.
PDB ID
54ivU
Method AlphaFold v2
Resolution 72.25
Chain position -