Protein

Protein accession
S5XYX4 [UniProt]
Representative
4E7ip
Source
UniProt (cluster: phalp2_34955)
Protein name
Lysin A
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MGAPMENGWPECDLSDTERLTIPGTPLSLPIREGQPHAILQAFFRDVNEFIEPANNGSGYKDEGSWTENNSVYTSNHKGGTAVDWNWNDHPLHVKDGGWGGSVLINGSQVPAMRELLAWYEGMVFWGNDWSSPVDSMHFQMGYNTFGSANFARVDSFIQRKIRADGFSTYRRGGTPRGGGFAEVPAAPVHPIKPTSGLTPEVLWRIAGGVASELPVSHFERWFDELVECQAACGVLGNIDRSAMWYAQVFHESGNLVHTEEIASGAAYEGRCEGLGNCQPGDGVRFKGRSFIQVTGRSNYTKLSGWAHGKGYVPTPDYFVIHPDQLDDDQYAMLGVTWYWTTQRPMNDAADARNLELATRYVNGGTNGLAHRREIYNRALAENANLLLTNPVEPWEELMATAVPSLSIYANPGEEDVPLAVMLAALDAHGPHEPYVERQAIEFGDADSIRRIARTANGQGRVKTPAAIKQATDAFRLIPAEFIRAAIPA
Physico‐chemical
properties
protein length:489 AA
molecular weight:53781,4 Da
isoelectric point:5,18
hydropathy:-0,39
Representative Protein Details
Accession
4E7ip
Protein name
4E7ip
Sequence length
163 AA
Molecular weight
17893,67510 Da
Isoelectric point
4,44330
Sequence
MWCAQIGEESGGLQWMEELASGQEYEGRCSDLGNCSPGDGPRFKGRGPIQITGRYNYASLSAWAFGQGLVPSPTFFTDDPTQLASDQFGFVGVNWYWTTARNMNSFADAGDILGATQAVNGGTHGLAERTARWNRCRAMGNMLLTLPEEDWQPVMDELLGIDR
Other Proteins in cluster: phalp2_34955
Total (incl. this protein): 9 Avg length: 354,9 Avg pI: 5,91

Protein ID Length (AA) pI
4E7ip 163 4,44330
2YuF5 182 7,70091
6SgPP 182 7,93981
7PUCn 217 6,11431
W8EEH2 488 5,50960
W8EEP3 489 5,32208
W8EE96 492 5,50960
W8FTG1 492 5,50960
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23279
520Od
3 50,0% 144 3.114E-53
2 phalp2_1654
8rD3A
4011 44,4% 135 3.162E-40
3 phalp2_22262
7pbAz
301 41,7% 139 2.602E-38
4 phalp2_8404
IwNf
80 43,9% 132 3.639E-35
5 phalp2_28972
62TOB
63 41,4% 140 6.941E-32
6 phalp2_23254
4Uvr8
48 33,1% 175 1.784E-31
7 phalp2_82
4Uvd5
73 35,8% 170 9.612E-29
8 phalp2_2991
1bUPz
10 48,0% 104 1.316E-28
9 phalp2_33611
oYfN
2 41,8% 117 1.627E-27
10 phalp2_13837
7g328
191 38,4% 143 4.175E-27

Domains

Domains [InterPro]
Representative sequence (used for alignment): 4E7ip (163 AA)
Member sequence: S5XYX4 (489 AA)
1 163 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00182

Taxonomy

  Name Taxonomy ID Lineage
Phage Mycobacterium phage KayaCho
[NCBI]
1340830 Julieunavirus >
Host Mycobacterium
[NCBI]
1763 Actinobacteria > Actinobacteria > Corynebacteriales > Mycobacteriaceae >
Host Mycobacterium smegmatis str. MC2 155
[NCBI]
246196 Actinobacteria > Actinobacteria > Corynebacteriales > Mycobacteriaceae > Mycobacterium >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KF024729 [NCBI]
CDS location
range 38353 -> 39822
strand +
CDS
TTGGGAGCACCGATGGAGAACGGCTGGCCCGAGTGCGACCTGTCGGACACCGAGCGCCTGACGATCCCCGGCACGCCGCTGAGTCTGCCCATCCGTGAGGGCCAGCCGCACGCCATCCTCCAAGCGTTCTTCCGCGACGTCAACGAGTTCATCGAGCCCGCCAACAACGGCAGCGGCTACAAGGACGAGGGCAGCTGGACCGAGAACAACAGCGTCTACACGTCGAACCACAAGGGCGGCACCGCCGTCGACTGGAACTGGAACGATCACCCGCTGCACGTCAAGGACGGCGGCTGGGGCGGCTCGGTGCTCATCAACGGCTCCCAGGTACCCGCGATGCGCGAGCTGCTGGCCTGGTACGAGGGCATGGTGTTCTGGGGCAACGACTGGAGCAGCCCCGTCGACTCGATGCACTTCCAGATGGGCTACAACACGTTTGGCTCGGCCAACTTCGCCAGGGTCGACAGCTTCATCCAGCGGAAGATCCGCGCCGACGGGTTCTCCACGTACCGGCGCGGCGGCACGCCTCGGGGCGGCGGGTTCGCCGAGGTGCCCGCTGCCCCGGTCCACCCGATCAAGCCGACGTCCGGCCTCACGCCTGAGGTGCTATGGCGGATCGCTGGCGGCGTGGCGTCCGAGCTGCCGGTGAGCCACTTCGAGCGATGGTTCGATGAGCTGGTCGAGTGCCAGGCGGCGTGCGGCGTGCTGGGCAACATCGACCGCTCGGCCATGTGGTACGCCCAGGTGTTCCATGAGTCGGGCAACCTGGTCCACACTGAGGAGATCGCCAGCGGCGCGGCCTACGAGGGCCGGTGCGAGGGCCTGGGCAACTGCCAGCCCGGTGACGGCGTGCGGTTCAAGGGCCGGTCGTTTATCCAGGTCACGGGCCGATCCAACTACACCAAGCTGTCCGGCTGGGCTCACGGCAAGGGCTACGTCCCGACGCCTGACTACTTCGTGATCCACCCCGATCAGCTCGATGATGATCAGTACGCCATGCTCGGCGTCACCTGGTACTGGACCACTCAGCGCCCGATGAACGACGCCGCCGACGCCCGCAACCTGGAGCTGGCCACCCGCTACGTGAACGGCGGAACCAACGGCCTGGCCCACCGCCGGGAAATCTACAACCGCGCCCTCGCTGAGAACGCGAACCTGCTGCTGACCAACCCCGTGGAGCCCTGGGAGGAACTGATGGCCACCGCCGTCCCGTCGCTGTCGATCTATGCCAACCCCGGTGAGGAGGACGTGCCGCTGGCGGTGATGCTGGCCGCGCTCGACGCTCACGGCCCGCATGAGCCCTACGTGGAGCGCCAGGCCATCGAGTTCGGTGACGCCGACTCCATCCGCCGCATCGCCCGCACCGCCAACGGTCAGGGCCGAGTGAAGACCCCGGCTGCCATCAAGCAGGCCACCGACGCCTTCCGCCTGATTCCCGCCGAGTTCATCCGCGCCGCCATCCCCGCCTAA

Gene Ontology

Description Category Evidence (source)
GO:0008233 peptidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000387f98e_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4E7ip) rather than this protein.
PDB ID
4E7ip
Method AlphaFoldv2
Resolution 93.57
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50