Protein

Protein accession
W8EEP3 [UniProt]
Representative
4E7ip
Source
UniProt (cluster: phalp2_34955)
Protein name
Lysin A
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MGMTLENGWPECDLSDTERLTIPGTALSLPIRKGQPHAILQAFFRDVNEFIEPVMNARGLSDEGSWTENNSVYTSNHKGATAVDWNWSDHPVQIRDAGWDGSVLINGSQVPAMRELLAWYEGMVFWGNDWNSFIDSMHFQMGYNTFGAQNFDRVHSFIQRKIRADGFSTYRRGGTPRGGGFAEVPAAPLHPIKPTSGLTPEVLWRIAGGAASKLSVGHFERWFDELVECQAACGVLGNIDRSAMWYSQVFPESGNLVYTEEIASGAAYEGRCEGLGNCQPGDGVRFKGRSFMQVTGRSNYTKMSGWAHGKGYVPTPDYFVVHPDQLDDERFAFLGVTWYWTTQRPMNDAADARNLELATRYVNGGLTNLEGRRAVYNRALAENANLLLTNPVEPWEELMATAVPSLSIYANPGEPDVPLAVMLAALDAHGPHEPYVERQAIEFGDADSIRRIARTANGQGKVKTPAAIKQATDAFRLIPAEFVRAAIPA
Physico-chemical properties
protein length: 489 AA
molecular weight: 54013.0 Da
isoelectric point: 5.32
hydropathy: -0.34
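The length and hydropathy values above can be recomputed from the protein sequence. The sketch below assumes that the hydropathy figure is a GRAVY score (the mean Kyte-Doolittle hydropathy per residue); the entry does not state which scale was used, so treat that as an assumption. The function names are illustrative only.

```python
# Kyte-Doolittle hydropathy scale (assumed scale; the entry does not name one).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues of a protein sequence."""
    sequence = sequence.strip().upper()
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[residue] for residue in sequence) / len(sequence)


def protein_length(sequence: str) -> int:
    """Number of residues in the sequence."""
    return len(sequence.strip())


# Toy input: alanine (1.8) and isoleucine (4.5) average to 3.15.
print(protein_length("AI"), round(gravy("AI"), 2))
```

Passing the full protein sequence listed above to `gravy` and `protein_length` gives values to compare against the reported hydropathy and length; a mismatch in hydropathy would suggest a different scale was used.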
Representative Protein Details
Accession
4E7ip
Protein name
4E7ip
Sequence length
163 AA
Molecular weight
17893.67510 Da
Isoelectric point
4.44330
Sequence
MWCAQIGEESGGLQWMEELASGQEYEGRCSDLGNCSPGDGPRFKGRGPIQITGRYNYASLSAWAFGQGLVPSPTFFTDDPTQLASDQFGFVGVNWYWTTARNMNSFADAGDILGATQAVNGGTHGLAERTARWNRCRAMGNMLLTLPEEDWQPVMDELLGIDR
Other Proteins in cluster: phalp2_34955
Total (incl. this protein): 9; Avg length: 354.9 AA; Avg pI: 5.91

Protein ID   Length (AA)   pI
4E7ip        163           4.44330
2YuF5        182           7.70091
6SgPP        182           7.93981
7PUCn        217           6.11431
W8EEH2       488           5.50960
S5XYX4       489           5.18374
W8EE96       492           5.50960
W8FTG1       492           5.50960
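The cluster totals can be checked against the table. The table lists the eight other members, and the totals also include this protein (W8EEP3, 489 AA, pI 5.32 as reported in the physico-chemical properties). The sketch below recomputes both averages under that reading; the variable names are illustrative, and the pI of W8EEP3 is only available to two decimals.

```python
# (protein ID, length in AA, pI): this entry first, then the eight
# other members exactly as listed in the cluster table.
members = [
    ("W8EEP3", 489, 5.32),
    ("4E7ip", 163, 4.44330),
    ("2YuF5", 182, 7.70091),
    ("6SgPP", 182, 7.93981),
    ("7PUCn", 217, 6.11431),
    ("W8EEH2", 488, 5.50960),
    ("S5XYX4", 489, 5.18374),
    ("W8EE96", 492, 5.50960),
    ("W8FTG1", 492, 5.50960),
]

avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(pi for _, _, pi in members) / len(members)

print(len(members), round(avg_length, 1), round(avg_pi, 2))  # 9 354.9 5.91
```

Both averages reproduce the reported values, which supports reading "incl. this protein" as nine members in total.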
Similar Clusters (pHMM search)
#    Cluster                  # Members   Identity (%)   Alignment Length   E-value
1    phalp2_23279 (520Od)     3           50.0           144                3.114E-53
2    phalp2_1654 (8rD3A)      4011        44.4           135                3.162E-40
3    phalp2_22262 (7pbAz)     301         41.7           139                2.602E-38
4    phalp2_8404 (IwNf)       80          43.9           132                3.639E-35
5    phalp2_28972 (62TOB)     63          41.4           140                6.941E-32
6    phalp2_23254 (4Uvr8)     48          33.1           175                1.784E-31
7    phalp2_82 (4Uvd5)        73          35.8           170                9.612E-29
8    phalp2_2991 (1bUPz)      10          48.0           104                1.316E-28
9    phalp2_33611 (oYfN)      2           41.8           117                1.627E-27
10   phalp2_13837 (7g328)     191         38.4           143                4.175E-27

Domains

Domains [InterPro]
Representative sequence (used for alignment): 4E7ip (163 AA)
Member sequence: W8EEP3 (489 AA)
Domain positions follow the representative sequence above (axis: residues 1 to 163); the member sequence bar is scaled to the same axis.
Figure legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF00182

Taxonomy

        Name                                          Taxonomy ID   Lineage
Phage   Mycobacterium phage Julie1 [NCBI]             1463812       Julieunavirus > Julieunavirus julie1
Host    Mycobacterium smegmatis str. MC2 155 [NCBI]   246196        Actinobacteria > Actinobacteria > Corynebacteriales > Mycobacteriaceae > Mycobacterium >

Coding sequence (CDS)

CDS Source ID
CDS Source
KJ433976 [NCBI]
CDS location
range 38349 -> 39818
strand +
CDS
TTGGGAATGACTTTGGAGAACGGGTGGCCTGAGTGCGACCTGTCCGACACTGAGCGCCTGACGATCCCCGGCACCGCGCTGAGCCTGCCGATCCGCAAGGGCCAGCCGCACGCGATCCTGCAAGCGTTCTTCCGTGACGTCAACGAGTTCATCGAGCCGGTGATGAACGCCCGAGGGCTGAGCGACGAGGGCAGCTGGACCGAGAACAACAGCGTCTACACGTCGAACCACAAGGGCGCGACCGCCGTCGACTGGAACTGGTCCGATCACCCGGTCCAGATCCGCGACGCGGGCTGGGACGGCTCGGTGCTCATCAACGGCTCCCAGGTGCCCGCCATGCGCGAGCTGCTGGCGTGGTACGAGGGCATGGTCTTCTGGGGCAACGACTGGAACTCGTTCATCGACTCGATGCACTTCCAGATGGGCTACAACACGTTTGGCGCTCAGAACTTCGACCGGGTGCATTCGTTCATCCAGCGGAAGATCCGCGCCGACGGGTTCTCCACGTACCGACGTGGCGGCACCCCGCGCGGCGGCGGCTTCGCCGAGGTGCCCGCTGCCCCGCTCCACCCGATCAAGCCGACGTCCGGCCTCACCCCCGAGGTGCTGTGGAGGATCGCCGGGGGCGCGGCGTCGAAGCTGTCCGTCGGCCACTTCGAGCGGTGGTTCGATGAGCTGGTCGAGTGCCAGGCGGCGTGCGGCGTGCTGGGCAACATCGACCGCTCGGCCATGTGGTACTCCCAGGTGTTCCCCGAGTCGGGCAACCTGGTCTACACCGAGGAGATCGCCAGCGGCGCGGCCTACGAGGGCCGGTGCGAGGGCCTGGGCAACTGCCAGCCCGGCGACGGTGTGAGGTTCAAGGGCCGGTCGTTTATGCAGGTCACGGGCCGATCCAACTACACGAAGATGTCCGGCTGGGCTCACGGCAAGGGCTACGTGCCGACGCCCGACTACTTCGTGGTCCACCCCGATCAGCTCGATGACGAGCGGTTCGCGTTCCTCGGCGTCACCTGGTACTGGACCACTCAGCGCCCGATGAACGACGCCGCCGACGCCCGCAACCTGGAGCTGGCCACGCGCTACGTGAACGGCGGGCTGACCAACCTGGAGGGCCGTCGGGCCGTCTACAACCGCGCTCTCGCTGAGAACGCGAACCTGCTGCTCACCAACCCCGTGGAACCCTGGGAGGAACTGATGGCCACCGCCGTCCCGTCGCTGTCGATCTATGCCAACCCCGGCGAGCCGGACGTGCCGCTGGCGGTGATGCTGGCCGCGCTGGACGCTCACGGCCCGCATGAGCCCTACGTGGAGCGTCAGGCCATCGAGTTCGGTGACGCCGACTCCATCCGCCGCATCGCCCGCACGGCCAACGGTCAGGGCAAGGTGAAGACCCCGGCGGCTATCAAGCAGGCCACCGACGCCTTCCGCCTGATCCCCGCCGAGTTCGTCCGCGCCGCCATCCCCGCCTAA
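The CDS coordinates, the nucleotide sequence and the protein sequence can be cross-checked. The sketch below is a minimal illustration using the standard genetic code. It assumes that the first codon is read as methionine, as is usual for alternative start codons such as TTG in bacterial and phage genes, and that the coordinates are 1-based and inclusive. The helper names `cds_length` and `translate_cds` are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def translate_cds(cds: str) -> str:
    """Translate a complete CDS, dropping a terminal stop codon if present."""
    cds = cds.strip().upper()
    if not cds or len(cds) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    protein = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    protein[0] = "M"  # assumption: the start codon (here TTG) initiates with Met
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


n_nt = cds_length(38349, 39818)
print(n_nt, n_nt // 3 - 1)  # 1470 nt, 489 residues once the stop codon is removed
print(translate_cds("TTGGGAATGACTTTGGAGAACGGG"))  # MGMTLENG
```

The range 38349 -> 39818 spans 1470 nt, i.e. 490 codons, which agrees with the 489 AA protein plus a stop codon. The first eight codons of the CDS translate to MGMTLENG, the start of the protein sequence listed for this entry.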

Gene Ontology

GO term      Description          Category             Evidence (source)
GO:0008233   peptidase activity   molecular function   None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4E7ip) rather than this protein.
PDB ID
4E7ip
Method AlphaFoldv2
Resolution 93.57
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
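The confidence bands above map a pLDDT score to a label. The sketch below encodes that mapping. The legend uses strict inequalities, so scores of exactly 90, 70 or 50 are not assigned; placing them in the lower band is an assumption made here. The function name `plddt_band` is illustrative. The value 93.57 is the figure reported under "Resolution"; for an AlphaFold model this is plausibly a mean pLDDT, but the entry does not say so.

```python
def plddt_band(score: float) -> str:
    """Return the model-confidence label for a pLDDT score in the range 0 to 100."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"  # assumption: exactly 90 falls in this band
    if score > 50:
        return "Low"  # assumption: exactly 70 falls in this band
    return "Very low"  # assumption: exactly 50 falls in this band


print(plddt_band(93.57))  # Very high
```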