Protein

Protein accession
W8EE96 [UniProt]
Representative
4E7ip
Source
UniProt (cluster: phalp2_34955)
Protein name
Lysin A
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MALGMTLENGWPECDLSDTERLTIPGTALSLPIRKGQPHAILQAFFRDVNEFIEPVMNARGLSDEGSWTENNSVYTSNHKGATAVDWNWSDHPLGVKNGGWDGSVLINGSQVPAMRELLTWYEGMVFWGNDWSSPVDSMHFQMGYNTCCGGDNAKRVDSFIQRKIRADGFSTFRRGGTPRGGGFAELPAAPVHPIKPTSGLTPEVLWRIAGGAASKLPVSHFERWFDELVECQAACGVLGNIDRSAMWYAQVFHESGNLVHTEEIASGAAYEGRCEGLGNCQPGDGVRFKGRSFIQVTGRSNYTKLSGWAHSKGYVPTPDYFVVHPDQLDDEQYAMLGVTWYWTTQRRMNDAADARNLELATRYVNGGTNGLDHRRAIYNRALAENANLLLTNPVEPWEELMATAVPSLSIYANPGEADVPLAVMIAALDAHGPHEPYVERQAQEFGDVDSIRRIARTANGQGRVKTPAAIKQATDAFRLIPPEFVRAAIPA
Physico‐chemical
properties
protein length:492 AA
molecular weight:54111,0 Da
isoelectric point:5,51
hydropathy:-0,36
Representative Protein Details
Accession
4E7ip
Protein name
4E7ip
Sequence length
163 AA
Molecular weight
17893,67510 Da
Isoelectric point
4,44330
Sequence
MWCAQIGEESGGLQWMEELASGQEYEGRCSDLGNCSPGDGPRFKGRGPIQITGRYNYASLSAWAFGQGLVPSPTFFTDDPTQLASDQFGFVGVNWYWTTARNMNSFADAGDILGATQAVNGGTHGLAERTARWNRCRAMGNMLLTLPEEDWQPVMDELLGIDR
Other Proteins in cluster: phalp2_34955
Total (incl. this protein): 9 Avg length: 354,9 Avg pI: 5,91

Protein ID Length (AA) pI
4E7ip 163 4,44330
2YuF5 182 7,70091
6SgPP 182 7,93981
7PUCn 217 6,11431
W8EEH2 488 5,50960
S5XYX4 489 5,18374
W8EEP3 489 5,32208
W8FTG1 492 5,50960
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23279
520Od
3 50,0% 144 3.114E-53
2 phalp2_1654
8rD3A
4011 44,4% 135 3.162E-40
3 phalp2_22262
7pbAz
301 41,7% 139 2.602E-38
4 phalp2_8404
IwNf
80 43,9% 132 3.639E-35
5 phalp2_28972
62TOB
63 41,4% 140 6.941E-32
6 phalp2_23254
4Uvr8
48 33,1% 175 1.784E-31
7 phalp2_82
4Uvd5
73 35,8% 170 9.612E-29
8 phalp2_2991
1bUPz
10 48,0% 104 1.316E-28
9 phalp2_33611
oYfN
2 41,8% 117 1.627E-27
10 phalp2_13837
7g328
191 38,4% 143 4.175E-27

Domains

Domains [InterPro]
Representative sequence (used for alignment): 4E7ip (163 AA)
Member sequence: W8EE96 (492 AA)
1 163 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00182

Taxonomy

  Name Taxonomy ID Lineage
Phage Mycobacterium phage 39HC
[NCBI]
1463809 Julieunavirus >
Host Mycobacterium smegmatis str. MC2 155
[NCBI]
246196 Actinobacteria > Actinobacteria > Corynebacteriales > Mycobacteriaceae > Mycobacterium >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KJ433973 [NCBI]
CDS location
range 38368 -> 39846
strand +
CDS
GTGGCTTTGGGAATGACTCTGGAGAACGGCTGGCCGGAATGCGACCTGTCCGACACTGAGCGCCTGACGATCCCCGGCACCGCGCTGAGCCTGCCGATCCGCAAGGGCCAGCCGCACGCGATCTTGCAGGCGTTCTTCCGTGACGTCAACGAGTTCATCGAGCCGGTGATGAACGCCCGAGGGCTCAGCGACGAGGGCAGCTGGACCGAGAACAACAGCGTCTACACGTCGAACCACAAGGGCGCGACCGCCGTCGATTGGAACTGGAGCGACCATCCGCTGGGCGTCAAGAACGGCGGCTGGGACGGCTCGGTGCTCATCAACGGCTCCCAGGTACCCGCCATGCGCGAGCTGCTGACCTGGTACGAGGGCATGGTCTTCTGGGGCAACGACTGGAGCAGCCCCGTCGATTCGATGCACTTCCAGATGGGCTACAACACCTGCTGCGGCGGCGACAACGCCAAGCGCGTGGACAGCTTCATCCAGCGGAAGATCCGCGCCGACGGGTTCTCGACGTTCCGGCGCGGTGGCACGCCTCGGGGCGGCGGGTTCGCTGAGCTGCCCGCTGCGCCTGTCCACCCGATCAAGCCGACCTCGGGCCTCACACCGGAGGTGCTGTGGCGGATCGCTGGCGGCGCGGCGTCGAAGCTGCCGGTGAGCCACTTCGAGCGGTGGTTCGATGAGCTGGTCGAGTGCCAGGCGGCGTGCGGCGTGCTGGGCAACATCGACCGCTCGGCCATGTGGTACGCCCAGGTGTTCCACGAATCGGGGAACCTGGTCCACACCGAGGAGATCGCCAGCGGCGCGGCCTACGAGGGCCGGTGCGAGGGCCTGGGCAACTGCCAGCCCGGTGACGGTGTGCGGTTCAAGGGCCGGTCGTTTATCCAGGTCACGGGCCGATCCAACTACACCAAGCTGAGCGGCTGGGCGCACAGCAAGGGCTACGTGCCGACGCCGGACTACTTCGTGGTCCACCCCGATCAGCTCGATGACGAGCAGTACGCCATGCTCGGGGTCACCTGGTACTGGACCACTCAGCGCCGGATGAACGACGCCGCCGACGCCCGCAACCTGGAGCTGGCCACGCGGTACGTGAACGGTGGCACCAACGGCCTCGACCACCGCCGGGCCATCTACAACCGCGCTCTCGCTGAGAACGCGAACCTGCTGCTGACCAACCCCGTCGAACCCTGGGAGGAACTGATGGCCACCGCCGTCCCGTCACTGTCCATCTACGCGAACCCCGGTGAGGCGGACGTGCCGCTGGCGGTGATGATCGCCGCGCTGGACGCCCACGGCCCGCATGAGCCCTACGTCGAGCGCCAGGCCCAGGAGTTTGGCGACGTCGACTCCATCCGGCGCATCGCCCGCACGGCCAACGGCCAGGGCCGAGTGAAGACCCCGGCTGCCATCAAGCAGGCCACCGACGCCTTCCGCCTGATCCCGCCCGAGTTCGTCCGCGCCGCCATTCCCGCCTAA

Gene Ontology

Description Category Evidence (source)
GO:0008233 peptidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4E7ip) rather than this protein.
PDB ID
4E7ip
Method AlphaFoldv2
Resolution 93.57
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50