Protein
- Protein accession
- W8EE96 [UniProt]
- Representative
- 4E7ip
- Source
- UniProt (cluster: phalp2_34955)
- Protein name
- Lysin A
- Lysin probability
- 100%
- PhaLP type
- endolysin (probability: 99%, predicted by ML model)
- Protein sequence
-
MALGMTLENGWPECDLSDTERLTIPGTALSLPIRKGQPHAILQAFFRDVNEFIEPVMNARGLSDEGSWTENNSVYTSNHKGATAVDWNWSDHPLGVKNGGWDGSVLINGSQVPAMRELLTWYEGMVFWGNDWSSPVDSMHFQMGYNTCCGGDNAKRVDSFIQRKIRADGFSTFRRGGTPRGGGFAELPAAPVHPIKPTSGLTPEVLWRIAGGAASKLPVSHFERWFDELVECQAACGVLGNIDRSAMWYAQVFHESGNLVHTEEIASGAAYEGRCEGLGNCQPGDGVRFKGRSFIQVTGRSNYTKLSGWAHSKGYVPTPDYFVVHPDQLDDEQYAMLGVTWYWTTQRRMNDAADARNLELATRYVNGGTNGLDHRRAIYNRALAENANLLLTNPVEPWEELMATAVPSLSIYANPGEADVPLAVMIAALDAHGPHEPYVERQAQEFGDVDSIRRIARTANGQGRVKTPAAIKQATDAFRLIPPEFVRAAIPA
- Physico-chemical properties
- protein length: 492 AA; molecular weight: 54111.0 Da; isoelectric point: 5.51; hydropathy: -0.36
Representative Protein Details
- Accession
- 4E7ip
- Protein name
- 4E7ip
- Sequence length
- 163 AA
- Molecular weight
- 17893.67510 Da
- Isoelectric point
- 4.44330
- Sequence
-
MWCAQIGEESGGLQWMEELASGQEYEGRCSDLGNCSPGDGPRFKGRGPIQITGRYNYASLSAWAFGQGLVPSPTFFTDDPTQLASDQFGFVGVNWYWTTARNMNSFADAGDILGATQAVNGGTHGLAERTARWNRCRAMGNMLLTLPEEDWQPVMDELLGIDR
Other Proteins in cluster: phalp2_34955
Similar Clusters (pHMM search)
| # | Cluster (representative) | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_23279 (520Od) | 3 | 50.0 | 144 | 3.114E-53 |
| 2 | phalp2_1654 (8rD3A) | 4011 | 44.4 | 135 | 3.162E-40 |
| 3 | phalp2_22262 (7pbAz) | 301 | 41.7 | 139 | 2.602E-38 |
| 4 | phalp2_8404 (IwNf) | 80 | 43.9 | 132 | 3.639E-35 |
| 5 | phalp2_28972 (62TOB) | 63 | 41.4 | 140 | 6.941E-32 |
| 6 | phalp2_23254 (4Uvr8) | 48 | 33.1 | 175 | 1.784E-31 |
| 7 | phalp2_82 (4Uvd5) | 73 | 35.8 | 170 | 9.612E-29 |
| 8 | phalp2_2991 (1bUPz) | 10 | 48.0 | 104 | 1.316E-28 |
| 9 | phalp2_33611 (oYfN) | 2 | 41.8 | 117 | 1.627E-27 |
| 10 | phalp2_13837 (7g328) | 191 | 38.4 | 143 | 4.175E-27 |
Domains
Domains [InterPro]
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Mycobacterium phage 39HC [NCBI] | 1463809 | Julieunavirus |
| Host | Mycobacterium smegmatis str. MC2 155 [NCBI] | 246196 | Actinobacteria > Actinobacteria > Corynebacteriales > Mycobacteriaceae > Mycobacterium |
Coding sequence (CDS)
- CDS source ID
- KJ433973 [NCBI]
- CDS location
- range 38368 -> 39846, strand +
- CDS
-
GTGGCTTTGGGAATGACTCTGGAGAACGGCTGGCCGGAATGCGACCTGTCCGACACTGAGCGCCTGACGATCCCCGGCACCGCGCTGAGCCTGCCGATCCGCAAGGGCCAGCCGCACGCGATCTTGCAGGCGTTCTTCCGTGACGTCAACGAGTTCATCGAGCCGGTGATGAACGCCCGAGGGCTCAGCGACGAGGGCAGCTGGACCGAGAACAACAGCGTCTACACGTCGAACCACAAGGGCGCGACCGCCGTCGATTGGAACTGGAGCGACCATCCGCTGGGCGTCAAGAACGGCGGCTGGGACGGCTCGGTGCTCATCAACGGCTCCCAGGTACCCGCCATGCGCGAGCTGCTGACCTGGTACGAGGGCATGGTCTTCTGGGGCAACGACTGGAGCAGCCCCGTCGATTCGATGCACTTCCAGATGGGCTACAACACCTGCTGCGGCGGCGACAACGCCAAGCGCGTGGACAGCTTCATCCAGCGGAAGATCCGCGCCGACGGGTTCTCGACGTTCCGGCGCGGTGGCACGCCTCGGGGCGGCGGGTTCGCTGAGCTGCCCGCTGCGCCTGTCCACCCGATCAAGCCGACCTCGGGCCTCACACCGGAGGTGCTGTGGCGGATCGCTGGCGGCGCGGCGTCGAAGCTGCCGGTGAGCCACTTCGAGCGGTGGTTCGATGAGCTGGTCGAGTGCCAGGCGGCGTGCGGCGTGCTGGGCAACATCGACCGCTCGGCCATGTGGTACGCCCAGGTGTTCCACGAATCGGGGAACCTGGTCCACACCGAGGAGATCGCCAGCGGCGCGGCCTACGAGGGCCGGTGCGAGGGCCTGGGCAACTGCCAGCCCGGTGACGGTGTGCGGTTCAAGGGCCGGTCGTTTATCCAGGTCACGGGCCGATCCAACTACACCAAGCTGAGCGGCTGGGCGCACAGCAAGGGCTACGTGCCGACGCCGGACTACTTCGTGGTCCACCCCGATCAGCTCGATGACGAGCAGTACGCCATGCTCGGGGTCACCTGGTACTGGACCACTCAGCGCCGGATGAACGACGCCGCCGACGCCCGCAACCTGGAGCTGGCCACGCGGTACGTGAACGGTGGCACCAACGGCCTCGACCACCGCCGGGCCATCTACAACCGCGCTCTCGCTGAGAACGCGAACCTGCTGCTGACCAACCCCGTCGAACCCTGGGAGGAACTGATGGCCACCGCCGTCCCGTCACTGTCCATCTACGCGAACCCCGGTGAGGCGGACGTGCCGCTGGCGGTGATGATCGCCGCGCTGGACGCCCACGGCCCGCATGAGCCCTACGTCGAGCGCCAGGCCCAGGAGTTTGGCGACGTCGACTCCATCCGGCGCATCGCCCGCACGGCCAACGGCCAGGGCCGAGTGAAGACCCCGGCTGCCATCAAGCAGGCCACCGACGCCTTCCGCCTGATCCCGCCCGAGTTCGTCCGCGCCGCCATTCCCGCCTAA
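As a quick sanity check, the CDS coordinates can be related to the reported protein length. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention, which this page does not state explicitly) and that the final codon is a stop codon. The function names are illustrative and not part of any PhaLP tooling.

```python
def cds_length_nt(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Number of encoded amino acids, assuming the last codon is a stop codon."""
    n_nt = cds_length_nt(start, end)
    if n_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_nt // 3 - 1  # drop the terminal stop codon


# Values taken from this entry: range 38368 -> 39846
print(cds_length_nt(38368, 39846))            # 1479
print(expected_protein_length(38368, 39846))  # 492
```

Under these assumptions the 1479-nucleotide CDS corresponds to 493 codons, that is 492 amino acids plus the terminal TAA stop codon, which agrees with the protein length listed for W8EE96.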
Gene Ontology
| GO ID | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0008233 | peptidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative (4E7ip) rather than this protein.
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
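The confidence bands in the legend above can be applied to per-residue scores with a small helper. This is a minimal sketch: the function name is hypothetical, and the handling of the exact boundary values (90, 70, 50) is an assumption, since the legend uses strict inequalities and leaves them unspecified.

```python
def plddt_confidence(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence label used in the legend.

    Assumption: scores exactly equal to 90, 70, or 50 fall into the lower band.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Example scores (made up for illustration, not taken from the 4E7ip model)
for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_confidence(score))
```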