Protein

Protein accession
E5DID2 [UniProt]
Representative
4xA7u
Source
UniProt (cluster: phalp2_21864)
Protein name
Uncharacterized protein vs.1
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKKALCAALLLVSSLGFASEHTFSNVQLDNLQYAYSFGEQFQKNGKYKDHSKRYDNNGLGYIMAALVWQESSAGLRTTGKHGHHAYGIFQNYLPTLRERVKQAGWSMTDAEIIRMAKNRKHSAAWAYIELSYWLDRHNGDMRKAIASYNAGNKWKAGNKYASEVLAKANYLKSRKMLHYTVE
Physico‐chemical
properties
protein length:182 AA
molecular weight:20793,4 Da
isoelectric point:9,71
hydropathy:-0,59
Representative Protein Details
Accession
4xA7u
Protein name
4xA7u
Sequence length
152 AA
Molecular weight
17631,37560 Da
Isoelectric point
9,97205
Sequence
MKYILISLTILLQLHSLSLSQQRVLAKANAFGKKYNLSYSLRAIILQESSAGLQRANYLSGCFGVGHIRLRTYLDRHNLSKSYKSQVKYMKLLMHNDYVNMRAMVDELHFWLKVHNGNYYKAYASYFAGQKWEKGVEYAKAIQSKINQLKGK
Other Proteins in cluster: phalp2_21864
Total (incl. this protein): 35 Avg length: 188,4 Avg pI: 9,41

Protein ID Length (AA) pI
4xA7u 152 9,97205
1Idow 190 9,56274
1LKZ2 174 9,78471
1Wjol 214 9,49266
2hM4 181 7,82235
2npMO 162 9,57757
3OAEc 213 9,60207
45OR3 154 9,60477
4hXOe 161 9,35986
4lwES 173 9,57725
4lxp7 193 5,24387
5WiwZ 218 9,56822
7PBWT 211 9,49189
7jJUA 215 9,55836
7zVjR 213 9,51903
7zYB4 215 9,73661
7zZ0j 213 9,56822
8JLnE 213 9,56822
8L4De 215 9,55836
8LhBB 215 9,67253
8s4ws 173 9,45779
lTnM 168 9,33684
A0A1D3RKJ3 182 9,64236
A0A2S1GMD1 183 9,54301
A0A2D1GLY7 182 9,63501
A0A1B0VVE1 182 9,58911
W6ASH0 182 9,64229
A0A0B4L916 182 9,57028
A0A6M5CEI4 178 9,52187
A0A7D5DSV7 178 9,52187
A0A7S9XC83 180 9,44579
A0AA46MEX4 182 9,64229
A0AA49H0E7 182 9,64229
A0AAX4NSA1 182 9,59369
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_30113
3cbsX
116 34,9% 146 4.354E-45
2 phalp2_33501
7Btnk
252 30,2% 142 5.933E-35
3 phalp2_33808
1iaup
27 36,1% 144 6.702E-33
4 phalp2_19301
8rcoV
4 28,3% 141 4.533E-29
5 phalp2_31960
7PBFx
121 34,7% 118 6.211E-29
6 phalp2_38011
5E1kA
59 30,3% 145 2.714E-27
7 phalp2_17388
4Kl1I
1 19,2% 151 1.328E-20
8 phalp2_19657
4vjVo
6 28,2% 138 3.406E-20
9 phalp2_2223
4hYd3
1 26,6% 150 1.756E-14
10 phalp2_13468
4JGMJ
2 21,4% 163 2.133E-13

Domains

Domains [InterPro]
Unannotated
Representative sequence (used for alignment): 4xA7u (152 AA)
Member sequence: E5DID2 (182 AA)
1 152 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Enterobacter phage CC31
[NCBI]
709484 Straboviridae > Karamvirus > Karamvirus cc31
Host Escherichia coli
[NCBI]
562 Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
GU323318 [NCBI]
CDS location
range 60387 -> 60935
strand -
CDS
ATGAAAAAAGCTCTATGCGCGGCGTTACTCCTAGTTTCGTCTTTAGGATTTGCGTCCGAACACACCTTCAGTAATGTCCAACTCGACAATTTGCAATATGCATATTCGTTTGGTGAACAATTTCAGAAGAATGGGAAGTATAAAGACCATAGTAAGCGGTATGACAACAATGGACTTGGTTACATTATGGCTGCACTAGTCTGGCAGGAAAGTTCTGCAGGCTTACGTACAACTGGTAAACACGGGCATCACGCTTACGGGATATTTCAAAATTATCTTCCTACACTCAGGGAAAGAGTTAAACAAGCCGGATGGTCAATGACAGACGCAGAAATCATCCGCATGGCAAAGAACCGGAAGCACTCCGCCGCTTGGGCGTATATTGAGTTAAGTTATTGGCTGGACCGTCATAACGGTGATATGAGAAAAGCCATAGCGTCATACAACGCAGGAAACAAGTGGAAAGCCGGTAACAAGTACGCCTCTGAAGTATTAGCAAAGGCGAATTACTTGAAATCTCGTAAAATGCTTCATTATACGGTGGAATAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0001ebcb9a_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4xA7u) rather than this protein.
PDB ID
4xA7u
Method AlphaFoldv2
Resolution 92.57
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50