Protein

Protein accession
A0A2H5BLQ0 [UniProt]
Representative
6OclU
Source
UniProt (cluster: phalp2_39649)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSKLVDKVLAVAKAEVGTKEKQSGGHWVNDSKYNRWYGKIPGYGQGGYGYPWCAVFVAWVADKAGAASLFPKTAGCATAVNWFKAKGRFSEYPAIGAQVFYGANGGTHTGLVYDYDDTYIYTYEGNTNDNGSPEGDGVYRKKWARRSSYVYGYGYPKYPEGIKSADPKFKDEAPKAETKPAESKPAPSKPAAPKPAPKPVSKIVALKDGVKPYARHAQVKDLQRFLVKAGYGPIPGAYTDYYGPETQKAVARFHNKNPHLRSAGKSYDPAIGKSGFKELQKEAGVK
Physico‐chemical
properties
protein length:286 AA
molecular weight:31190,8 Da
isoelectric point:9,55
hydropathy:-0,70
Representative Protein Details
Accession
6OclU
Protein name
6OclU
Sequence length
174 AA
Molecular weight
18746,39350 Da
Isoelectric point
7,03766
Sequence
VSIEKVLSVAAAEVGEHEKYSAGHWVNDSKYTRWYGTIPGYGQGGYGYPWCAVFVTWCAHKAGFASLYPKTAGCASAVNWFKNKGSFSEYPAVGAQVFFGNGGGTHTGIVYAYDADYVYTYEGNTNTSSSAEGDGVYAKKRARRDLYLYGYGYPDVSGSLSADPNAAKFGYTVA
Other Proteins in cluster: phalp2_39649
Total (incl. this protein): 48 Avg length: 240,9 Avg pI: 8,10

Protein ID Length (AA) pI
6OclU 174 7,03766
11zMy 165 9,59665
1rw2F 134 8,95081
24QXc 154 8,34571
3OQup 218 5,06063
48dXd 165 5,52267
4KxRU 238 9,17309
4Ky5T 219 8,60738
4LDNb 201 9,75724
4LFTy 217 7,69903
4LRdR 219 8,60738
4LqsL 201 9,80121
4MFha 189 9,90062
4Trkk 218 5,13685
5H4kd 172 6,50110
5imtJ 175 8,89814
5y9Zr 172 6,88408
6Ci7y 218 5,17743
6UQEC 185 6,12931
6x3hy 175 9,16839
78Il5 197 5,04778
7FjZ 162 4,83640
7heXl 233 9,49595
7p6IJ 279 6,11982
7tMvb 198 4,93495
7ulBZ 270 4,93029
7zkw0 183 9,03532
8ErZ 163 4,67054
DnMx 257 7,57257
DodW 198 5,51886
A0A385E0L2 363 9,24762
A0A2H5BLZ5 284 9,38004
A0A1J0GNX0 290 9,72565
A0A481W0D0 282 9,40576
A0A5J6TPZ0 370 9,17425
A0A5J6D8A1 288 9,60174
A0A7G4AX32 283 9,54353
K4IBM9 276 9,38326
A0A2H4PGE4 285 9,75744
A0A385DSJ0 368 9,34987
A0A411B580 370 9,17425
A0A411CP72 369 9,34110
A0A411CPI2 363 9,30899
A0A411CYF9 280 9,47983
A0AA50F148 286 9,81204
A0AA51GFJ3 285 9,81281
A0AA51GGF7 285 9,73133
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_4499
3nCxR
65 42,3% 170 2.884E-44
2 phalp2_11549
ZJ7o
17 36,4% 159 8.490E-39
3 phalp2_8114
6H1JM
136 38,0% 163 4.586E-36
4 phalp2_34753
3g9I4
5 36,8% 187 6.282E-36
5 phalp2_1959
4y5vj
188 38,0% 171 2.228E-32
6 phalp2_25538
3nJBF
256 40,0% 175 4.176E-32
7 phalp2_1568
7XJtL
188 38,9% 172 2.009E-31
8 phalp2_16018
3yVVm
206 36,0% 161 9.661E-31
9 phalp2_40071
41qFE
139 33,3% 168 1.810E-30
10 phalp2_18705
oeYw
31 39,5% 149 1.630E-29

Domains

Domains [InterPro]
Representative sequence (used for alignment): 6OclU (174 AA)
Member sequence: A0A2H5BLQ0 (286 AA)
1 174 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05257

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptomyces phage Omar
[NCBI]
2059882 Omarvirus > Omarvirus omar
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MG593802 [NCBI]
CDS location
range 21948 -> 22808
strand +
CDS
GTGAGCAAGCTCGTGGACAAGGTGCTCGCCGTCGCGAAGGCGGAGGTCGGCACGAAGGAGAAGCAGTCGGGTGGCCACTGGGTCAACGACTCCAAGTACAACCGGTGGTACGGCAAGATCCCCGGCTACGGCCAGGGCGGCTACGGCTACCCCTGGTGCGCCGTGTTCGTGGCGTGGGTGGCGGACAAGGCCGGCGCCGCGTCCCTGTTCCCGAAGACGGCCGGCTGCGCGACGGCGGTGAACTGGTTCAAGGCCAAGGGCCGCTTCAGCGAGTACCCGGCGATCGGTGCGCAGGTGTTCTACGGGGCGAACGGCGGCACCCACACGGGCCTGGTGTACGACTACGACGACACCTACATCTACACCTACGAGGGCAACACCAACGACAACGGTTCGCCCGAGGGTGACGGTGTCTACCGGAAGAAGTGGGCGCGGCGGTCCTCGTACGTGTACGGCTACGGCTACCCCAAGTACCCGGAGGGCATCAAGTCCGCGGACCCCAAGTTCAAGGACGAGGCGCCGAAGGCGGAGACCAAGCCGGCCGAGAGCAAGCCCGCTCCGTCGAAGCCGGCCGCGCCCAAGCCGGCGCCCAAGCCGGTGTCGAAGATCGTGGCGCTGAAGGACGGCGTGAAGCCGTACGCCCGGCACGCGCAGGTGAAGGACCTCCAGCGGTTCCTCGTCAAGGCGGGCTACGGCCCGATCCCCGGCGCGTACACGGACTACTACGGACCGGAGACCCAGAAGGCGGTCGCCCGGTTCCACAACAAGAACCCGCACCTGCGGTCGGCCGGGAAGTCCTACGACCCGGCGATCGGCAAGTCCGGCTTCAAGGAGCTCCAGAAGGAGGCAGGCGTCAAGTGA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000ca0dab5_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6OclU) rather than this protein.
PDB ID
6OclU
Method AlphaFoldv2
Resolution 95.29
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50