Protein

Protein accession
60r9g [EnVhog]
Representative
4MhZ3
Source
EnVhog (cluster: phalp2_33124)
Protein name
60r9g
Lysin probability
85%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MMKVSSHGIALIMNFEGLRTAAYKPVSSERGWTIGYGHHGLDVNEHSICTELEAERLLKSDLEKVERQITAALNADEIEVTQGMFDALCSLLFNLSGKQTKDGRQLTPIQTLISYKLWAKMKKGDKYGASLEFLDINKAGGVVLQGLTKRRQAEQKLFLS
Physico‐chemical
properties
protein length:160 AA
molecular weight:17830,4 Da
isoelectric point:8,50
hydropathy:-0,26
Representative Protein Details
Accession
4MhZ3
Protein name
4MhZ3
Sequence length
195 AA
Molecular weight
21974,63190 Da
Isoelectric point
5,18948
Sequence
MTTSDYGREKIENFEGCVLHAYKARPAEKYWTIGYGHYGPDVKEDQTITQQEADALFKQDLKSREQAVDALGIELNQWQFDALVDFAYNCGIGALQSVCKSKDLGEICKLIPNWCKDASGKVLEGLVNRRAWEVSMISPGTIKKGKCIADCLNIRRKAGMDGEICGSYHEGDEVEVLDEWIQTPRGWVASRYIQQ
Other Proteins in cluster: phalp2_33124
Total (incl. this protein): 118 Avg length: 189,2 Avg pI: 7,94

Protein ID Length (AA) pI
4MhZ3 195 5,18948
13qOH 222 8,04425
13txz 206 9,03255
1DmuH 150 5,51039
1EqkI 146 4,52407
1Io9h 158 7,79025
1eZSr 232 8,80743
1zIBC 189 8,87228
2VioU 201 6,91113
2aqyn 149 8,68017
38GLE 158 8,45221
3AGez 159 8,81549
3PBbi 195 5,64800
3WJfw 158 7,75229
3caC3 237 9,08439
3icX6 158 8,45221
3nPz8 220 7,60173
3zStR 146 4,74983
45ugM 212 9,37707
4BPmT 150 7,66146
4EDu6 208 9,00038
4EmtB 147 6,31239
4GdF2 149 8,69222
4GgHM 149 8,83960
4Gitg 149 8,89762
4L1y2 151 8,25513
4L9Dw 160 7,75718
4LBf3 206 9,97669
4MRAW 208 9,95045
4T4Ke 145 4,76779
4Tucj 210 9,68774
4XFcK 151 5,12241
4kh0Z 196 9,36914
5K6sz 160 8,83160
5LlZ1 159 8,47748
5T5FG 199 6,11431
5U2UX 159 9,01921
5WDwC 160 9,05570
5WONN 159 9,01901
5Wwts 199 7,75064
5XF0z 199 5,85415
5XJnB 160 6,90778
5YmgZ 160 9,43819
5iomz 235 8,59629
5iqAt 234 6,84156
62n5m 160 8,83160
64m4I 251 5,46350
68xYW 160 9,00187
69HSr 201 6,52690
69Ug1 160 9,32124
69gGT 199 7,72740
6ERki 160 9,15233
6IVzx 145 5,18840
6OSGn 150 4,30552
6SyWz 207 9,51684
6aRK3 159 8,78480
6dvHL 199 6,52582
6ezpZ 199 5,85785
6iU5l 159 9,03661
6icA5 160 8,50423
6k9Zs 159 9,27818
6kBsR 199 6,44181
6mAL0 159 9,01921
6nLm7 160 8,83160
6oWbV 199 6,44096
6tG0V 159 9,03655
6w9pv 160 8,39374
76MCH 202 9,28391
7WOgf 219 9,66176
7Y5Nw 201 6,96689
7YBdU 224 8,35673
7YxOk 158 7,76315
7cYQ7 208 8,57470
7dVIu 206 9,44973
7kdiD 210 8,28601
7kdis 209 8,57360
7mBI6 201 9,20230
7ofmR 201 6,58999
7ofmX 201 6,59965
7q0nL 211 6,95939
7q2Up 158 6,90806
7qGS2 145 5,89440
7qGTm 145 7,75030
7vBb5 207 9,09676
7vnHX 214 8,74631
7vob3 215 8,90439
7x2Oh 215 9,11991
7xIW5 206 8,85900
7xReK 204 6,62353
7xkDN 211 8,54691
7xkkk 206 9,25516
7yYAV 207 9,13796
7yYGR 209 9,04512
7yzci 205 9,40769
7yzh0 204 9,17000
7zIOZ 208 9,02224
7zK3w 215 9,11991
7zeD9 206 8,96086
82A4u 244 5,17874
82obE 232 4,53987
82obu 232 4,94422
82rPm 245 4,87709
889jJ 219 9,42033
8bsZ1 190 8,31998
8dlan 199 6,44096
8fuxC 201 5,97977
8lp1K 216 9,33942
8ob4o 201 7,72740
8opKj 232 5,22955
8qWQY 219 9,37166
8rKQP 160 9,32144
8rZb5 219 9,36379
blu4 165 8,60274
edMi 189 8,69435
fq7X 213 8,26589
nuv8 219 8,51751
ocp6 227 9,22254
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_40316
3fQk2
50 50,6% 144 1.680E-68
2 phalp2_34307
3A2Hz
178 43,7% 160 4.199E-58
3 phalp2_2498
5GMvl
297 37,8% 190 3.125E-55
4 phalp2_23833
1mogV
302 42,4% 146 8.035E-55
5 phalp2_28113
7zmZV
50 36,6% 218 5.313E-54
6 phalp2_17145
2TusA
116 44,3% 142 4.811E-53
7 phalp2_33221
5jbqV
447 46,2% 147 9.029E-53
8 phalp2_33932
23JqA
85 30,9% 236 5.967E-52
9 phalp2_24016
7V0GL
4 42,5% 141 6.056E-49
10 phalp2_16385
5LDDR
25 43,2% 148 5.477E-48

Domains

Domains
GH24
Unannotated
Representative sequence (used for alignment): 4MhZ3 (195 AA)
Member sequence: 60r9g (160 AA)
1 195 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4MhZ3) rather than this protein.
PDB ID
4MhZ3
Method AlphaFoldv2
Resolution 93.72
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50