Protein

Protein accession
4zhWQ [EnVhog]
Representative
40K7Z
Source
EnVhog (cluster: phalp2_28537)
Protein name
4zhWQ
Lysin probability
99%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MPDLPCIWRTARAFKPGRTGAISAIILHSSDGHEQGDLETLTGAEVSAHYYVTRTGTLYHLVLDRDTAFHVGKADRPEHSNGRTIGIEQEHIDGQDDWPDAQITTVARLVCALRRQYGFLPVLAHAAVAVPPGRKVDPQGYPWDKLSAATKAEAGKWWKLLPKQGG
Physico‐chemical
properties
protein length:166 AA
molecular weight:18228,4 Da
isoelectric point:7,10
hydropathy:-0,41
Representative Protein Details
Accession
40K7Z
Protein name
40K7Z
Sequence length
191 AA
Molecular weight
21429,00210 Da
Isoelectric point
8,88195
Sequence
MKIRDYRKKIPFTNYSKGRCYWTPGLITLHTTEGSYEGACSWFCNSASGVSAHFVVGLSGEITQCVDLADTAYVNGTSFSSDDNRYYGKATNPIVKQRNANANSYTVGIEIAGFYDEKTKQCSMTDAQKNTVIELIDYIITEVKKKYGNSIPVDRTRICGHYEVNPVTKPNCGRGFPYEEIIKGVLNLRKR
Other Proteins in cluster: phalp2_28537
Total (incl. this protein): 71 Avg length: 159,1 Avg pI: 8,67

Protein ID Length (AA) pI
40K7Z 191 8,88195
14wKs 149 10,57264
1P8J9 150 8,79241
1Zw1v 150 8,82464
1luiC 168 9,80978
1wTiP 154 9,14441
2PaLx 179 8,33333
2Q2Yv 208 6,75568
2Qg9S 151 9,32730
2W962 143 9,45424
2XJFl 151 9,29971
2rkmT 150 9,27837
3OM44 185 7,71938
3PlJ1 166 9,12674
3UFya 150 9,27837
3UOeC 150 9,27947
3VN8z 150 8,88911
3YLG8 150 9,17316
3jydA 151 9,07652
428wy 161 7,05437
43DTu 150 9,24962
44g1x 150 9,25142
44stM 150 8,99290
4A3Bx 168 9,49531
4AiD3 151 8,43474
4DNHL 151 8,93140
4Gq1L 167 6,70299
4NrA5 165 9,21158
4USjj 133 9,89030
4bePn 170 9,78670
4lQJO 161 9,71624
4qKUp 151 9,29636
4rxn9 151 8,79976
4vUGf 167 6,04070
4vX2U 121 7,17038
4w5JJ 170 7,04959
4w5tH 166 5,88149
4zJME 134 9,24917
4zkIH 167 6,03627
4zkVX 167 6,43221
54rwW 153 9,06208
59DJ1 153 9,41427
5BedM 187 7,94671
5BgXx 228 8,34648
5C6tL 149 9,78793
5CebO 151 9,26303
5FeXy 172 8,80801
5a8nm 169 9,01141
5cSSw 151 9,60439
5kuPH 150 9,27908
5lveF 150 9,12287
5m6Qj 150 9,09734
5mXNp 158 9,67247
5mxgy 150 9,15304
5ztOJ 150 9,17316
6Fz1b 186 8,96222
6KyFD 150 9,12287
6LUVR 150 9,32891
6MAeU 113 9,54649
6U9sv 181 8,64465
8lWSX 167 9,15130
IQmr 151 8,58630
KfJG 125 9,37604
LWgo 150 9,27360
hCdp 177 5,95760
iBSm 167 7,27394
ouWs 223 5,39313
uLWs 150 9,20584
x8eY 150 9,17322
xcHU 150 9,17322
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24509
4NPVU
68 47,4% 175 4.200E-47
2 phalp2_20417
3dS44
3 42,5% 174 5.203E-46
3 phalp2_28639
8jnfC
92 35,8% 184 2.834E-37
4 phalp2_11760
45aqa
242 32,7% 168 9.952E-37
5 phalp2_30952
173qx
118 40,3% 161 4.781E-36
6 phalp2_9423
jh6D
5 32,5% 166 4.200E-29
7 phalp2_18213
4LQlA
5 33,1% 169 2.453E-27
8 phalp2_32182
6TIHm
13 38,3% 146 8.569E-27
9 phalp2_26158
8xJev
305 28,6% 164 4.090E-26
10 phalp2_16178
4A1r9
27 29,1% 192 5.590E-26

Domains

Domains
Representative sequence (used for alignment): 40K7Z (191 AA)
Member sequence: 4zhWQ (166 AA)
1 191 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (40K7Z) rather than this protein.
PDB ID
40K7Z
Method AlphaFoldv2
Resolution 97.38
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50