Protein

Protein accession
17a9h [EnVhog]
Representative
4H4oF
Source
EnVhog (cluster: phalp2_39380)
Protein name
17a9h
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MDIIKSNVKFNGELIPLKKVDYLIIHHTQSKTASVEQIHQWHLNKKWKGIGYNFYIRKNGDIYECRGFNEGAQARGYNDKSLGIALEGNFMIEKPTEEQIKSLIDFCKFLKQKYPNVEIREHKEVNDTDCAGTNFNIDEIRYKVNDKNYYKEKLIEIRDFINEIINYGGI
Physico‐chemical
properties
protein length:170 AA
molecular weight:20014,6 Da
isoelectric point:7,68
hydropathy:-0,68
Representative Protein Details
Accession
4H4oF
Protein name
4H4oF
Sequence length
171 AA
Molecular weight
18800,88000 Da
Isoelectric point
6,14210
Sequence
MQSFDVIGPQITDISMELERGPTPYTTRGASAIQRIVIHHTATPASTTPQAIARYHVRERGWPGIGYHYVIAADGTVYQTNPPTSVSYHVRAFNPESIGIALVGDFTAEPPPAAQLRAAAELIRSIRSLLGRELPVFGHRDLNATQCPGNTWHEWRRTLTSMIEGEQDGEA
Other Proteins in cluster: phalp2_39380
Total (incl. this protein): 116 Avg length: 161,1 Avg pI: 8,24

Protein ID Length (AA) pI
4H4oF 171 6,14210
10vEA 160 8,32463
16HW1 164 9,23041
16aMY 160 6,74840
1HOlU 165 9,20152
1HQDi 165 8,37775
1Nwz9 187 10,19827
1cuOD 138 9,04577
1gHEE 151 9,25858
1hj2S 160 6,70310
1hm5O 159 5,11479
1oWLQ 166 9,67169
1zJW7 169 5,10274
2GVAu 99 6,25368
2HUiA 156 9,21171
2KJnJ 185 6,38640
2TY2L 210 5,39387
2VmWq 138 9,03043
2afZ4 151 6,84747
2tKkV 201 7,59007
2vFFG 140 9,02082
330a0 162 9,36708
330s9 152 6,61386
37Wd6 186 8,44170
37Ws5 149 9,26909
38ILq 151 9,17045
3D9L 140 9,46159
3FWsS 143 9,05550
3K4fn 143 8,78222
3NBo7 180 6,45261
3S0A7 137 6,81246
3TVE5 169 9,80630
3U2zx 186 10,24320
3XoYz 212 10,48612
3ZpuU 158 5,95266
3aXWU 205 7,08517
3duN 170 6,53656
3mXXI 200 4,92580
3ofXx 185 6,44414
401sK 147 9,56887
40ESS 148 9,60690
42imN 169 9,95806
48urN 197 6,02939
4Cad9 158 6,14830
4Hozz 180 9,58369
4USSl 177 9,79940
4VjXN 159 7,77236
4itlO 163 10,32128
4jNim 152 8,42339
4jwW7 185 6,16569
4sBWQ 152 6,45869
4tmHG 158 8,24004
4xNL8 193 6,11794
4yDS0 158 6,64933
4yDfc 161 6,78944
5Dmod 177 9,56887
5RBj3 148 9,40408
5RLOJ 147 9,44044
5Sy99 146 9,38990
5T4pZ 147 9,55481
5U3Un 144 9,62830
5UQCc 146 9,58460
5hRUU 117 9,19566
5nhGL 169 10,11794
5rJs9 144 9,25942
61rJ5 194 5,48612
63qBz 148 9,23550
6RhLB 183 9,20623
6Vqv0 143 6,74192
6e4hV 146 9,42504
6g1Ay 146 9,38990
6gSv2 146 9,33272
6nVPG 146 9,49176
6nZAO 153 7,73263
6vNt4 145 9,11191
6vk3b 152 9,53134
6vtNI 144 9,58163
7CsOm 174 9,91538
7HThb 188 7,19106
7Thjc 143 7,71535
7TlJH 143 7,10973
7VSdf 156 9,11062
7xZGi 147 9,69206
80tDS 139 9,06724
81f3R 156 6,70520
82ZmI 170 6,49393
82ppB 150 8,57328
82q14 195 8,96003
83CyK 139 9,06724
84clq 159 9,61947
85u3T 138 7,15673
89LDf 150 8,69422
8EBlu 140 8,88189
8GtvU 185 6,44414
8chQN 143 6,74192
8dsZg 137 6,79080
8fwsS 151 9,41208
8hwVU 142 9,30074
8jW1S 140 9,46159
8kyl3 174 8,38104
8niJc 169 7,82300
8pPX3 152 7,12485
8rzCy 183 6,73908
8t917 180 9,09844
8txXc 185 6,23901
9TJc 185 7,22022
BtmQ 180 5,96204
b6J2 182 8,28878
ipVw 158 8,24230
jyYE 140 9,46159
s3XF 140 9,46159
vj6W 210 8,97576
A0A088FLK9 156 9,67511
A0A3Q8I6A1 171 9,37423
W0FBY3 155 9,29971
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_25442
2RRxP
189 42,6% 136 3.012E-49
2 phalp2_12934
2XGYf
62 49,2% 128 5.658E-49
3 phalp2_37356
8qdpv
23 44,3% 142 1.456E-48
4 phalp2_35354
7vHgi
29 42,1% 121 9.879E-45
5 phalp2_19793
4W5p2
179 39,8% 128 1.009E-41
6 phalp2_28697
4ATEG
42 32,7% 162 1.383E-41
7 phalp2_22703
865Sd
18 34,2% 146 4.873E-41
8 phalp2_3586
4HvU7
4 44,1% 154 6.050E-40
9 phalp2_15837
4idZC
2 40,3% 156 2.920E-39
10 phalp2_30567
5IVuf
14 35,7% 151 7.507E-39

Domains

Domains
Representative sequence (used for alignment): 4H4oF (171 AA)
Member sequence: 17a9h (170 AA)
1 171 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4H4oF) rather than this protein.
PDB ID
4H4oF
Method AlphaFoldv2
Resolution 93.54
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50