Protein

Protein accession
5i53k [EnVhog]
Representative
4aDQQ
Source
EnVhog (cluster: phalp2_5594)
Protein name
5i53k
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MFAPFPSRTISSPYGWRVHPERGIRQHHPAIDYRVPGGVAVHAIASGAVSAVRFDRVLGNVVEIDHGAGYVSRYAHLRDPSPLEVGDHVTADSEIGHVGKTGSGARGDHLHLELWHHGRHIDPAHQLTERP
Physico‐chemical
properties
protein length:131 AA
molecular weight:14344,8 Da
isoelectric point:6,92
hydropathy:-0,43
Representative Protein Details
Accession
4aDQQ
Protein name
4aDQQ
Sequence length
144 AA
Molecular weight
15746,82900 Da
Isoelectric point
9,72842
Sequence
MRFPFDKPVPKISSGYGWRIHPIEKIRKHHNGVDYAGALGTPVRAIADGKVIFAGASTLKFSNGEPAGGGYLVKIRHKVNGKWITSAYMHMVKGSIAVKKGQDVIEGQVIGKLGNTGESTGPHLHFEIQEGKDYVWTSNGTRYE
Other Proteins in cluster: phalp2_5594
Total (incl. this protein): 125 Avg length: 163,3 Avg pI: 8,65

Protein ID Length (AA) pI
4aDQQ 144 9,72842
1LCjf 161 10,03046
1Ldns 161 9,83518
1MCPN 156 9,82074
1MLBb 158 9,59356
1c4uM 131 6,27715
1cToF 190 6,93518
1dUWB 180 9,87096
1iDHP 119 10,22902
20xz9 156 9,82094
21GTS 190 6,63166
21llM 188 5,91492
21x2f 200 6,03661
23IlD 187 8,93334
23KGU 191 8,57979
2GENT 170 9,60606
2Zx6Y 156 9,82094
2fX2G 224 4,90187
37mlD 140 10,25713
38g1Z 145 10,11530
3TCGj 191 6,63921
3V3Kp 198 5,48072
3V3Y5 118 4,48769
3VIyP 197 8,22180
3WANO 193 6,02751
3WBMn 197 8,54459
3WFQK 188 6,01387
3WJOw 196 6,35394
3aqmO 160 6,49376
3bYKk 195 7,66510
3dRLn 182 6,99912
3drRn 142 10,27944
3e22t 189 5,80306
3h4en 187 6,35400
3mTfl 119 6,27078
3tSio 177 8,58592
3vUI6 189 9,24981
3wk7P 189 7,68136
3yVWH 190 6,99315
3yWnk 149 8,62008
40ZSs 201 8,26190
41Fpn 142 10,35602
41PSG 139 10,03529
41Yfx 143 10,47568
41o6R 184 7,00725
45aGG 139 8,22277
49ACo 156 9,82094
49AE5 164 9,65345
49SvU 139 10,36112
49rHf 157 10,07997
4AebB 139 10,28401
4BRqC 140 10,42043
4BYFp 140 10,35345
4KKyU 163 10,09699
4KT7s 156 10,14109
4Ko2k 156 10,36389
4NHBm 138 10,44273
4NHNS 142 10,39213
4NJmZ 140 10,07552
4RrmZ 156 10,20968
4Ytos 133 9,75511
4acqU 156 10,06688
4bsIX 156 9,79367
4bsf5 156 9,99397
4bt1F 161 9,74783
4kCve 139 6,57482
4l5ts 156 9,79367
4ppVA 156 9,79367
58puw 139 10,36376
5Ca4f 138 10,35854
5KAa0 196 8,27363
5PJU8 190 7,03692
5WjgF 196 8,16784
5YuID 199 8,79202
5daHn 140 10,47207
5eIn1 123 10,24778
5eZLg 156 9,79367
5gfz9 156 9,86265
5hkhi 140 10,35854
5hr4P 138 10,40354
5icUw 156 9,86265
5lAk3 144 10,51391
5mIF 187 6,56902
5xxhR 156 10,12761
60Cej 195 7,03481
62OIC 193 7,59712
64xAu 188 6,20099
67Ko9 185 5,25359
67UcD 190 6,41868
6GgCQ 145 10,00035
6IABv 108 9,92679
6Is59 141 10,04064
6KSDl 144 10,08758
6KVzY 140 10,13213
6Tly1 130 4,70402
6Ur3k 145 10,32076
6awz8 196 6,63859
6g14P 196 8,96557
6m5QE 195 7,03561
6p0oW 196 6,36008
6qX7Y 188 6,26874
6rJOJ 192 8,24236
6sWxI 195 8,24223
6uNvV 203 9,53167
6x8oB 156 9,79302
6xs5x 144 10,08758
6ycpP 156 9,85001
71zIQ 202 5,46430
7hpWU 136 7,19209
82FXF 208 7,62963
82GcE 187 9,39061
82Hto 186 7,81101
83H4l 199 8,80356
85FnY 140 9,89791
FNXV 163 9,81423
J9wg 140 9,95032
a4mz 188 6,11937
bsYm 160 9,75486
c5SU 156 9,82094
ftaf 147 5,27917
A0A6J5LW88 145 10,07185
A0A6J5MQD6 86 9,69793
A0A6J5N096 69 9,77452
A0A6J5N0U5 156 9,99474
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36747
6BM9K
381 57,6% 144 1.277E-56
2 phalp2_14848
7w0BH
74 39,2% 140 2.320E-32
3 phalp2_29748
1xSNS
1 58,6% 87 5.975E-32
4 phalp2_5756
7Czmx
45 44,4% 117 7.444E-31
5 phalp2_19030
17y4D
9 39,6% 131 4.070E-28
6 phalp2_24228
2SnS7
7 39,5% 129 7.643E-28
7 phalp2_5933
67ENK
29 32,3% 139 3.038E-25
8 phalp2_532
pOAI
4 39,0% 123 5.704E-25
9 phalp2_34915
4pbkX
159 35,0% 137 2.010E-24
10 phalp2_31190
87Snf
9 35,5% 118 1.820E-23

Domains

Domains
Representative sequence (used for alignment): 4aDQQ (144 AA)
Member sequence: 5i53k (131 AA)
1 144 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4aDQQ) rather than this protein.
PDB ID
4aDQQ
Method AlphaFoldv2
Resolution 89.16
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50