Protein

Protein accession
4QLBp [EnVhog]
Representative
t4Qh
Source
EnVhog (cluster: phalp2_28237)
Protein name
4QLBp
Lysin probability
96%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MNKEDAVLQFLIQEEGFKPRTYIPKKDGKAIGNSGLTFGVGIDIGQMNAKEFLSMGLPTEFQESLLPYVGKKGAEALAVEKELGHYTLPTDVAMDISRQKISKSTDIMKNKYKNYDELPYQQQAVGLSLLHNFGDSSLDFGTMKSVMGGDLESGIARLRNADEWKNPELFARRNREADLLQSLIQQQAPKQQAQLFNTGVR
Physico‐chemical
properties
protein length:201 AA
molecular weight:22447,3 Da
isoelectric point:6,17
hydropathy:-0,52
Representative Protein Details
Accession
t4Qh
Protein name
t4Qh
Sequence length
187 AA
Molecular weight
21445,88950 Da
Isoelectric point
9,72062
Sequence
MPLDINWKFLLEVEGRTRRGYVPKDKDGVIGQSGVTIGKGIDIGQMSILQLQKLDISDRVRAMLIPYAEKKREAAVAMLRTRPLFMEWADVEELEKAVIARELGLLIKYYNRDSKVKFQDIPTQAQTVLLSLVWNFGADLPGKLPTTWKMATRQEWVKLADYLSDFPGKQKKLDGRRKREAALLRTI
Other Proteins in cluster: phalp2_28237
Total (incl. this protein): 121 Avg length: 207,3 Avg pI: 8,13

Protein ID Length (AA) pI
t4Qh 187 9,72062
109BH 195 9,13396
147Em 209 5,30441
14drx 211 7,91048
19eqm 240 8,75476
1HOMQ 204 7,90951
1JvL 203 7,90951
1kWhC 205 9,15008
1qx7k 208 9,20023
1tApz 195 9,15375
1tpPx 170 8,63137
1tqVH 195 8,95964
1uguc 201 9,32118
20NB8 212 8,70892
21YDb 194 9,17471
23EZS 210 9,09289
23Foy 216 8,89743
25ST7 211 6,76528
2Bz9B 211 7,89959
2GIrc 211 6,92398
2K4Bs 208 6,33098
2QHw3 211 9,32169
2WyLI 203 5,15987
2eR3O 204 7,87844
2jtK1 204 7,88837
2jtbz 211 6,44323
2mHX1 211 7,89926
2mlbl 205 9,13428
2nKZC 204 7,87844
2qYOQ 204 6,31358
2qjsQ 204 6,91875
34Mx6 204 6,43528
34Qi0 211 6,44318
35Adf 204 8,75495
37RPu 221 7,52130
3JbOT 197 6,57453
3TDhZ 216 8,89749
3TFq5 215 8,70331
3TKWx 216 8,54872
3VClx 203 7,88830
3WIsE 215 8,83483
3ZBqx 210 8,92702
3ZHBZ 212 8,79099
3dVlo 207 9,19998
3gPvA 209 8,36775
3gUsf 205 8,58359
3j10E 207 6,83144
3lOEa 210 9,66486
3me0k 208 8,92882
3miGy 210 7,73535
3mqIi 163 6,73811
3ms1F 210 9,32492
3nv5n 208 9,84434
40dB6 228 8,49772
40j8A 219 8,55678
42BBK 195 9,32105
48rPY 195 9,51974
4L6eT 213 8,81916
4k9P9 219 8,96422
4kh56 214 7,64214
4qJuu 190 10,83000
4v61B 212 7,88888
4xF93 240 8,99535
4xojW 205 6,92051
4xp2D 210 9,82268
5FjWF 203 6,92023
5SZZB 298 9,67679
69Q2d 221 6,20070
6BdHt 212 5,30441
6Be41 211 5,30441
6JOwS 195 8,63085
6JVyq 210 5,97505
6K4TZ 201 9,04564
6OzQ8 205 6,91875
6V4KN 204 8,96035
6bhDO 197 7,29787
6prRt 273 8,11729
722s3 208 9,12197
722tC 204 7,63702
722tb 211 9,63378
722vU 210 9,37591
722wY 212 9,02920
72ub5 190 9,34761
76q8O 204 8,24707
7Fp1V 188 9,25697
7K9P 204 6,31682
7V9tV 211 9,32169
7WYoL 204 8,85056
7Xu9T 204 8,41424
7hhaM 212 9,20816
7hhcG 209 9,01747
7nIrv 163 7,01657
7nIte 210 9,59884
7p0cM 203 8,82168
7wFoE 214 9,09470
7wFwK 197 9,15536
7wFxi 197 9,32124
7wQIP 197 9,32156
7wQJ3 197 9,32156
86Otc 197 7,17816
8AWc 195 9,13396
8CwJ1 211 6,44323
8Fi9l 211 6,92273
8Fj7G 211 7,83512
8aJlS 221 6,30006
8iVbH 221 6,80746
8jvIn 221 6,71470
8wnUE 211 7,88869
8xlvu 208 7,88869
8xmp1 211 8,68861
f7tX 193 8,47825
f7y9 204 6,43045
gFoB 213 6,44448
lCUi 211 9,29539
lDTp 205 9,45031
n3Z7 204 6,12459
nagT 204 6,12459
ogFo 205 7,58666
q5r0 204 6,91875
sZU8 190 9,55062
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_20898
6VHSS
327 33,3% 183 2.155E-42
2 phalp2_33062
4ARSx
1 30,8% 185 2.409E-40
3 phalp2_26096
7w9ed
2 31,1% 180 9.410E-32
4 phalp2_6029
4Ttjs
1 32,7% 186 6.175E-31
5 phalp2_10635
2uHA2
48 27,2% 202 7.578E-30
6 phalp2_39384
4Ji2N
17 27,9% 193 8.315E-28
7 phalp2_28188
8EnbV
1 28,8% 184 2.908E-27
8 phalp2_40033
26aY8
2 21,5% 186 7.435E-27
9 phalp2_4098
1twmT
2 25,4% 185 1.017E-26
10 phalp2_29328
OWz8
5 30,8% 185 6.640E-26

Domains

Domains
Representative sequence (used for alignment): t4Qh (187 AA)
Member sequence: 4QLBp (201 AA)
1 187 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF16754

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (t4Qh) rather than this protein.
PDB ID
t4Qh
Method AlphaFoldv2
Resolution 90.38
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50