Protein

Protein accession: 4U2we [EnVhog]
Representative: 4gtRR
Source: EnVhog (cluster: phalp2_25638)
Protein name: 4U2we
Lysin probability: 98%
PhaLP type: endolysin (probability: 84%, predicted by ML model)
Protein sequence
MLKSLQRFFKEFGFLGGLVMFALLAVFATYPGEPLKQLGFLSTPQLTEFSGLAAFFLYAFSGLLVWRMATYFLFPKLSLSKIIERLTDSEKAVLLGKIILAIALIVLGATVHAQTPAQTAANKLPHIAIAEKYVGVTEKPRNSNRGPEVETFLKFVGLGGGYSYCAAFVSFVLDKSGACYPAVNGKLVKTAAARGFVIPGSIPAEKVLRGEYKLQGGEVVVWRRGESWQGHVGFVTKRISDASFTTIEANTSPGTKGSQANGGGIYARSRTIVPTSHFRIIAFTPVKYKCKNT
Physico-chemical properties
protein length: 293 AA
molecular weight: 31761,7 Da
isoelectric point: 9,87
hydropathy: 0,23
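
Properties like the ones above can be estimated directly from a sequence. The following is a minimal sketch, not the method used by this database: it assumes the molecular weight is the sum of standard average residue masses plus one water, and that "hydropathy" is a GRAVY score on the Kyte-Doolittle scale (the page does not state either). The function names are hypothetical.

```python
# Average residue masses in Da (free amino acid minus one water).
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528

# Kyte-Doolittle hydropathy values (assumption: the page reports a GRAVY score).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def molecular_weight(seq: str) -> float:
    """Sum of average residue masses plus one water for the free termini."""
    return sum(AVG_RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    fragment = "MLKSLQRFFK"  # first 10 residues of 4U2we, for illustration only
    print(len(fragment), round(molecular_weight(fragment), 2), round(gravy(fragment), 2))
```

Passing the full protein sequence instead of the fragment gives whole-protein estimates. Small differences from the listed values are possible, since the mass table and scale used by the database are not documented here.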
Representative Protein Details
Accession: 4gtRR
Protein name: 4gtRR
Sequence length: 209 AA
Molecular weight: 24022,41300 Da
Isoelectric point: 9,56222
Sequence
MFRKIISSLIIFLFCFNITYSKNIIPYTTIPHLDTAISYYNRGVKEKTNNNDGKEVEMFLRYVGLPKGNPWCAAFISYCEGVCTGILNKTKSALARNFITKKSISAKDVLYGRVVIKPGTLVIFQKGNTLNGHIGTVYYWDKERGQLVEGNAGDKVSFMERSIQPRNYSRIISFTIVEYDKSIQEKIDRYKQKKFDKTENSTVNESRTF
Other Proteins in cluster: phalp2_25638
Total (incl. this protein): 97; Avg length: 188,6 AA; Avg pI: 9,54

Protein ID Length (AA) pI
4gtRR 209 9,56222
13mui 254 8,37704
14N25 169 9,56326
14uDC 199 9,80102
18XZ4 128 9,96554
1HYyY 180 9,59581
1IhLQ 254 6,83349
1JSY9 182 10,06940
1KF9f 179 9,93221
1OroE 219 4,94564
1pcrg 159 10,23818
2FaBF 181 9,73339
2HGsr 186 9,57280
2Hhrd 186 9,80347
2QOUx 196 9,48074
2UdE7 242 9,56023
2YEbu 185 10,14566
2b030 180 10,11704
2fxWp 171 9,51877
2jFu1 180 9,68265
34nTy 159 11,37424
35WIh 181 9,90610
38vD3 181 10,29027
38vwP 173 9,66479
39IN 181 9,76085
3LHQ 203 9,65609
3Pr6A 206 9,86536
3dFv6 181 9,69548
3eLzm 194 9,57770
3fgS8 259 6,53833
3ia1I 169 9,47210
3n437 198 9,42052
3yTdj 182 9,86329
41ICF 185 10,22270
41Le7 185 10,29040
420Px 185 10,40863
4Cd8a 185 10,25088
4Gr4L 173 9,58698
4HUUe 187 9,68072
4O42n 150 10,33049
4Oprn 185 10,33037
4Tl69 180 9,78696
4Wm5M 175 9,60735
4Yuuo 174 10,17912
4fPxL 182 9,72552
4jrev 180 9,45811
4lESH 172 9,51781
4nlvb 185 10,33037
4pe6z 185 10,22270
4pm46 185 10,22257
4rIXW 185 10,29252
4wNyd 188 9,53283
4wSTO 181 9,67994
4zAWB 155 10,23089
54dtj 185 10,22257
55mTb 185 10,33037
5AXvd 185 10,25893
5IUSZ 183 11,54269
5IX4u 176 9,39751
5hAuO 182 10,14044
5iAn7 131 10,71492
5ij7C 185 10,29252
5kaPo 178 8,74309
5ucBg 201 9,79908
5ufMm 185 10,64117
5vRu8 154 10,19924
5vcyM 153 10,28691
5vg5o 185 10,29252
5vjfp 185 10,22257
6ALFp 225 7,00179
6ALvl 277 6,31285
6AMfp 262 5,88985
6Fz8x 173 9,87748
6HMOB 174 9,97592
6RcMd 170 9,78838
6RqdJ 181 9,59704
6Rybt 181 9,66750
6Wmbf 187 9,74370
6zeV2 150 10,43145
7I9os 229 5,04221
7VE36 220 5,13963
7WaoK 229 5,02982
7XozP 184 9,89456
80bsd 167 10,33952
81r4Q 185 10,36254
88eM1 181 9,80108
8e9ID 185 10,36254
8ebcX 167 10,25513
8rbWE 184 10,05966
8rd1y 182 10,12845
8s3Vy 162 9,40853
8t4nf 182 10,04451
GBbW 171 9,92557
hHAv 200 9,94465
v7BM 178 9,22351
wvQG 293 7,59411
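
The member table above uses decimal commas, so the rows need a small conversion before any numeric work. The sketch below parses rows in the "Protein ID, Length (AA), pI" layout and averages them. Only three rows are included as sample input, and the helper name `parse_member_row` is hypothetical.

```python
from statistics import mean


def parse_member_row(line: str) -> tuple[str, int, float]:
    """Split one 'ID length pI' row; the pI uses a decimal comma."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


# Sample rows copied from the table; extend with the remaining rows as needed.
rows = [
    "4gtRR 209 9,56222",
    "13mui 254 8,37704",
    "14N25 169 9,56326",
]

members = [parse_member_row(row) for row in rows]
avg_length = mean(m[1] for m in members)
avg_pi = mean(m[2] for m in members)

print(f"{len(members)} members, avg length {avg_length:.1f} AA, avg pI {avg_pi:.2f}")
```

Run over the full table, the same averages can be compared with the summary line. The page counts this protein in the total, so whether it is also part of the reported averages is an open assumption.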
Similar Clusters (pHMM search)
# Cluster Protein ID # Members Identity (%) Alignment Length E-value
1 phalp2_28163 fkdY 23 32,6% 190 5.802E-29
2 phalp2_40005 1XrwF 16 26,0% 165 5.151E-28
3 phalp2_9091 5CU5d 533 33,8% 139 7.036E-28
4 phalp2_5100 1J6h1 99 30,0% 183 1.239E-24
5 phalp2_29584 7Xnop 11 26,6% 195 9.589E-23
6 phalp2_13408 4tHhZ 205 27,7% 144 1.622E-19
7 phalp2_30395 4LZwv 478 28,8% 156 1.622E-19
8 phalp2_21417 87nDT 43 27,7% 180 2.209E-19
9 phalp2_16656 hF2b 74 25,9% 162 1.918E-18
10 phalp2_26940 4BHPP 11 28,7% 139 2.611E-18
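
The pHMM hits listed above can be filtered by E-value and identity once the numeric fields are parsed. This is a minimal sketch using three of the hits as sample input. The `ClusterHit` type, the helper names, and both thresholds are hypothetical choices for illustration, not values used by the database.

```python
from typing import NamedTuple


class ClusterHit(NamedTuple):
    cluster: str
    members: int
    identity_pct: float
    alignment_length: int
    e_value: float


def parse_hit(cluster: str, stats: str) -> ClusterHit:
    """Parse 'members identity% alignment_length e_value' for one cluster."""
    members, identity, aln_len, e_value = stats.split()
    identity_pct = float(identity.rstrip("%").replace(",", "."))
    return ClusterHit(cluster, int(members), identity_pct, int(aln_len), float(e_value))


def filter_hits(hits, max_e_value=1e-20, min_identity=30.0):
    """Keep hits at or below the E-value cutoff and at or above the identity cutoff."""
    return [h for h in hits if h.e_value <= max_e_value and h.identity_pct >= min_identity]


# Three hits copied from the table above.
hits = [
    parse_hit("phalp2_28163", "23 32,6% 190 5.802E-29"),
    parse_hit("phalp2_9091", "533 33,8% 139 7.036E-28"),
    parse_hit("phalp2_26940", "11 28,7% 139 2.611E-18"),
]

strong = filter_hits(hits)
for hit in strong:
    print(hit.cluster, hit.identity_pct, hit.e_value)
```

The cutoffs are arbitrary defaults and should be adjusted to the question at hand.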

Domains

[Domain architecture figure: tracks for Disordered region and Unannotated, drawn on an axis from 1 to 209 AA (representative)]
Representative sequence (used for alignment): 4gtRR (209 AA)
Member sequence: 4U2we (293 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage: Unknown from Metagenome; Taxonomy ID: UNKNOWN_ENVHOG [NCBI]; Lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4gtRR) rather than this protein.
PDB ID: 4gtRR
Method: AlphaFoldv2
Resolution: 89.48
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
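
The confidence bands above map a per-residue pLDDT score to a label. A minimal sketch of that mapping follows. The legend does not say which band the exact boundary values 90, 70, and 50 belong to, so assigning them to the lower band is an assumption, and the function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Return the confidence label for a pLDDT score in the range 0 to 100."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT out of range: {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: exact boundary values (90, 70, 50) fall into the lower band.
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```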