Protein

Protein accession: 5oyfg [EnVhog]
Representative: 5ukhE
Source: EnVhog (cluster: phalp2_22042)
Protein name: 5oyfg
Lysin probability: 99%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MDPSNTWDGFYNHAKKAGAKFPELVAAQWALESGYGQHLAGKNNFFGLKGKGGSMSTTQEFVNGKWVTIRDGFIDFPSRAACIEYLIKLWYFDYKHYKGVNSAATVEDAARMLKSEGYATDPTYVDKLLRILHEKGYLSKAKRRPIKLSSAAKYYKGLSHQLAAWNHLEDILTEEQLIEFADLYRADP
Physico-chemical properties
Protein length: 188 AA
Molecular weight: 21311,9 Da
Isoelectric point: 8,77
Hydropathy: -0,47
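The page does not state how the hydropathy value is derived. A minimal sketch, assuming it is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues), is shown below. The function name `gravy` and the constant names are hypothetical, and the result is not guaranteed to match the value reported above.

```python
# Kyte-Doolittle hydropathy scale. Assumption: the page does not name the
# scale it uses; this is the scale commonly used for GRAVY scores.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Member sequence 5oyfg, copied from the record (reported length: 188 AA).
MEMBER_SEQ = (
    "MDPSNTWDGFYNHAKKAGAKFPELVAAQWALESGYGQHLAGKNNFFGLKGKGGSMSTTQEFVNGKWVTIR"
    "DGFIDFPSRAACIEYLIKLWYFDYKHYKGVNSAATVEDAARMLKSEGYATDPTYVDKLLRILHEKGYLSK"
    "AKRRPIKLSSAAKYYKGLSHQLAAWNHLEDILTEEQLIEFADLYRADP"
)


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X or B).
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print("length:", len(MEMBER_SEQ), "AA")
    print("GRAVY (Kyte-Doolittle):", round(gravy(MEMBER_SEQ), 2))
```

A negative score indicates a sequence that is hydrophilic on average, which is the direction of the value listed for this protein.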
Representative Protein Details
Accession: 5ukhE
Protein name: 5ukhE
Sequence length: 175 AA
Molecular weight: 19758,33720 Da
Isoelectric point: 9,14737
Sequence
MSTEQAFWSQCFDIARKCGARFPELVAAQCCLESGFGKHFSGKNNVLGLKGDGSTVSTKEFYDGQWVTIRAGFIDFPSIAACIEYLVTRWYKDYRHFKGINTAPNRYAAARMLYQQKYATDPEYPAKLSKLMKQYAPESTTSAMIGPKKRPQDFGFKQGDSHLVVNDAKETMKAF
Other Proteins in cluster: phalp2_22042
Total (incl. this protein): 104
Avg length: 187,5
Avg pI: 7,10

Protein ID Length (AA) pI
5ukhE 175 9,14737
14ehN 181 5,95891
15XHg 173 6,21883
15iH5 187 8,38239
1Anrf 188 8,44215
1TJQT 183 4,94774
1UAna 211 5,03925
1UP1F 189 6,75164
1Vy8N 181 4,93240
1eaau 184 8,61615
1sfSC 188 8,52261
1udPO 185 5,11008
22g0i 195 7,62480
28Tr1 182 5,43747
28c97 186 6,29534
2GB4P 187 7,70597
2HgId 132 8,42120
2Iqkb 187 6,84060
2Kgo1 184 8,30045
2M0UV 186 7,69420
2Ph4D 173 8,94926
2QnCb 267 8,11987
2RoBA 180 8,46310
2fFfS 184 8,29233
2s5JN 190 6,15591
2s6zM 186 5,13850
2uUcJ 185 4,96093
2v5eI 185 4,88931
2vCql 154 9,26142
3IOfI 188 5,80874
48OC6 188 7,66988
4CSXS 186 6,29534
4WB4P 187 6,83503
4XZCP 186 6,58442
4XZyA 186 7,70290
4Y0Bg 184 8,29845
4g8qx 225 7,06033
4grjj 189 6,91335
4jvfm 195 6,16660
4sEkl 187 6,90227
4thmN 153 9,51503
4x8KG 186 5,58076
55ZuE 188 7,66982
562RN 188 7,66453
573CR 185 8,72098
58HvA 239 6,11476
5926Q 188 7,66453
5aTva 188 7,68647
5dGdF 186 6,58454
5iVVx 188 7,66988
5kAmK 135 6,89243
5kw6d 133 6,27925
5kyeD 187 6,74272
5kzD4 214 8,86281
5le4G 135 7,66158
5lsnz 116 7,92866
5mhZG 187 8,38239
5mhgi 216 8,44035
5qrT2 196 6,17700
5unTB 188 7,66453
5vrPR 188 7,65936
5wkIp 182 9,08335
5zaOg 215 8,62421
6G48R 185 4,94513
6IvPK 187 7,68170
6JCJw 183 5,24740
6LAou 187 7,68170
6MGqk 188 7,66431
6MKGt 187 7,66720
6MxWS 187 6,83662
6Mxkb 187 7,69182
6OBgu 183 7,62037
6ONc9 183 5,14759
6P1pq 154 8,96912
6P30t 154 9,11340
6xo4s 187 8,72091
7D4GM 200 5,41337
7DB2U 189 8,47342
7DaNQ 187 7,77230
7INSV 195 6,73891
7pa5R 199 6,92381
80HDi 154 8,75534
81Cfw 211 4,99924
82PG1 187 6,90704
82Pl8 196 5,50755
831EG 195 6,17768
8CGtP 182 5,13355
8CTpV 183 5,14759
8EojP 177 5,11854
8FsNb 183 4,93240
8aAxz 195 6,17439
8axtn 187 6,90704
8g2CG 208 4,90079
8keY7 163 7,77264
8ywNW 195 6,17439
8zztd 208 4,90079
AJtf 219 9,35431
FsyO 188 7,66453
YCcl 188 6,18979
e3GT 211 4,98907
ilQM 187 7,77230
uI45 205 8,68610
H2BCV4 314 8,65999
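The table above uses decimal commas for pI values, so the rows need a small conversion step before any numeric work. The following sketch parses rows in the `Protein ID Length pI` layout and averages them; the helper name `parse_member_row` is hypothetical, and only the first three rows are included for brevity. Reproducing the cluster-wide averages would presumably require all listed rows plus the current protein (5oyfg), since the total is stated to include it.

```python
from statistics import mean


def parse_member_row(line: str) -> tuple[str, int, float]:
    """Split one 'ID length pI' row and convert the decimal-comma pI."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


# First three rows of the cluster table, copied verbatim.
rows = [
    "5ukhE 175 9,14737",
    "14ehN 181 5,95891",
    "15XHg 173 6,21883",
]

members = [parse_member_row(row) for row in rows]
avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

if __name__ == "__main__":
    print(f"rows parsed: {len(members)}")
    print(f"average length: {avg_length:.1f} AA")
    print(f"average pI: {avg_pi:.2f}")
```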
Similar Clusters (pHMM search)
# Cluster Protein ID # Members Identity (%) Alignment Length E-value
1 phalp2_20092 1Qasw 159 50,0% 130 3.549E-72
2 phalp2_9157 6x7mc 75 60,6% 122 1.711E-61
3 phalp2_40632 7SWvA 1 40,1% 127 3.171E-32
4 phalp2_1473 1PLYJ 2 29,2% 164 6.292E-21
5 phalp2_8561 1Ibzf 44 30,5% 131 9.271E-19
6 phalp2_37 7CjdH 2 33,5% 134 2.361E-18
7 phalp2_32776 2jw1R 20 29,2% 154 6.012E-18
8 phalp2_5346 2cQBB 648 33,6% 116 2.851E-17
9 phalp2_6004 4Mdq6 19 29,1% 137 5.312E-17
10 phalp2_12353 9R5C 179 28,8% 111 1.350E-16

Domains

Annotated features: GLUCO, Disordered region
Representative sequence (used for alignment): 5ukhE (175 AA)
Member sequence: 5oyfg (188 AA)
Domain positions follow the representative sequence above (axis: 1 to 175 AA); the member sequence bar is scaled to the same axis.
Legend categories: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01832

Taxonomy

Phage: name Unknown from Metagenome; taxonomy ID UNKNOWN_ENVHOG; no lineage information
Host: no host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5ukhE) rather than this protein.
PDB ID: 5ukhE
Method: AlphaFoldv2
Resolution: 82.41
Chain position: -
Model confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
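The confidence bands above can be applied to a score with a simple threshold lookup. The sketch below is illustrative only: the function name `plddt_band` is hypothetical, the legend does not say which band the exact boundary values 90, 70, and 50 belong to (here they fall into the lower band), and treating the 82.41 listed under "Resolution" as a pLDDT-like score is an assumption, not something the page states.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band from the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    # Assumption: boundary values (90, 70, 50) are assigned to the lower band.
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # 82.41 is the value listed under "Resolution"; its use here is an assumption.
    print(plddt_band(82.41))
```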