Protein

Protein accession
1KPYm [EnVhog]
Representative
4fsrt
Source
EnVhog (cluster: phalp2_2213)
Protein name
1KPYm
Lysin probability
99%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MREILARRLSSTGPHVIELQTLLSKTGHYHGGVVGVFNADLEEAVKAFQSDNGLVDDGEVIYQGGVTWPKLVKAARQVPVGSYKRPPFATVLPQKMKSRGTYRGGWPRGAIVHFTAGRDGAEKTINGGIENGYTFWCIQRDGRLFTAHDADQWGYHAGSSKWPGLDGTVSDELLGIEINAAGRLTKLSDGRLQSWFKTIVPKDEARQVPGKRYPGELAGYYEKYTDPQESTLVNTLLWLKAQRPDQFSFDLVLGHDEVAPGRKNDPGGALSMTMPEFRAHLKSEWAKREGSK
Physico-chemical properties
Protein length: 292 AA
Molecular weight: 32297,1 Da
Isoelectric point: 8,98
Hydropathy: -0,54
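The length and hydropathy values listed here can be recomputed from the protein sequence. The following is a minimal Python sketch, assuming the hydropathy figure is a GRAVY score (the mean Kyte-Doolittle hydropathy per residue). The page does not state which scale it uses, and the names `gravy` and `SEQ_1KPYM` are illustrative only.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (assumed definition)."""
    if not sequence:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence contains a non-standard residue.
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# Sequence of 1KPYm, copied from the "Protein sequence" field of this entry.
SEQ_1KPYM = (
    "MREILARRLSSTGPHVIELQTLLSKTGHYHGGVVGVFNADLEEAVKAFQS"
    "DNGLVDDGEVIYQGGVTWPKLVKAARQVPVGSYKRPPFATVLPQKMKSRG"
    "TYRGGWPRGAIVHFTAGRDGAEKTINGGIENGYTFWCIQRDGRLFTAHDA"
    "DQWGYHAGSSKWPGLDGTVSDELLGIEINAAGRLTKLSDGRLQSWFKTIV"
    "PKDEARQVPGKRYPGELAGYYEKYTDPQESTLVNTLLWLKAQRPDQFSFD"
    "LVLGHDEVAPGRKNDPGGALSMTMPEFRAHLKSEWAKREGSK"
)

print("length:", len(SEQ_1KPYM), "AA")
print("hydropathy (GRAVY, assumed scale):", round(gravy(SEQ_1KPYM), 2))
```

If the database uses a different hydropathy scale, the second figure may differ from the value reported in this entry.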
Representative Protein Details
Accession
4fsrt
Protein name
4fsrt
Sequence length
203 AA
Molecular weight
23261,08620 Da
Isoelectric point
5,95584
Sequence
MDTPFGLYPNAIFLKDLPRLKTFGNYKLDFPEGAIVHYTAGRQDQSGIMAIQHALENNYCYFFINESGMVYQQFNINRWGSHAGLSKCPVTCRSNVSKYYVGIEIACAGLLNKGKTWYNLRVPDHKIRVIDNKEYEAFTPEQEESLFDLSMWLCLNGANPNLFFGHNEVTPRKIDPAGSLSISMEEFRIRLKENIDGYNSSDY
Other Proteins in cluster: phalp2_2213
Total (incl. this protein): 110; Avg length: 251,9 AA; Avg pI: 8,39

Protein ID Length (AA) pI
4fsrt 203 5,95584
16T1o 256 7,19124
1748p 266 6,53924
18Y0s 243 7,81661
1DXjt 277 5,94265
1IXhm 277 9,05086
1L76r 223 9,02250
1L9yP 251 8,30696
1LpYX 245 9,15885
1MvUt 249 6,97241
1QpaT 203 8,83689
1QsMx 200 9,29906
1QtUM 277 9,33439
1XtiU 243 9,03281
1j3po 304 9,18560
1lqsr 215 8,53389
1ltGk 272 9,32060
1oY4s 292 7,58229
1pyMF 243 5,93418
1q8KA 259 9,71398
26lK0 262 9,16445
28NyS 235 7,10899
2N34S 261 9,08297
2N36I 266 8,80266
2Rpps 221 6,17137
2Ud8O 222 9,24659
2ZTn1 205 5,62640
2ckR3 290 8,64761
2dBb3 281 8,76959
2iADy 243 6,86475
2iwu1 243 5,64692
2jJap 232 9,82216
2jW5k 308 8,20936
2lfDr 225 9,24098
2nnIi 232 9,82216
2tttK 300 9,06504
3FXwX 261 9,27005
3G8n2 259 9,09573
3QiSi 229 6,05934
3SdAq 210 7,04118
3bHOC 291 7,67539
3zmR7 262 6,44016
3zuW4 221 9,65686
4AOx8 290 9,14569
4AdPm 234 8,93901
4BdSM 274 8,28543
4Pf7b 242 9,87077
4QFSQ 235 9,13512
4RkEa 298 6,37548
4TDA4 186 5,93322
4UmK9 268 8,93508
4UmzZ 260 9,63133
4Up0Y 272 6,63609
4UpRj 264 7,60377
4Wl3p 260 7,68363
4f08H 242 9,13770
4fI6X 268 8,75579
4fJiQ 262 6,91937
4g2TT 257 9,14969
4gGuv 263 9,26683
4hLDI 293 8,90452
4kABm 230 8,36634
4uBdG 239 9,81378
4xxZ7 262 8,83251
55WQX 267 8,31850
5IOsv 250 9,59710
5IYCQ 250 9,61490
5brCN 211 6,38549
5d4wq 238 9,61199
5kOqe 268 8,79808
5kYXP 233 8,70422
5mwWA 268 8,69461
6BJYi 199 7,67516
6BRlZ 261 8,15411
6BWPJ 299 8,37730
6BZgo 268 8,19582
6QcDl 255 5,98972
6RECf 300 9,00451
6REYU 246 9,92157
6SA9y 276 6,71652
6U5va 247 9,18547
6VOJU 239 8,55742
6xR61 243 9,13738
7E7UE 264 9,09663
7Isri 229 9,54443
7TcsL 263 8,96860
7VuQv 266 7,05295
7bNo 240 6,90238
7eK5 268 9,86233
82Nn0 259 9,07091
86p0h 235 9,12990
89XAr 227 8,91780
8F3VC 233 9,77549
8a31E 205 6,59306
8aLJq 259 9,47680
8aLgx 248 9,27154
8bumV 229 8,52087
8eJNP 266 6,83900
8jp7X 280 9,42710
8lioD 240 7,69562
8rJ7 221 7,12701
95yv 248 9,06343
97j8 224 9,28656
Jgsx 236 9,31389
QiEN 288 9,83957
aRqC 265 9,04751
jjCD 249 9,38668
kkB 263 8,73838
sZDh 237 8,92637
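The member table uses decimal commas, so its rows need a small conversion step before they can be summarised numerically. The following Python sketch parses rows of the form `ID length pI` and averages the length and pI columns. The helper name `parse_row` is hypothetical, and only the first three rows are included, so the printed averages will not match the cluster-wide figures.

```python
from statistics import mean


def parse_row(line: str) -> tuple[str, int, float]:
    """Split 'ID length pI' and convert the decimal comma in pI to a point."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


# First three rows of the table, copied verbatim; the full table is longer.
ROWS = [
    "4fsrt 203 5,95584",
    "16T1o 256 7,19124",
    "1748p 266 6,53924",
]

parsed = [parse_row(row) for row in ROWS]
avg_length = mean(length for _, length, _ in parsed)
avg_pi = mean(pi for _, _, pi in parsed)

print(f"{len(parsed)} rows, avg length {avg_length:.1f} AA, avg pI {avg_pi:.2f}")
```

Feeding every row of the table into `ROWS` gives averages over the listed members only; the cluster total also counts the protein described in this entry.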
Similar Clusters (pHMM search)
# Cluster Protein ID # Members Identity (%) Alignment Length E-value
1 phalp2_32967 46Hv4 8 28,8% 218 2.174E-45
2 phalp2_18165 4uKtb 1 31,5% 203 4.527E-40
3 phalp2_10971 4Npcy 37 29,2% 178 3.549E-32
4 phalp2_31152 3LHkO 133 27,3% 172 2.319E-31
5 phalp2_24979 ze0C 8 29,0% 210 4.193E-28
6 phalp2_18142 4iLhs 1 27,4% 215 1.773E-26
7 phalp2_11760 45aqa 242 27,2% 154 1.615E-18
8 phalp2_21493 8lvHE 1 22,2% 207 1.911E-17
9 phalp2_14005 87m9T 1656 30,3% 132 4.821E-17
10 phalp2_4783 4YJpR 19 21,0% 200 4.165E-16

Domains

Representative sequence (used for alignment): 4fsrt (203 AA)
Member sequence: 1KPYm (292 AA)
Axis: positions 1 to 203 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01510

Taxonomy

Phage: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
1KPYm
Method AlphaFoldv2
Resolution 92.03
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
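The confidence legend can be expressed as a small lookup. The following Python sketch maps a pLDDT score to its band. It assumes the value labelled "Resolution" for an AlphaFold model (92.03 for 1KPYm) is a mean pLDDT score, which the page does not state explicitly. The legend also leaves the exact boundary values (90, 70, 50) unassigned; placing them in the lower band is a choice made here for illustration, and the function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Return the confidence band for a pLDDT score on the 0-100 scale."""
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Scores of 50 or below, including the boundary (assumption).
    return "Very low"


# Value reported for the 1KPYm model, assumed to be a mean pLDDT.
print(plddt_band(92.03))
```

A single mean score hides per-residue variation, so a model in the top band can still contain low-confidence regions.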

The structures below correspond to the cluster representative (4fsrt) rather than this protein.
PDB ID
4fsrt
Method AlphaFoldv2
Resolution 95.26
Chain position -