Protein

Protein accession
4XLMX [EnVhog]
Representative
4IA3O
Source
EnVhog (cluster: phalp2_16206)
Protein name
4XLMX
Lysin probability
99%
PhaLP type
endolysin
Probability: 96% (predicted by ML model)
Protein sequence
MPSKSAAISSDQTAPPWLLVMRAVTGLSEWEDGSNPKIEAMAVYIGKKFPDQAAYASQFDDDDIAWCGVATDFCLAASTPEGISGPFGVKDTDRWMWAESFASDPNFVHLARPVLGAITIMTREGGNHVTMYESTNSDGTINCRGGNQSNCVKVSTYDPDTVIAFVWPRAWPLPEVPEIPVEDRPMLERGDDGPDVIDLQRMIPNFTGEIDGDFGPVTEENVIRYQTTRGLEVDGIVGQETWTALYENKEPIPPPEPAEGLPDPLSREDQDAIAAIAMSSAIARYSWRDRGQAPAGYTQGVALAFANVYRKLLLDHAPALEMAQARTGSSKDVFTEYASEFKNAGMTTTGGVESLVSLWSLLLGLGMRESSGQHCCGRDQSVPPGYYGPADTTTEAGAWQTSYDAHNCDDTLDQTFEEYSMGAMRDNPQGFLSTFEQGVSCSSADWECFGPTDSPGYRHQKMSKERPAYAAEFAAITLRNLCNHFGPINRHEAEVRRDAADMFRAVQDYVDRHVPVTS
Physico‐chemical
properties
protein length:518 AA
molecular weight:56744,4 Da
isoelectric point:4,50
hydropathy:-0,43
Representative Protein Details
Accession
4IA3O
Protein name
4IA3O
Sequence length
534 AA
Molecular weight
57961,32620 Da
Isoelectric point
4,31041
Sequence
MKRQHKAQSGLDLKSDDGAPPWLNVMRAITGLTETPGEADNEKIIGMARYLGKKFSEQKSYWDQYQHDSTAWCGLTMAFCMGAATPDGISGPWGPTDTDRGMWALAWADSEDYAPLDEPVLGAVVVMEREGGGHVTCLEGVNSDGTYRCRGGNQSDAVNVQNYSRSQVVALVWPKAYGDVPRRSLSEGDSGSDVEELQQRLGLAPADCDGDYGPTTEAAVRGFQRGWALDPDGEVGPATWAALDELQNKVMGERLSASLITDISDRAARSAIAKYPWEGRGVTPMGYAKGMALAWAIAVTDLSAGEDWADVMAAGASGDPDSDVLSYYKDHLRELDWDCSRASIDTMRALWCILWALGPRESSGNHFCGRDQSADNTDADTCEAGAWQWSWNLATSSDTIGDLMDEYWADPSSFLEQWNEDGNTDAGNIENYGSGTGARSQWLSKYCPVFSALVTAVGLRKRRNHWGPINRDELDWEGVEICDELLTQVQSIMEGVDPTPIPPDPPEEVATVSIQITASGPVTVVVNGTDISDL
Other Proteins in cluster: phalp2_16206
Total (incl. this protein): 76 Avg length: 517,9 Avg pI: 4,62

Protein ID Length (AA) pI
4IA3O 534 4,31041
11E1E 512 4,40391
15EVG 501 4,46916
15GVp 491 4,49258
15HQj 518 5,42138
15HcK 496 4,28858
15IS4 503 4,72511
15Im8 525 4,72493
16QA8 514 4,67151
16RAY 498 4,22958
1DpAj 528 4,81849
1Hoax 516 4,38424
1Im82 404 4,41300
1NGU1 494 4,53936
1Zdat 515 4,50150
1cXTo 522 4,37498
1ogC8 514 4,55169
1ogEc 530 4,77774
1ogaG 516 4,34678
1ognv 499 4,40789
1zFQp 530 4,75426
2SDdg 513 4,63689
2SxZe 528 4,74551
2XOL 506 4,48917
2Yi7 535 4,68941
2dgd3 526 4,84850
2gpvA 518 4,33525
30EFb 519 4,54720
35Fjw 496 4,69322
3g5Ya 519 4,43420
47XiM 500 4,40959
4EOUw 402 4,58289
4ETUk 535 4,68077
4EZoO 524 4,17036
4FdGN 528 4,47882
4XMAv 544 4,48183
5B6Gn 524 5,33465
5EZNC 511 4,41590
5GUD1 514 4,79155
5IJgd 524 4,65843
5J2ms 533 4,37145
5J5Pk 531 4,57073
5j2n2 488 5,03579
5kimj 532 4,84850
5nDqf 531 4,50042
5nE8d 532 4,73710
5nEVW 531 4,76768
5nEYB 518 4,49428
5nEYG 508 4,69862
5nGNv 525 4,79206
5naWX 488 5,17942
5nawb 529 4,63183
5nbhT 529 4,61177
5nc7R 560 4,47018
5qQJv 528 4,84850
6DaIw 505 4,38561
6F9D4 514 4,49678
6I5t2 642 4,91790
6I8js 657 4,84839
6Kerx 514 4,55726
6KexZ 518 4,83708
6KgUh 512 4,72601
6S42z 516 4,60944
6S4eZ 493 4,26948
6UVF3 524 4,57255
6UX75 538 4,51531
6Y1VL 538 5,06637
Th0z 522 4,46825
TxiA 516 4,72397
fVgF 513 4,52412
ghyC 522 4,30552
jEps 578 4,57312
jFJ5 409 5,66704
kQxK 529 4,76063
traz 497 4,55788
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_18473
6I4ij
130 47,2% 330 2.079E-161
2 phalp2_30343
4DZKQ
2 37,3% 340 4.586E-108

Domains

Domains
Unannotated
PG_1
Representative sequence (used for alignment): 4IA3O (534 AA)
Member sequence: 4XLMX (518 AA)
1 534 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4IA3O) rather than this protein.
PDB ID
4IA3O
Method AlphaFoldv2
Resolution 91.38
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50