Protein

Protein accession
43kuP [EnVhog]
Representative
4sZcx
Source
EnVhog (cluster: phalp2_4634)
Protein name
43kuP
Lysin probability
96%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MENVDWSFISKEEGTRTKGYVLDPSRKEFQNSGATIATGFDIGKHNEYELSKIFKNDPELYNLFLPYVGLKGAAAAAKLKEMEEKGEPLDLGNINIKASYVDNLVKSYKYNQMAGAWENLTSEIRFDELPREYATAVMSVAFQHGVNGAPKFFGHAARGDWSAAEAELRDFYGFYKSEEDKSKGGFSNRPVDPTKKRKVSEQLKMHPLQPRMTRTAILMAQGIKKQTALAPHLFEPSSVEISTTPPHREQSQDNLGFTI
Physico-chemical properties
protein length: 259 AA
molecular weight: 29108,5 Da
isoelectric point: 6,62
hydropathy: -0,64
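The length and hydropathy values above can be recomputed from the sequence. The sketch below assumes that "hydropathy" means the GRAVY score (the mean Kyte-Doolittle hydropathy per residue); the page does not name the scale it uses, so the result may differ slightly from the reported -0,64. The names `KYTE_DOOLITTLE`, `gravy` and `SEQ_43KUP` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy scale (assumed; the page does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence.upper()) / len(sequence)


# Member sequence 43kuP, copied from the record above.
SEQ_43KUP = "MENVDWSFISKEEGTRTKGYVLDPSRKEFQNSGATIATGFDIGKHNEYELSKIFKNDPELYNLFLPYVGLKGAAAAAKLKEMEEKGEPLDLGNINIKASYVDNLVKSYKYNQMAGAWENLTSEIRFDELPREYATAVMSVAFQHGVNGAPKFFGHAARGDWSAAEAELRDFYGFYKSEEDKSKGGFSNRPVDPTKKRKVSEQLKMHPLQPRMTRTAILMAQGIKKQTALAPHLFEPSSVEISTTPPHREQSQDNLGFTI"

print("length:", len(SEQ_43KUP), "AA")
print("GRAVY:", round(gravy(SEQ_43KUP), 2))
```

A negative GRAVY indicates a predominantly hydrophilic sequence, which is consistent with the sign of the reported value.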
Representative Protein Details
Accession
4sZcx
Protein name
4sZcx
Sequence length
254 AA
Molecular weight
28815,18590 Da
Isoelectric point
5,47350
Sequence
MEADISAPPARPKRAFRNIDLDFLTIKEGFKTKAYTLNSKKKEFQNSGVTIINGFDIGKHDEGELHRMFRRGTKAYNLFLPYVGLTGAAAEAKLKEKPLELKDLEGPASPLYIEQQVMKYKYAQVAEAWANRNSEIRFDELPHEYATAVMVVAFQHGPNGAPIFFGHAARGDWSAAEAELRDFYNPKGMADDDTEDQHWNQPRMTETADYMAQGTKKMADKFPHLFGAGKFLDDIFGDKEEKYDIETGEMKEVM
Other Proteins in cluster: phalp2_4634
Total (incl. this protein): 2; Avg length: 256,5; Avg pI: 6,05

Protein ID Length (AA) pI
4sZcx 254 5,47350
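The cluster averages can be checked from the two member records. The sketch below takes the lengths and pI values reported on this page (259 AA and 6,62 for 43kuP; 254 AA and 5,47350 for 4sZcx) and parses the decimal commas before averaging. The helper `parse_decimal_comma` and the `members` dictionary are illustrative names, not part of the database.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a number written with a decimal comma (e.g. '6,62') to float."""
    return float(text.replace(",", "."))


# Values as reported on this page for the two cluster members.
members = {
    "43kuP": {"length": 259, "pI": "6,62"},
    "4sZcx": {"length": 254, "pI": "5,47350"},
}

avg_length = sum(m["length"] for m in members.values()) / len(members)
avg_pi = sum(parse_decimal_comma(m["pI"]) for m in members.values()) / len(members)

print(f"Avg length: {avg_length}")  # Avg length: 256.5
print(f"Avg pI: {avg_pi:.2f}")      # Avg pI: 6.05
```

Both results agree with the averages shown for the cluster (256,5 and 6,05).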
Similar Clusters (pHMM search)
#  Cluster               # Members  Identity (%)  Alignment Length  E-value
1  phalp2_20898 (6VHSS)  327        34,0%         182               1.173E-21
2  phalp2_33062 (4ARSx)  1          31,3%         201               8.089E-18
3  phalp2_28237 (t4Qh)   121        32,3%         173               4.985E-17
4  phalp2_26096 (7w9ed)  2          33,9%         171               1.672E-16
5  phalp2_4098 (1twmT)   2          27,7%         191               5.595E-13
6  phalp2_6029 (4Ttjs)   1          26,3%         201               1.017E-12
7  phalp2_39384 (4Ji2N)  17         26,7%         176               1.677E-09
8  phalp2_37840 (4Jpeq)  1          26,1%         199               1.754E-08
9  phalp2_29328 (OWz8)   5          25,4%         181               1.050E-05
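When working with the similar-cluster hits programmatically, a common step is to keep only hits below an E-value cutoff. The sketch below encodes the nine hits listed above and filters them; the cutoff of 1e-10 is an arbitrary illustration, not a threshold used by the database, and `Hit`, `HITS` and `filter_hits` are hypothetical names.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    cluster: str
    members: int
    identity_pct: float
    aln_length: int
    evalue: float


# Hits transcribed from the table above (decimal commas converted to points).
HITS = [
    Hit("phalp2_20898", 327, 34.0, 182, 1.173e-21),
    Hit("phalp2_33062", 1, 31.3, 201, 8.089e-18),
    Hit("phalp2_28237", 121, 32.3, 173, 4.985e-17),
    Hit("phalp2_26096", 2, 33.9, 171, 1.672e-16),
    Hit("phalp2_4098", 2, 27.7, 191, 5.595e-13),
    Hit("phalp2_6029", 1, 26.3, 201, 1.017e-12),
    Hit("phalp2_39384", 17, 26.7, 176, 1.677e-09),
    Hit("phalp2_37840", 1, 26.1, 199, 1.754e-08),
    Hit("phalp2_29328", 5, 25.4, 181, 1.050e-05),
]


def filter_hits(hits: List[Hit], max_evalue: float) -> List[Hit]:
    """Keep hits with E-value <= max_evalue, most significant first."""
    kept = [h for h in hits if h.evalue <= max_evalue]
    return sorted(kept, key=lambda h: h.evalue)


# Illustrative cutoff only; choose a threshold appropriate to the analysis.
strong = filter_hits(HITS, max_evalue=1e-10)
for h in strong:
    print(f"{h.cluster}\t{h.identity_pct}%\t{h.evalue:.3e}")
```

With this cutoff, the first six clusters in the table are retained and the last three are dropped.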

Domains

Representative sequence (used for alignment): 4sZcx (254 AA)
Member sequence: 43kuP (259 AA)
Axis: residues 1 to 254 (representative).
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF16754

Taxonomy

       Name                     Taxonomy ID            Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG [NCBI]  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4sZcx) rather than this protein.
PDB ID 4sZcx
Method AlphaFoldv2
Resolution 83.05
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
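The confidence bands above translate directly into a small lookup. The sketch below is one way to map a pLDDT score to its band; the legend leaves scores of exactly 90, 70 and 50 undefined, so assigning 90 to "High" and both 70 and 50 to "Low" is an assumption made here, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Boundary handling is an assumption: the legend uses strict inequalities,
    so 90 is treated as "High" and 70 and 50 are treated as "Low".
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected in the range 0 to 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt >= 50:
        return "Low"
    return "Very low"


for score in (95.0, 83.05, 60.0, 42.0):
    print(score, "->", plddt_band(score))
```

If the 83.05 reported for the representative model is read as a pLDDT score, which the page does not state explicitly, it would fall in the "High" band.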