Protein

Protein accession
4Cp30 [EnVhog]
Representative
8mfD4
Source
EnVhog (cluster: phalp2_19328)
Protein name
4Cp30
Lysin probability
95%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
LAYGSTVLHDDEQAGWLHVIDGWVSAEYVGAAQGAETVTQELPLVVDDGLAHPLPASIITQHFYQNPPDYAQWDLPGHDGTDLGGMAEGTPIGSIAAGIVVDASVDKAYGKYVVVRHPGLDACSFYAHLNEIKVGGGQEVAAGETIGLLGATGNATGPHLHLEIRMINADGSYRQNTPMRGGRVDPQTWCCLHGLRL
Physico‐chemical
properties
protein length:197 AA
molecular weight:20833,0 Da
isoelectric point:4,83
hydropathy:-0,09
Representative Protein Details
Accession
8mfD4
Protein name
8mfD4
Sequence length
233 AA
Molecular weight
26255,05160 Da
Isoelectric point
5,44031
Sequence
MKYFRNSAYVNLRREPNGDVIDVVRKNTLVYSIGEPRWVNDILWANVITYGFSGEWATGYMGVGLEGEIFLDDWEPHVKLSSPYKSPVVYLTQMYGENPGIYSRFGYNGHNGVDLVGVKKQIYAVAPGVATVGYDANGYGHYVKIEGYSLITIYAHLAEYTVRQDQYVQQGELIGHEGNSGGESWGMGVHLHLDIRNKLDYNASNGYGGRVDPLPYLDWSNIQFPTYVNVLNF
Other Proteins in cluster: phalp2_19328
Total (incl. this protein): 6 Avg length: 230,0 Avg pI: 5,24

Protein ID Length (AA) pI
8mfD4 233 5,44031
11jjZ 239 5,65425
5nDKo 242 4,56118
8rz01 236 5,48834
8sEk8 233 5,47259
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_22730
8dH10
29 39,0% 151 5.403E-32
2 phalp2_1989
4FVbI
10 38,8% 157 1.751E-28
3 phalp2_36490
1jUYj
11 40,9% 149 4.444E-28
4 phalp2_39908
1iVNq
39 38,6% 150 3.412E-26
5 phalp2_38873
1OvVW
15 40,1% 147 6.560E-24
6 phalp2_36749
6BXWz
16 33,5% 164 1.978E-20
7 phalp2_2282
4DAIJ
7 33,8% 180 4.971E-20
8 phalp2_21320
1KuSO
3 29,8% 161 7.857E-19
9 phalp2_15826
4gsPm
3 36,4% 159 1.450E-18
10 phalp2_35838
4GGll
4 34,2% 146 9.086E-18

Domains

Domains
Representative sequence (used for alignment): 8mfD4 (233 AA)
Member sequence: 4Cp30 (197 AA)
1 233 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4Cp30
Method AlphaFoldv2
Resolution 88.14
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (8mfD4) rather than this protein.
PDB ID
8mfD4
Method AlphaFoldv2
Resolution 93.88
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50