Protein

Protein accession
1dCBF [EnVhog]
Representative
5zm1e
Source
EnVhog (cluster: phalp2_18368)
Protein name
1dCBF
Lysin probability
99%
PhaLP type
endolysin
Probability: 88% (predicted by ML model)
Protein sequence
MSNIPAWLEMARRCIGIEEHPGEANSPQIMRAPDIIAATYPEMEEYCSYYTGDEIAWCGLAVAFFMTSAGYRPVFGSDDVHRFLWAAAWEDWGDQLTTPKPGAIVCLDHHVALYERTEGSNVILVGGNHSDMVKESAFASSGIKAI
Physico-chemical properties
protein length: 146 AA
molecular weight: 16102,0 Da
isoelectric point: 4,66
hydropathy: -0,07
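As a rough cross-check of the length and hydropathy values above, the sketch below recomputes them from the protein sequence. It assumes the hydropathy figure is a GRAVY score (mean Kyte-Doolittle hydropathy per residue); the page does not state which scale it uses, and the names `KYTE_DOOLITTLE`, `SEQ_1DCBF` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy scale.
# ASSUMPTION: the page does not name the scale behind its "hydropathy" value.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of 1dCBF as listed on this page, split only for readability.
SEQ_1DCBF = (
    "MSNIPAWLEMARRCIGIEEHPGEANSPQIMRAPDIIAATYPEMEEYCSYY"
    "TGDEIAWCGLAVAFFMTSAGYRPVFGSDDVHRFLWAAAWEDWGDQLTTPK"
    "PGAIVCLDHHVALYERTEGSNVILVGGNHSDMVKESAFASSGIKAI"
)


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy per residue (GRAVY)."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


if __name__ == "__main__":
    print(f"length: {len(SEQ_1DCBF)} AA")
    print(f"GRAVY:  {gravy(SEQ_1DCBF):.2f}")
```

If the printed GRAVY value differs from the hydropathy listed above, the most likely explanation is that the database uses a different scale or rounding than assumed here.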
Representative Protein Details
Accession
5zm1e
Protein name
5zm1e
Sequence length
158 AA
Molecular weight
17399,42370 Da
Isoelectric point
4,70095
Sequence
MIPAWLEIARRCIGIEEHPGDADNPQIMRAPEIIAAAYPDMATYCSYYTADSIAWCGLAVAFFVTSAGYRPVYGEDDVHRFLYAEAWSDWGQELFEPVPGAIITLDHHVALFERREGDNVILLGGNQSDQVKESAFAASGIKAIRWPLTKEPPARGGG
Other Proteins in cluster: phalp2_18368
Total (incl. this protein): 7; Avg length: 150,3 AA; Avg pI: 5,48

Protein ID Length (AA) pI
5zm1e 158 4,70095
1FUGn 151 4,83998
1Isrv 148 5,55245
1ZdeM 150 5,05744
4HHfY 149 4,88067
6U1kY 150 8,69300
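The cluster summary (7 proteins, average length 150,3, average pI 5,48) can be reproduced from the table plus this protein's own values (146 AA, pI 4,66). The sketch below is a minimal illustration; the helper names are hypothetical, and it assumes the averages are plain arithmetic means over all seven members, using this protein's pI as rounded on the page.

```python
# (protein ID, length in AA, pI) as listed on the page; decimal commas kept.
ROWS = [
    ("1dCBF", "146", "4,66"),     # this protein (pI as rounded on the page)
    ("5zm1e", "158", "4,70095"),
    ("1FUGn", "151", "4,83998"),
    ("1Isrv", "148", "5,55245"),
    ("1ZdeM", "150", "5,05744"),
    ("4HHfY", "149", "4,88067"),
    ("6U1kY", "150", "8,69300"),
]


def parse_decimal_comma(text: str) -> float:
    """Convert a number written with a decimal comma (e.g. '4,66') to float."""
    return float(text.replace(",", "."))


def cluster_averages(rows):
    """Return (mean length, mean pI) over all rows."""
    lengths = [int(length) for _, length, _ in rows]
    pis = [parse_decimal_comma(pi) for _, _, pi in rows]
    return sum(lengths) / len(lengths), sum(pis) / len(pis)


avg_len, avg_pi = cluster_averages(ROWS)
print(f"{len(ROWS)} proteins, avg length {avg_len:.1f}, avg pI {avg_pi:.2f}")
# prints: 7 proteins, avg length 150.3, avg pI 5.48
```

The single basic member (6U1kY, pI 8,69300) pulls the mean pI well above the values of the other six proteins, which all lie between 4,66 and 5,56.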
Similar Clusters (pHMM search)
#   Cluster                # Members  Identity (%)  Alignment Length  E-value
1   phalp2_35973 (5kZXR)   1764       33,7%         151               3.797E-19
2   phalp2_35078 (59R52)   32         33,3%         147               1.817E-18
3   phalp2_11009 (4NiJd)   22         35,0%         157               3.398E-18
4   phalp2_27707 (6Pypi)   4          29,3%         160               3.035E-17
5   phalp2_32166 (6PqA1)   486        31,8%         157               3.035E-17
6   phalp2_21703 (3iJm5)   22         33,5%         152               1.060E-16
7   phalp2_28323 (16UXy)   9          31,7%         148               5.054E-16
8   phalp2_33920 (1ZuoD)   42         33,1%         160               6.906E-16
9   phalp2_20540 (4muWr)   288        35,6%         160               1.289E-15
10  phalp2_34976 (4H3SU)   3          28,2%         145               7.428E-14

Domains

Unannotated
Representative sequence (used for alignment): 5zm1e (158 AA)
Member sequence: 1dCBF (146 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

       Name                     Taxonomy ID            Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG [NCBI]  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5zm1e) rather than this protein.
PDB ID 5zm1e
Method AlphaFoldv2
Resolution 92.17
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
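The confidence legend above maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows; the function name `plddt_band` is hypothetical. The legend uses strict inequalities and so does not say where exactly 90, 70 and 50 belong, so the code assigns each boundary value to the lower band as an explicit assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend.

    ASSUMPTION: boundary values (exactly 90, 70, 50) fall into the lower band;
    the legend itself leaves them unspecified.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```

If the 92.17 shown in the Resolution field for the AlphaFoldv2 model were a mean pLDDT (the page does not say so), `plddt_band(92.17)` would place it in the "Very high" band.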