Protein

Protein accession
36BqA [EnVhog]
Representative
6Dcie
Source
EnVhog (cluster: phalp2_29014)
Protein name
36BqA
Lysin probability
99%
PhaLP type
endolysin
Probability: 86% (predicted by ML model)
Protein sequence
MRLLLVSILVTILPHISDAHDGLYDSIINKYAAAYRISPALVKAIIKVESNFNEQAVGRTHKEVGLMQLHPRYFPLASFDPAANIKMGVKYLAKMRQACYSRYKEAWFICYNTGPNTKRQVNVKLNKYYKKVMNATAHFKKQEERLLYARRVAQSAH
Physico-chemical properties
protein length: 157 AA
molecular weight: 18091,0 Da
isoelectric point: 9,94
hydropathy: -0,26
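The length and hydropathy values can be recomputed from the sequence. The following is a minimal sketch, assuming the hydropathy figure is a GRAVY score (mean Kyte-Doolittle hydropathy per residue); the page does not say which scale it uses, and the function name `gravy` is illustrative only.

```python
# Kyte-Doolittle hydropathy index per amino acid (standard published scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 36BqA as listed above, split only for readability.
SEQ = (
    "MRLLLVSILVTILPHISDAHDGLYDSIINKYAAAYRISPA"
    "LVKAIIKVESNFNEQAVGRTHKEVGLMQLHPRYFPLASFD"
    "PAANIKMGVKYLAKMRQACYSRYKEAWFICYNTGPNTKRQ"
    "VNVKLNKYYKKVMNATAHFKKQEERLLYARRVAQSAH"
)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (assumed definition)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(len(SEQ), round(gravy(SEQ), 2))
```

Under this assumption the sketch gives a length of 157 residues and a hydropathy that rounds to the listed -0,26. Molecular weight and isoelectric point depend on the mass table and pKa set chosen, so they are left out here.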
Representative Protein Details
Accession
6Dcie
Protein name
6Dcie
Sequence length
126 AA
Molecular weight
13744,03490 Da
Isoelectric point
4,32678
Sequence
LSPYWPDNISQWSELIIRYADLREIPADLLAAQMYQESGGDPDATGAAGEIGLMQILGRYHPCASYDPEQNISCGTAILAGHYKATGDWNTALARYNAGTAGQSMGNGYVYADKIISMYEEAESER
Other Proteins in cluster: phalp2_29014
Total (incl. this protein): 9; Avg length: 152,4 AA; Avg pI: 7,50

Protein ID Length (AA) pI
6Dcie 126 4,32678
3QAKu 145 4,96497
4iAjh 170 9,16020
69q6j 167 6,16273
6VD1q 147 9,83286
7SsUe 137 4,50406
7ZURw 180 9,01469
8ixVG 143 9,62141
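The cluster averages can be checked against the member table. This is a small sketch, assuming the averages cover the eight listed members plus this protein (36BqA, 157 AA, pI 9,94 as rounded on the page); `parse_decimal` and `cluster_averages` are hypothetical helper names.

```python
# (protein ID, length in AA, pI with decimal comma) as listed in the table.
MEMBERS = [
    ("6Dcie", 126, "4,32678"),
    ("3QAKu", 145, "4,96497"),
    ("4iAjh", 170, "9,16020"),
    ("69q6j", 167, "6,16273"),
    ("6VD1q", 147, "9,83286"),
    ("7SsUe", 137, "4,50406"),
    ("7ZURw", 180, "9,01469"),
    ("8ixVG", 143, "9,62141"),
]
# This protein is not in the table; its pI is only given to two decimals.
THIS_PROTEIN = ("36BqA", 157, "9,94")


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma number such as '4,32678' to a float."""
    return float(text.replace(",", "."))


def cluster_averages(rows):
    """Return (mean length, mean pI) over the given rows."""
    lengths = [length for _, length, _ in rows]
    pis = [parse_decimal(pi) for _, _, pi in rows]
    return sum(lengths) / len(lengths), sum(pis) / len(pis)


ALL_ROWS = MEMBERS + [THIS_PROTEIN]
AVG_LENGTH, AVG_PI = cluster_averages(ALL_ROWS)

if __name__ == "__main__":
    print(len(ALL_ROWS), round(AVG_LENGTH, 1), round(AVG_PI, 2))
```

With these nine rows the rounded means agree with the listed 152,4 AA and 7,50.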
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26388 (1vB2c) 5 35,3% 116 1.707E-20
2 phalp2_600 (139Z6) 763 32,8% 125 6.829E-18
3 phalp2_10886 (4wSj8) 15 29,0% 117 8.497E-17
4 phalp2_16618 (8yg21) 13 29,4% 129 1.164E-16
5 phalp2_39937 (1pBu8) 6 28,4% 116 1.164E-16
6 phalp2_39988 (1NzQM) 1 33,3% 108 1.596E-16
7 phalp2_40124 (7YNXd) 9 33,8% 121 1.596E-16
8 phalp2_29647 (8vJ1F) 42 34,7% 95 2.996E-16
9 phalp2_38948 (3NAZh) 14 28,3% 127 4.106E-16
10 phalp2_17795 (fa2K) 4 27,7% 119 5.626E-16
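For downstream filtering, the similar-cluster hits can be parsed into records. This is a sketch only: the `Hit` class and `parse_hit` function are hypothetical, and the second token of each row (for example 1vB2c) is assumed to be a protein ID associated with the cluster, which the page does not label.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    cluster: str
    protein: str       # assumed: protein ID shown under the cluster name
    members: int
    identity: float    # percent
    aln_length: int
    evalue: float


# One hit per line: rank, cluster, protein, members, identity, length, E-value.
RAW = """\
1 phalp2_26388 1vB2c 5 35,3% 116 1.707E-20
2 phalp2_600 139Z6 763 32,8% 125 6.829E-18
3 phalp2_10886 4wSj8 15 29,0% 117 8.497E-17
4 phalp2_16618 8yg21 13 29,4% 129 1.164E-16
5 phalp2_39937 1pBu8 6 28,4% 116 1.164E-16
6 phalp2_39988 1NzQM 1 33,3% 108 1.596E-16
7 phalp2_40124 7YNXd 9 33,8% 121 1.596E-16
8 phalp2_29647 8vJ1F 42 34,7% 95 2.996E-16
9 phalp2_38948 3NAZh 14 28,3% 127 4.106E-16
10 phalp2_17795 fa2K 4 27,7% 119 5.626E-16
"""


def parse_hit(line: str) -> Hit:
    """Parse one whitespace-separated row; identity uses a decimal comma."""
    _rank, cluster, protein, members, identity, aln_length, evalue = line.split()
    return Hit(
        cluster=cluster,
        protein=protein,
        members=int(members),
        identity=float(identity.rstrip("%").replace(",", ".")),
        aln_length=int(aln_length),
        evalue=float(evalue),
    )


HITS = [parse_hit(line) for line in RAW.splitlines() if line.strip()]
BEST_BY_IDENTITY = max(HITS, key=lambda h: h.identity)
LARGEST_CLUSTER = max(HITS, key=lambda h: h.members)

if __name__ == "__main__":
    print(BEST_BY_IDENTITY.cluster, LARGEST_CLUSTER.cluster)
```

Sorting by identity and by member count gives different leaders here (phalp2_26388 and phalp2_600 respectively), so the ranking criterion matters when choosing a neighbouring cluster to inspect.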

Domains

Representative sequence (used for alignment): 6Dcie (126 AA)
Member sequence: 36BqA (157 AA)
Domain positions follow the representative sequence above (axis: residues 1 to 126); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01464

Taxonomy

Phage: Unknown from Metagenome (NCBI taxonomy ID: UNKNOWN_ENVHOG); no lineage information
Host: no host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
36BqA
Method AlphaFoldv2
Resolution 88.54
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
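The confidence bands above can be expressed as a small lookup. This is a sketch under two assumptions: scores exactly on a boundary (90, 70, 50) fall into the lower band, since the legend leaves them unassigned, and the function name `plddt_band` is illustrative. Treating the "Resolution" values listed for the AlphaFold models (88.54 and 96.83) as mean pLDDT scores is a further assumption that the page does not confirm.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    Boundary values are assigned to the lower band (an assumption).
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (88.54, 96.83):
        print(value, plddt_band(value))
```

If those two values are indeed mean pLDDT scores, the model for 36BqA falls in the "High" band and the model for the representative 6Dcie in the "Very high" band.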

The structures below correspond to the cluster representative (6Dcie) rather than this protein.
PDB ID
6Dcie
Method AlphaFoldv2
Resolution 96.83
Chain position -