Protein

Protein accession
5ng2J [EnVhog]
Representative
2HC1f
Source
EnVhog (cluster: phalp2_34181)
Protein name
5ng2J
Lysin probability
88%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MNLKIALDKAIIEQALKARSKNFTLGEIIHRTEGLSLQELLIGMYRMGYAQAFCDYLALKFKKNIRLKVQSGWRSVAYNSTLKGASANSYHIWRFDDKNVIISANDFSSPDLSKEALHEEFAAFVRGETYLHRRLGFNHASDYGKDEDFTV
Physico‐chemical
properties
protein length:151 AA
molecular weight:17271,5 Da
isoelectric point:8,95
hydropathy:-0,33
Representative Protein Details
Accession
2HC1f
Protein name
2HC1f
Sequence length
134 AA
Molecular weight
15525,43030 Da
Isoelectric point
5,81931
Sequence
MNWQNLRTRNFTIQEVIHSPVDLFPERLAPVAIAAMHYMQACRDYLGVPLVITSGYRSPAYNEEIGGSENSYHVWRFTEDGHLIFAVDCYSSKLEIQLLYEDLKKLVVGEVYWHKGRGFVHVSPYGKDESWIQS
Other Proteins in cluster: phalp2_34181
Total (incl. this protein): 9 Avg length: 142,2 Avg pI: 7,52

Protein ID Length (AA) pI
2HC1f 134 5,81931
1pqrR 135 9,55823
4OLSO 155 6,96899
4U5Bu 140 6,06611
4rjdc 155 6,50576
Y8Vi 133 8,97440
jihv 146 6,90187
A0A6J5MMA9 131 7,97353
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24996
FFK4
307 30,7% 130 3.266E-11
2 phalp2_37118
1hl31
667 28,3% 127 5.450E-10
3 phalp2_9595
1rj6b
47 29,1% 134 2.597E-09
4 phalp2_1343
17yCd
10947 27,1% 107 2.597E-09
5 phalp2_26475
23cct
2 27,0% 100 9.046E-09
6 phalp2_11058
4UnMN
68 29,5% 132 4.295E-08
7 phalp2_37787
4Agyt
72 30,1% 136 3.788E-07
8 phalp2_19324
8pvH4
112 29,2% 99 1.788E-06
9 phalp2_33505
2jgI
2 24,4% 127 1.146E-05
10 phalp2_30902
HOls
632 34,1% 82 1.561E-05

Domains

Domains
Representative sequence (used for alignment): 2HC1f (134 AA)
Member sequence: 5ng2J (151 AA)
1 134 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08291

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2HC1f) rather than this protein.
PDB ID
2HC1f
Method AlphaFoldv2
Resolution 94.69
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50