Protein

Protein accession
23H8t [EnVhog]
Representative
3WBQS
Source
EnVhog (cluster: phalp2_26837)
Protein name
23H8t
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MQEVKHNGITKYIYQRLLAFIGNPYGVAGLMGNLYAESGLRSDNLQNSFEQKLGMTDEEYTDAVTNGTYTKFANDGAGYGLAQWTNSARKQNLLGLSKAIGAAIDSVEMQIIFLILELQNYSLVWNVLTKTSSSYEASDAVLEHYEKPATITDSSRDNRRSFSVQYYQLYGSDEKIAYSNDESVMFAGVLKRGSRGADVRTLQQQLKRLGYAGGTLAVDGAYGEKTGAAVSEFQRDYHLYSDGKAGAVTQYVLKLQERDGSRYTVVIGNLDFTSAMELYHKYKDAIIREGHVDAKNSDFV
Physico‐chemical
properties
protein length:300 AA
molecular weight:33328,8 Da
isoelectric point:5,62
hydropathy:-0,42
Representative Protein Details
Accession
3WBQS
Protein name
3WBQS
Sequence length
287 AA
Molecular weight
31117,63450 Da
Isoelectric point
5,18681
Sequence
MSFKMPEHAGAVWKALYALIRNPYGVAGVMGNLYAESGLIANNLQNSYSKSLGMTDAEYTKAVDTMDYLDFVTDGAGYGIAQWTYSTRKQNLLAFARACRTSVGDLETQTVFLILELNNYSYVLKLLREADSVKAASDAFLFYFEKPASVLNGTSGSSDKRAAYSEEFYNAFKEVAPDGKTIDFTVLKRGSKGDAVRELQQKLDALGYDLGKAGADGVYGAQTAAVVRTFQTDYGLVADGKAGAFTQCVLHIVSDEHGATYQVTIAGLTHTEALELVAKYPNAEIVN
Other Proteins in cluster: phalp2_26837
Total (incl. this protein): 10 Avg length: 291,4 Avg pI: 5,51

Protein ID Length (AA) pI
3WBQS 287 5,18681
21p4B 299 4,76432
21rgE 297 5,34221
23Ba4 291 5,10115
23HO8 299 4,84555
23PtJ 286 5,30185
23rzB 285 5,13457
23ws2 300 6,23935
3dXI1 270 7,59110
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_6190
5KbJP
74 48,4% 196 2.812E-58
2 phalp2_9140
6h8Qx
4 38,6% 238 7.187E-58
3 phalp2_26511
41cyj
11 32,7% 275 1.006E-44
4 phalp2_2449
5kOCV
9 42,1% 190 3.701E-42
5 phalp2_16800
1gzPH
189 34,8% 227 1.127E-40

Domains

Domains
Representative sequence (used for alignment): 3WBQS (287 AA)
Member sequence: 23H8t (300 AA)
1 287 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF18013

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
23H8t
Method AlphaFoldv2
Resolution 81.42
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (3WBQS) rather than this protein.
PDB ID
3WBQS
Method AlphaFoldv2
Resolution 87.32
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50