Protein

Protein accession
175c9 [EnVhog]
Representative
6U3Fm
Source
EnVhog (cluster: phalp2_1024)
Protein name
175c9
Lysin probability
99%
PhaLP type
endolysin
Probability: 89% (predicted by ML model)
Protein sequence
MQLELLDFTGRYVHQASFKNQIVMHHTVSGDNAQKVVDYWKKQPSLIGTSHLIDRQGKIYQVYDDSFWAGHVGDTTQDMAKFSLIPRNCSKSSLGVELLSMGGLKMYHSKLLDAYGYAFKGEVEQVSHRGYSYFEKYTTAQIQSLKELLLYWRDKYNIPLEMHGGVSSIFDLQKEALNGTPGLYTHCSFRHDKSDLYPSDELMAMLETIND
Physico‐chemical
properties
protein length:211 AA
molecular weight:24180,1 Da
isoelectric point:6,19
hydropathy:-0,44
Representative Protein Details
Accession
6U3Fm
Protein name
6U3Fm
Sequence length
209 AA
Molecular weight
24361,52780 Da
Isoelectric point
5,67187
Sequence
MIIEQVENFDKYIHQESFKNQIVLSQALIGDDIQDIIQKYREKDLPVSPSYIIDRDGTVYSMYPDKFWSDHLMDTNIVMNKYSLIPRNCSKSSIGIAFVTCGAMSEGRKFTFICKDNGKEFTGKVSNISTYRGFRFFESYTKDQINSLQELLSLLRLEYNIEIRYKDAIWDVSIEALTGIAGFYLQNSFNHEFTGLYPDTKLIRMLRNL
Other Proteins in cluster: phalp2_1024
Total (incl. this protein): 2 Avg length: 210,0 Avg pI: 5,93

Protein ID Length (AA) pI
6U3Fm 209 5,67187
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14005
87m9T
1656 32,6% 202 7.080E-45
2 phalp2_4134
1HOCY
3 26,8% 220 1.442E-36
3 phalp2_40162
8puc9
1 26,6% 218 1.538E-31
4 phalp2_11814
8vJeI
1 30,2% 208 2.761E-28
5 phalp2_13122
8iXkc
2 29,7% 198 1.793E-27
6 phalp2_33891
1NRwq
35 25,7% 198 1.163E-26
7 phalp2_11040
7EhAU
132 23,8% 218 2.905E-21
8 phalp2_35105
5kI8h
32 22,7% 211 7.598E-19
9 phalp2_9246
6Vxs8
22 25,6% 195 2.227E-13
10 phalp2_28209
iKDC
1185 20,7% 222 2.227E-13

Domains

Domains
Representative sequence (used for alignment): 6U3Fm (209 AA)
Member sequence: 175c9 (211 AA)
1 209 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6U3Fm) rather than this protein.
PDB ID
6U3Fm
Method AlphaFoldv2
Resolution 95.09
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50