Protein

Protein accession
87tCp [EnVhog]
Representative
154un
Source
EnVhog (cluster: phalp2_38742)
Protein name
87tCp
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MRLSFVTCFAFIAISTVVAHADPSVETMVKRTADRFGVPKELAVNVARVETRISCGKVGAHGERGPLQIRPSSAAGLGYKNIKHASCQRQLDAGMAHLLMCYKAARRNWYRAAACHNGGPGILKKRSMRRSVKIYARMVVR
Physico‐chemical
properties
protein length:141 AA
molecular weight:15528,1 Da
isoelectric point:10,78
hydropathy:-0,11
Representative Protein Details
Accession
154un
Protein name
154un
Sequence length
213 AA
Molecular weight
22673,98050 Da
Isoelectric point
10,92457
Sequence
MQSSNYLVLLDNLVGGTGIEPVTPTMSRQASTAKSLINKERSTSKFALCSLYIHGKSGQSRPIAAKKNRIKSCLAGAALCLLLASDSSSPPTQASDATSLVVAAAKRHRVPVDLAVRVGRAESGLQCHRHNRSGASGPLQIMPSTARAMGYRGPSIRRASCAVQTEWGMRHLAMCYRGAKGDRRIAAACHYQGVSALRRVTKAGAAYARRVAR
Other Proteins in cluster: phalp2_38742
Total (incl. this protein): 15 Avg length: 159,8 Avg pI: 10,84

Protein ID Length (AA) pI
154un 213 10,92457
156nO 150 10,49225
1WPvQ 213 11,15853
29peP 141 10,62518
2IkeH 167 11,22113
2IldY 206 11,14499
2IoUd 145 11,57216
4NLSy 150 10,95094
4be2s 138 11,37682
4bfZc 132 10,51713
4nBAl 132 11,27167
5tLXk 154 10,26654
7Kcpx 161 10,70390
8sXkJ 154 9,59304
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_31882
4GC3T
2 45,2% 137 1.024E-29
2 phalp2_2234
4kUHU
1 37,5% 136 9.721E-27
3 phalp2_32294
7uqm
2 32,8% 134 3.748E-22
4 phalp2_39937
1pBu8
6 17,4% 132 5.236E-09
5 phalp2_5861
5xwUs
3 20,2% 163 2.530E-07

Domains

Domains
Disordered region
SLT
Representative sequence (used for alignment): 154un (213 AA)
Member sequence: 87tCp (141 AA)
1 213 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (154un) rather than this protein.
PDB ID
154un
Method AlphaFoldv2
Resolution 73.27
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50