Protein

Protein accession
6cIc0 [EnVhog]
Representative
5Lgvv
Source
EnVhog (cluster: phalp2_23382)
Protein name
6cIc0
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKKLIDVSSYNGTVNWEKAKTYGCQGTILKIIRKDLKIDNGFNRNYQACNKNKIGWGVYNYSYATTAAKAKSDMKLVCDILDKIDKSSFKYGVWFDIEDKVQASLSKAKIAEIINAAQQVVEERG
Physico‐chemical
properties
protein length:125 AA
molecular weight:14127,1 Da
isoelectric point:9,28
hydropathy:-0,45
Representative Protein Details
Accession
5Lgvv
Protein name
5Lgvv
Sequence length
128 AA
Molecular weight
14625,60690 Da
Isoelectric point
8,89820
Sequence
MRKLIDVSSYNGTVNWEKAKAYGCQGAILKIIRKDLKIDNGFNRNYQACNENELAWGVYNYSYAATATKAKSDMKLVCDILDKIDKTHFVYGVWFDIEDKVQASLNKTKIAEIINAAQQVVEKRGYLF
Other Proteins in cluster: phalp2_23382
Total (incl. this protein): 10 Avg length: 171,3 Avg pI: 8,72

Protein ID Length (AA) pI
5Lgvv 128 8,89820
1nur2 135 7,62497
5Pew7 128 7,62958
6b1U7 144 9,05222
6g7PW 165 9,18560
6taHi 128 7,62958
7MQkB 129 8,73013
A0A8S5P9Z7 316 9,64365
A0A8S5SNY0 315 9,55668
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_15198
D6Gl
21 46,4% 84 6.709E-27
2 phalp2_4390
2mvmf
118 32,2% 124 1.261E-26
3 phalp2_33300
6aZoh
2 34,8% 129 7.637E-25
4 phalp2_3380
3qQTc
8 31,3% 102 1.485E-21
5 phalp2_24879
a1g4
16 31,5% 133 3.825E-21
6 phalp2_25924
66tGM
6 34,3% 102 2.306E-19
7 phalp2_10753
3yWC4
1 25,3% 130 1.901E-17
8 phalp2_34287
3mtX2
6 34,3% 96 9.181E-17
9 phalp2_11218
6q99m
1 29,1% 127 3.235E-16
10 phalp2_31214
88VlE
13 33,3% 84 2.930E-15

Domains

Domains
Representative sequence (used for alignment): 5Lgvv (128 AA)
Member sequence: 6cIc0 (125 AA)
1 128 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5Lgvv) rather than this protein.
PDB ID
5Lgvv
Method AlphaFoldv2
Resolution 97.28
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50